An Algorithm for Unsupervised Transliteration Mining with an Application to Word Alignment


We propose a language-independent method for the automatic extraction of transliteration pairs from parallel corpora. In contrast to previous work, our method uses no form of supervision, and does not require linguistically informed preprocessing. We conduct experiments on data sets from the NEWS 2010 shared task on transliteration mining and achieve an F…

