Extracting Transliteration Pairs from Comparable Corpora

  title={Extracting Transliteration Pairs from Comparable Corpora},
  author={Slaven Bilac and Hozumi Tanaka},
Transliterating words and names from one language to another is a frequent and highly productive phenomenon. For example, English word cacheis transliterated in Japanese asキャッシュ “kyasshu ”. In many cases, recent transliterations are not recorded in machine readable dictionaries so it is impossible to rely on dictionary lookup to find transliteration equivalents. In this paper we describe a method for extracting transliteration pairs from comparable corpora. Proposed method exploits the… CONTINUE READING
Highly Cited
This paper has 18 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 12 extracted citations


Publications referenced by this paper.
Showing 1-10 of 17 references

Auto - matically harvesting katakanaEnglish term pairs from search engine query logs

G. Kacmarcik, C. Brockett
Proc . of NLPRS • 2001

Bilingual parallel corpora and language engineering,

Harold Somers
inIn Proc. of Workshop on Language Engineering for South-Asian languages • 2001
View 1 Excerpt

Similar Papers

Loading similar papers…