Statistical transliteration for english-arabic cross language information retrieval

  title={Statistical transliteration for english-arabic cross language information retrieval},
  author={Nasreen Abdul Jaleel and Leah S. Larkey},
Out of vocabulary (OOV) words are problematic for cross language information retrieval. One way to deal with OOV words when the two languages have different alphabets, is to transliterate the unknown words, that is, to render them in the orthography of the second language. In the present study, we present a simple statistical technique to train an English to Arabic transliteration model from pairs of names. We call this a selected n-gram model because a two-stage training procedure first learns… CONTINUE READING
Highly Influential
This paper has highly influenced 14 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 167 citations. REVIEW CITATIONS
108 Citations
3 References
Similar Papers


Publications citing this paper.

167 Citations

Citations per Year
Semantic Scholar estimates that this publication has 167 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.

Similar Papers

Loading similar papers…