Capturing Out-of-Vocabulary Words in Arabic Text

Abdusalam F. A. Nwesri, Seyed M. M. Tahaghoghi, and Falk Scholer
The increasing flow of information between languages has led to a rise in the frequency of non-native or loan words, where terms of one language appear transliterated in another. Dealing with such out-of-vocabulary words is essential for successful cross-lingual information retrieval. For example, techniques such as stemming should not be applied indiscriminately to all words in a collection, so foreign words need to be identified before any stemming takes place. In this paper, we investigate three…
This paper has 19 citations.