Robust Part-of-speech Tagging of Arabic Text

Hanan Aldarmaki and Mona T. Diab
We present a new and improved part-of-speech tagger for Arabic text that incorporates a set of novel features and constraints. This framework is presented within the MADAMIRA software suite, a state-of-the-art toolkit for Arabic language processing. Starting from a linear SVM model with basic lexical features, we add a range of features derived from morphological analysis and clustering methods. We show that using these features significantly improves part-of-speech tagging accuracy, especially…
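To make the idea of "basic lexical features" plus cluster-derived features concrete, here is a minimal sketch of a per-token feature extractor. It is not the authors' feature set: the abstract does not list the actual templates, so the function name `token_features`, the feature names, and the `clusters` lookup table are all assumptions chosen for illustration. Morphological-analysis features are left out because they would come from an external analyzer.

```python
from typing import Dict, List, Optional


def token_features(
    tokens: List[str],
    i: int,
    clusters: Optional[Dict[str, str]] = None,
) -> Dict[str, str]:
    """Build a sparse feature dict for tokens[i].

    Illustrative only: these templates are assumed, not taken from the paper.
    """
    word = tokens[i]
    feats = {
        # Basic lexical features: the word itself and short affixes.
        "word": word,
        "prefix2": word[:2],
        "suffix2": word[-2:],
        # Neighbouring words, with boundary markers at the sentence edges.
        "prev": tokens[i - 1] if i > 0 else "<s>",
        "next": tokens[i + 1] if i + 1 < len(tokens) else "</s>",
    }
    if clusters is not None:
        # Hypothetical word-to-cluster-ID map from some clustering method.
        feats["cluster"] = clusters.get(word, "<unk>")
    return feats
```

Feature dicts of this shape could then be one-hot encoded and passed to a linear classifier such as a linear SVM, which is the kind of model the abstract starts from.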
