Robust Part-of-speech Tagging of Arabic Text

@inproceedings{Aldarmaki2015RobustPT,
  title={Robust Part-of-speech Tagging of Arabic Text},
  author={Hanan Aldarmaki and Mona T. Diab},
  booktitle={ANLP@ACL},
  year={2015}
}
We present a new and improved part-of-speech tagger for Arabic text that incorporates a set of novel features and constraints. This framework is presented within the MADAMIRA software suite, a state-of-the-art toolkit for Arabic language processing. Starting from a linear SVM model with basic lexical features, we add a range of features derived from morphological analysis and clustering methods. We show that using these features significantly improves part-of-speech tagging accuracy, especially…
