Part-of-speech tagging of Modern Hebrew text

Authors: Roy Bar-Haim, Khalil Sima'an, Yoad Winter
Journal: Natural Language Engineering
Words in Semitic texts often consist of a concatenation of word segments, each corresponding to a part-of-speech (POS) category. Semitic words may be ambiguous with regard to their segmentation as well as to the POS tags assigned to each segment. When designing POS taggers for Semitic languages, a major architectural decision concerns the choice of the atomic input tokens (terminal symbols). If the tokenization is at the word level, the output tags must be complex, and represent both the…
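
The contrast between word-level and segment-level tokens can be illustrated with a small sketch. It is not taken from the paper: the toy lexicon, the transliteration "bcl" (for a Hebrew string that can be read either as the noun "onion" or as the preposition "in" followed by the noun "shadow"), the tag names NOUN and PREP, and both function names are assumptions made purely for illustration.

```python
# Hypothetical toy lexicon (an assumption, not data from the paper).
# Each word maps to its candidate analyses; an analysis is a list of
# (segment, tag) pairs.
ANALYSES = {
    "bcl": [
        [("bcl", "NOUN")],                # one segment: "onion"
        [("b", "PREP"), ("cl", "NOUN")],  # two segments: "in" + "shadow"
    ],
}


def word_level_tags(word):
    """With whole words as tokens, each analysis becomes one composite tag."""
    return ["+".join(tag for _, tag in analysis) for analysis in ANALYSES[word]]


def segment_level_sequences(word):
    """With segments as tokens, each analysis is a sequence of simple tags."""
    return [list(analysis) for analysis in ANALYSES[word]]


if __name__ == "__main__":
    print(word_level_tags("bcl"))
    print(segment_level_sequences("bcl"))
```

In this sketch the word-level tagger chooses among composite labels such as PREP+NOUN, so the tag set grows with every possible segment combination, while the segment-level tagger keeps simple tags but has to pick a segmentation as well.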
