Lemmatization and Lexicalized Statistical Parsing of Morphologically-Rich Languages: the Case of French

@inproceedings{Seddah2010LemmatizationAL,
  title={Lemmatization and Lexicalized Statistical Parsing of Morphologically-Rich Languages: the Case of French},
  author={Djam{\'e} Seddah and Grzegorz Chrupala and {\"O}zlem {\c{C}}etinoglu and Josef van Genabith and Marie Candito},
  booktitle={SPMRL@NAACL-HLT},
  year={2010}
}
This paper shows that training a lexicalized parser on a lemmatized morphologically-rich treebank such as the French Treebank slightly improves parsing results. We also show that lemmatizing a similarly sized subset of the English Penn Treebank has almost no effect on parsing performance with gold lemmas and leads to a small drop in performance when automatically assigned lemmas and POS tags are used. This highlights two facts: (i) lemmatization helps to reduce lexicon data-sparseness issues…
This paper has 22 citations.
