Statistical Parsing of Spanish and Data Driven Lemmatization

Joseph Le Roux, Benoît Sagot, and Djamé Seddah
SPMRL@ACL 2012
Although parsing performance has greatly improved in recent years, grammar inference from treebanks for morphologically rich languages, especially from small treebanks, is still a challenging task. In this paper we investigate how state-of-the-art parsing performance can be achieved on Spanish, a language with a rich verbal morphology, with a non-lexicalized parser trained on a treebank containing only around 2,800 trees. We rely on accurate part-of-speech tagging and data-driven…
