The Penn Arabic Treebank : Building a Large-Scale Annotated Arabic Corpus

@inproceedings{Maamouri2004ThePA,
  title={The Penn Arabic Treebank : Building a Large-Scale Annotated Arabic Corpus},
  author={Mohamed Maamouri and Ann Bies and Tim Buckwalter and Wigdan Mekki},
  year={2004}
}
From our three year experience of developing a large-scale corpus of annotated Arabic text, our paper will address the following: (a) review pertinent Arabic language issues as they relate to methodology choices, (b) explain our choice to use the Penn English Treebank style of guidelines, (requiring the Arabic-speaking annotators to deal with a new grammatical system) rather than doing the annotation in a more traditional Arabic grammar style (requiring NLP researchers to deal with a new system… CONTINUE READING

Tables and Topics from this paper.

Citations

Publications citing this paper.
SHOWING 1-10 OF 242 CITATIONS, ESTIMATED 29% COVERAGE

FILTER CITATIONS BY YEAR

2005
2019

CITATION STATISTICS

  • 71 Highly Influenced Citations

  • Averaged 22 Citations per year over the last 3 years

References

Publications referenced by this paper.
SHOWING 1-10 OF 22 REFERENCES

Resources for Arabic Natural Language Processing at the Linguistic Data Consortium

  • M. Maamouri, C. Cieri
  • Proceedings of the International Symposium on…
  • 2002
Highly Influential
3 Excerpts

Arabic Treebank: Part 2 v 2.0. Linguistic Data Consortium, catalog number LDC2004T02, ISBN: 158563-282-1

  • M. Maamouri, A. Bies, T. Buckwalter, H. Jin
  • 2004
Highly Influential
1 Excerpt

Arabic Treebank: Part 3 v 1.0. Linguistic Data Consortium, catalog number LDC2004T11, ISBN: 158563-298-8

  • M. Maamouri, A. Bies, T. Buckwalter, H. Jin
  • 2004
Highly Influential
1 Excerpt

Arabic Treebank: Part 3(a) v 1.1. Linguistic Data Consortium, catalog number LDC2004E71

  • M. Maamouri, A. Bies, T. Buckwalter, H. Jin
  • 2004
Highly Influential
1 Excerpt

Arabic Treebank: Part 1 v 2.0. Linguistic Data Consortium, catalog number LDC2003T06, ISBN: 158563-261-9

  • M. Maamouri, A. Bies, H. Jin, T. Buckwalter
  • 2003
Highly Influential
1 Excerpt

Buckwalter Arabic Morphological Analyzer Version 1.0. Linguistic Data Consortium, catalog number LDC2002L49, ISBN 1-58563-257-0

  • T. Buckwalter
  • 2002
Highly Influential
4 Excerpts

Propbanking in Parallel

  • P. Kingsbury, N. Xue, M. Palmer
  • Proceedings of the Workshop on the Amazing…
  • 2004

Similar Papers

Loading similar papers…