2 Arabic Treebanks and Related Corpora


The Linguistic Data Consortium (LDC) has developed hundreds of data corpora for natural language processing (NLP) research. Among these are a number of annotated treebank corpora for Arabic. Typically, these corpora consist of a single collection of annotated documents. NLP research, however, usually requires multiple data sets for the purposes of training… (More)

10 Figures and Tables


  • Presentations referencing similar topics