2 Arabic Treebanks and Related Corpora

Abstract

The Linguistic Data Consortium (LDC) has developed hundreds of data corpora for natural language processing (NLP) research. Among these are a number of annotated treebank corpora for Arabic. Typically, these corpora consist of a single collection of annotated documents. NLP research, however, usually requires multiple data sets for the purposes of training…
