
Text corpus

Known as: Text corpora, Linguistic corpus, Text item 
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed… 
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2010
We present a novel scheme to apply factored phrase-based SMT to a language pair with very disparate morphological structures. Our… 
Highly Cited
2007
Statistical machine translation, as well as other areas of human language processing, have recently pushed toward the use of… 
Highly Cited
2000
The most effective paradigm for word sense disambiguation, supervised learning, seems to be stuck because of the knowledge… 
Highly Cited
2000
The paper describes a tagging scheme designed for the Russian Treebank, and presents tools used for corpus creation. 
Review
1993
Statistical computational linguistics is entering a consolidation phase, signaled by the appearance of book-length tracts devoted… 
Review
1989
This book (which has been long in the making!) is a compilation of a large number of papers written over the years (1971-1981… 
Highly Cited
1976
Highly Cited
1961
The pars intercerebralis of the brain of the desert locust contains about 2,400 cells in two groups, which stain with chrome…