Skip to search formSkip to main contentSkip to account menu

Text corpus

Known as: Text corpora, Linguistic corpus, Text item 
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2010
Highly Cited
2010
We present a novel scheme to apply factored phrase-based SMT to a language pair with very disparate morphological structures. Our… 
Highly Cited
2007
Highly Cited
2007
Statistical machine translation, as well as other areas of human language processing, have recently pushed toward the use of… 
Review
2007
Review
2007
The effectiveness of a video retrieval system largely depends on the choice of underlying text and image retrieval components… 
Highly Cited
2005
Highly Cited
2005
In this paper, we present a machine learning system for identifying non-referential it. Types of non-referential it are examined… 
Highly Cited
2004
Highly Cited
2004
Parsing systems which rely on hand-coded linguistic descriptions can only perform adequately in as far as these descriptions are… 
Highly Cited
2000
Highly Cited
2000
The most effective paradigm for word sense disambiguation, supervised learning, seems to be stuck because of the knowledge… 
Review
1989
Review
1989
This book (which has been long in the making!) is a compilation of a large number of papers written over the years (1971-1981… 
Highly Cited
1961
Highly Cited
1961
The pars intercerebralis of the brain of the desert locust contains about 2,400 cells in two groups, which stain with chrome…