Skip to search formSkip to main contentSkip to account menu

Text corpus

Known as: Text corpora, Linguistic corpus, Text item 
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2010
Highly Cited
2010
We present a novel scheme to apply factored phrase-based SMT to a language pair with very disparate morphological structures. Our… 
Highly Cited
2009
Highly Cited
2009
This paper presents novel improvements to the induction of translation lexicons from monolingual corpora using multilingual… 
Highly Cited
2007
Highly Cited
2007
Statistical machine translation, as well as other areas of human language processing, have recently pushed toward the use of… 
Highly Cited
2007
Highly Cited
2007
We present in this article a new method for automatic extraction of bilingual lexicons from comparable corpora. We first anaylze… 
Highly Cited
2005
Highly Cited
2005
Corpora for training plan recognizers are scarce and difficult to gather from humans. However, corpora could be a boon to plan… 
Highly Cited
1998
Highly Cited
1998
This paper describes eight telephone-speech corpora at various stages of development at the Center for Spoken Language… 
Review
1989
Review
1989
This book (which has been long in the making!) is a compilation of a large number of papers written over the years (1971-1981… 
Highly Cited
1961
Highly Cited
1961
The pars intercerebralis of the brain of the desert locust contains about 2,400 cells in two groups, which stain with chrome… 
Highly Cited
1960