Skip to search formSkip to main contentSkip to account menu

Text corpus

Known as: Text corpora, Linguistic corpus, Text item 
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2007
Highly Cited
2007
A large number of different tags, limited corpora and the free word order are the main causes of low accuracy of tagging in… 
Highly Cited
2007
Highly Cited
2007
Statistical machine translation, as well as other areas of human language processing, have recently pushed toward the use of… 
Highly Cited
2002
Highly Cited
2002
This paper describes the evaluation methodology and results of the 2001 DARPA Communicator evaluation. The experiment spanned 6… 
Highly Cited
2000
Highly Cited
2000
Statistical part-of-speech (POS) taggers achieve high accuracy and robustness when based on large scale manually tagged corpora… 
Highly Cited
2000
Highly Cited
2000
The most effective paradigm for word sense disambiguation, supervised learning, seems to be stuck because of the knowledge… 
Review
1989
Review
1989
This book (which has been long in the making!) is a compilation of a large number of papers written over the years (1971-1981… 
Highly Cited
1981
Highly Cited
1981
We propose a vision of the structure of knowledge and processes of learning based upon the particularity of experience. Highly… 
Highly Cited
1961
Highly Cited
1961
The pars intercerebralis of the brain of the desert locust contains about 2,400 cells in two groups, which stain with chrome…