
Text corpus

Known as: Text corpora, Linguistic corpus, Text item 
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed… 
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2006
We describe a method for discovering irregularities in temporal mood patterns appearing in a large corpus of blog posts, and… 
Highly Cited
2006
In this paper we report our work on building a POS tagger for a morphologically rich language, Hindi. The theme of the research… 
Highly Cited
2004
This paper introduces the SpamBayes classification engine and outlines the most important features and techniques which… 
Highly Cited
2000
Statistical part-of-speech (POS) taggers achieve high accuracy and robustness when based on large scale manually tagged corpora… 
Highly Cited
2000
The most effective paradigm for word sense disambiguation, supervised learning, seems to be stuck because of the knowledge… 
Highly Cited
1981
We propose a vision of the structure of knowledge and processes of learning based upon the particularity of experience. Highly… 
Highly Cited
1961
ALMOST two decades have elapsed since Abramowitz et al.1 first reported the occurrence of a ‘diabetogenic’ factor in the sinus…