Skip to search formSkip to main content
You are currently offline. Some features of the site may not work correctly.

Text corpus

Known as: Text corpora, Linguistic corpus, Text item 
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed… Expand
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Review
2020
Review
2020
With the development of high computational devices, deep neural networks (DNNs), in recent years, have gained significant… Expand
  • figure 1
  • table 1
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Review
2016
Review
2016
ASBTRACT In order to effectively analyze qualitative data one must use a systematic process to organize and highlight meaning… Expand
Is this relevant?
Highly Cited
2006
Highly Cited
2006
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and knowledge-based measures of… Expand
  • table 1
  • table 2
  • table 3
Is this relevant?
Highly Cited
2005
Highly Cited
2005
We collected a corpus of parallel text in 11 languages from the proceedings of the European Parliament, which are published on… Expand
  • figure 1
  • table 1
  • table 2
  • figure 3
  • table 3
Is this relevant?
Highly Cited
2004
Highly Cited
2004
John Sinclair is one of the major figures in applied linguistics and his work is essential study for students. This accessible… Expand
Is this relevant?
Highly Cited
2003
Highly Cited
2003
We have collected a corpus of data from natural meetings that occurred at the International Computer Science Institute (ICSI) in… Expand
  • figure 1
  • figure 2
Is this relevant?
Highly Cited
1997
Highly Cited
1997
This paper presents a new approach for measuring semantic similarity/distance between words and concepts. It combines a lexical… Expand
  • figure 1
  • table 1
  • table 2
  • table 3
  • figure 2
Is this relevant?
Highly Cited
1996
Highly Cited
1996
Many corpus-based natural language processing systems rely on text corpora that have been manually annotated with syntactic or… Expand
  • figure 2
  • figure 3
  • table 3
  • table 1
  • table 2
Is this relevant?
Highly Cited
1993
Highly Cited
1993
Abstract : As a result of this grant, the researchers have now published oil CDROM a corpus of over 4 million words of running… Expand
  • table 1
  • table 2
  • table 3
  • table 4
Is this relevant?
Highly Cited
1992
Highly Cited
1992
We describe a method for the automatic acquisition of the hyponymy lexical relation from unrestricted text. Two goals motivate… Expand
  • figure 1
  • figure 2
Is this relevant?