Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 218,522,505 papers from all fields of science
Search
Sign In
Create Free Account
Text corpus
Known as:
Text corpora
, Linguistic corpus
, Text item
Expand
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
50 relations
Amarna letter EA 256
Amarna letter EA 365
Amarna letters–localities and their rulers
Amebis
Expand
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Review
2011
Review
2011
OCA: Opinion corpus for Arabic
Mohammed Rushdi-Saleh
,
M. Martín-Valdivia
,
L. A. U. López
,
José Manuel Perea Ortega
J. Assoc. Inf. Sci. Technol.
2011
Corpus ID: 16310031
Sentiment analysis is a challenging new task related to text mining and natural language processing. Although there are, at…
Expand
Review
2009
Review
2009
Applying corpus linguistics to pedagogy: a critical evaluation
Lynne Flowerdew
2009
Corpus ID: 11067261
This article reviews and discusses four somewhat contentious issues in the application of corpus linguistics to pedagogy, ESP in…
Expand
Highly Cited
2008
Highly Cited
2008
Anaphoric Annotation in the ARRAU Corpus
Massimo Poesio
,
Ron Artstein
International Conference on Language Resources…
2008
Corpus ID: 1749737
Arrau is a new corpus annotated for anaphoric relations, with information about agreement and explicit representation of multiple…
Expand
Review
2008
Review
2008
Constructions, Chunking, and Connectionism: The Emergence of Second Language Structure
N. Ellis
2008
Corpus ID: 34474804
schema. For a general summary, there are normative descriptions of stages of L2 proficiency that were drawn up in as atheoretical…
Expand
Highly Cited
2006
Highly Cited
2006
Corpus Linguistics and the Web
M. Hundt
,
Nadja Nesselhauf
,
Carolin Biewer
2006
Corpus ID: 58165312
Marianne HUNDT, Nadja NESSELHAUF and Carolin BIEWER: Corpus linguistics and the web Accessing the web as corpus Anke LUDELING…
Expand
Highly Cited
2004
Highly Cited
2004
Corpus Stylistics: Speech, Writing and Thought Presentation in a Corpus of English Writing
E. Semino
,
M. Short
2004
Corpus ID: 60853377
This book represents a new direction at the interface between the fields of stylistics and corpus linguistics, namely the use of…
Expand
Highly Cited
2000
Highly Cited
2000
Automatic Labeling of Semantic Roles Qualifying Exam Proposal
D. Gildea
,
Dan Jurafsky
2000
Corpus ID: 207747200
The problem of linking syntactic constituents of a sentence to semantic roles is an essential part of many natural language…
Expand
Review
1999
Review
1999
Book Reviews: An Introduction to Corpus Linguistics
Vincent B.Y. Ooi
International Conference on Computational Logic
1999
Corpus ID: 56939427
This timely book joins the growing number of leading introductory volumes on corpus linguistics, including McEnery and Wilson…
Expand
Highly Cited
1998
Highly Cited
1998
Generating Natural Language Summaries from Multiple On-Line Sources
Dragomir R. Radev
,
K. McKeown
International Conference on Computational Logic
1998
Corpus ID: 10019526
We present a methodology for summarization of news about current events in the form of briefings that include appropriate…
Expand
Highly Cited
1997
Highly Cited
1997
Automated Text Summarization in SUMMARIST
E. Hovy
,
Chin-Yew Lin
Annual Meeting of the Association for…
1997
Corpus ID: 2521538
SUMMARIST is an attempt to create a robust automated text summarization system, based on the ‘equation’: summarization = topic…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE