Text corpus
Known as: Text corpora, Linguistic corpus, Text item
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed…
Wikipedia
Related topics
50 relations
Amarna letter EA 256
Amarna letter EA 365
Amarna letters–localities and their rulers
Amebis
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2010
Syntax-to-Morphology Mapping in Factored Phrase-Based Statistical Machine Translation from English to Turkish
Reyyan Yeniterzi, Kemal Oflazer
Annual Meeting of the Association for…
2010
Corpus ID: 14292100
We present a novel scheme to apply factored phrase-based SMT to a language pair with very disparate morphological structures. Our…
Highly Cited
2009
Improving Translation Lexicon Induction from Monolingual Corpora via Dependency Contexts and Part-of-Speech Equivalences
Nikesh Garera, Chris Callison-Burch, David Yarowsky
Conference on Computational Natural Language…
2009
Corpus ID: 1889871
This paper presents novel improvements to the induction of translation lexicons from monolingual corpora using multilingual…
Highly Cited
2007
Efficient Handling of N-gram Language Models for Statistical Machine Translation
Marcello Federico, M. Cettolo
WMT@ACL
2007
Corpus ID: 603858
Statistical machine translation, as well as other areas of human language processing, have recently pushed toward the use of…
Highly Cited
2007
Une nouvelle approche à l'extraction de lexiques bilingues à partir de corpus comparables
Hervé Déjean, É. Gaussier
2007
Corpus ID: 70271116
We present in this article a new method for automatic extraction of bilingual lexicons from comparable corpora. We first analyze…
Highly Cited
2005
Generating Artificial Corpora for Plan Recognition
Nate Blaylock, James F. Allen
User Modeling
2005
Corpus ID: 10500675
Corpora for training plan recognizers are scarce and difficult to gather from humans. However, corpora could be a boon to plan…
Highly Cited
2002
Patterns and meanings: Using corpora for English language research and teaching. By ALAN PARTINGTON. (Studies in corpus linguistics 2.) Amsterdam & Philadelphia: John Benjamins, 1998
D. Noel
2002
Corpus ID: 165815406
Highly Cited
1998
TELEPHONE SPEECH CORPUS DEVELOPMENT AT CSLU
R. Cole, M. Fanty, M. Noel, T. Lander
1998
Corpus ID: 18082891
This paper describes eight telephone-speech corpora at various stages of development at the Center for Spoken Language…
Review
1989
Book Reviews: Machine Translation: Linguistic Characteristics of MT Systems and General Methodology of Evaluation
R. Mccardell
International Conference on Computational Logic
1989
Corpus ID: 10496230
This book (which has been long in the making!) is a compilation of a large number of papers written over the years (1971-1981…
Highly Cited
1961
The Histology of the Neurosecretory System of the Adult Female Desert Locust, Schistocerca gregaria
K. C. Highnam
1961
Corpus ID: 31721805
The pars intercerebralis of the brain of the desert locust contains about 2,400 cells in two groups, which stain with chrome…
Highly Cited
1960
Fonctions des Corpora allata chez Locusta migratoria (L.)
L. Joly
1960
Corpus ID: 82035844