Text corpus
Known as: Text corpora, Linguistic corpus, Text item
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed…
Wikipedia
Related topics
50 relations
Amarna letter EA 256
Amarna letter EA 365
Amarna letters–localities and their rulers
Amebis
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2010
Syntax-to-Morphology Mapping in Factored Phrase-Based Statistical Machine Translation from English to Turkish
Reyyan Yeniterzi, Kemal Oflazer
Annual Meeting of the Association for…
2010
Corpus ID: 14292100
We present a novel scheme to apply factored phrase-based SMT to a language pair with very disparate morphological structures. Our…
Highly Cited
2007
Efficient Handling of N-gram Language Models for Statistical Machine Translation
Marcello Federico, M. Cettolo
WMT@ACL
2007
Corpus ID: 603858
Statistical machine translation, as well as other areas of human language processing, have recently pushed toward the use of…
Highly Cited
2000
Exploring Automatic Word Sense Disambiguation with Decision Lists and the Web
Eneko Agirre, David Martínez
SAIC@COLING
2000
Corpus ID: 1238985
The most effective paradigm for word sense disambiguation, supervised learning, seems to be stuck because of the knowledge…
Highly Cited
2000
Dependency Treebank for Russian: Concept, Tools, Types of Information
I. Boguslavsky, S. Grigorieva, N. Grigoriev, Leonid Kreidlin, Nadezhda Frid
International Conference on Computational…
2000
Corpus ID: 5113236
The paper describes a tagging scheme designed for the Russian Treebank, and presents tools used for corpus creation.
Review
1993
Book Reviews: Statistically-Driven Computer Grammars of English: The IBM/Lancaster Approach
Dekai Wu
International Conference on Computational Logic
1993
Corpus ID: 17387777
Statistical computational linguistics is entering a consolidation phase, signaled by the appearance of book-length tracts devoted…
Review
1989
Book Reviews: Machine Translation: Linguistic Characteristics of MT Systems and General Methodology of Evaluation
R. Mccardell
International Conference on Computational Logic
1989
Corpus ID: 10496230
This book (which has been long in the making!) is a compilation of a large number of papers written over the years (1971-1981…
Highly Cited
1976
Anatomical organization of the corpus striatum and related nuclei.
Carpenter Mb
1976
Corpus ID: 88731448
Highly Cited
1970
Experiments in automatic extracting and indexing
L. Earl
Information Storage and Retrieval
1970
Corpus ID: 40114089
Highly Cited
1961
The Histology of the Neurosecretory System of the Adult Female Desert Locust, Schistocerca gregaria
K. C. Highnam
1961
Corpus ID: 31721805
The pars intercerebralis of the brain of the desert locust contains about 2,400 cells in two groups, which stain with chrome…
Highly Cited
1937
GRAVIMETRIC METHOD FOR THE DETERMINATION OF SODIUM PREGNANDIOL GLUCURONIDATE (AN EXCRETION PRODUCT OF PROGESTERONE)
E. Venning
1937
Corpus ID: 6783082