Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 218,356,357 papers from all fields of science
Search
Sign In
Create Free Account
Noisy text
The noise can be seen as all the differences between the surface form of a coded representation of the text and the intended, correct, or original…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
15 relations
Broader (1)
Coding theory
Data quality
Email
Grammar checker
Jargon
Expand
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2018
Highly Cited
2018
MTNT: A Testbed for Machine Translation of Noisy Text
Paul Michel
,
Graham Neubig
Conference on Empirical Methods in Natural…
2018
Corpus ID: 52155427
Noisy or non-standard input text can cause disastrous mistranslations in most modern Machine Translation (MT) systems, and there…
Expand
2016
2016
Semi-supervised Named Entity Recognition in noisy-text
Shubhanshu Mishra
,
Jana Diesner
NUT@COLING
2016
Corpus ID: 13402028
Many of the existing Named Entity Recognition (NER) solutions are built based on news corpus data with proper syntax. These…
Expand
2016
2016
Text Extraction in Document Images: Highlight on Using Corner Points
Vikas Yadav
,
N. Ragot
International Workshop on Document Analysis…
2016
Corpus ID: 23866300
During past years, text extraction in document images has been widely studied in the general context of Document Image Analysis…
Expand
2014
2014
Correcting Grammatical Verb Errors
Alla Rozovskaya
,
D. Roth
,
Vivek Srikumar
Conference of the European Chapter of the…
2014
Corpus ID: 5223238
Verb errors are some of the most common mistakes made by non-native writers of English but some of the least studied. The reason…
Expand
Review
2012
Review
2012
Information Extraction from Text
Jing Jiang
Mining Text Data
2012
Corpus ID: 948803
Information extraction is the task of finding structured information from unstructured or semi-structured text. It is an…
Expand
Highly Cited
2010
Highly Cited
2010
Arabic Dialect Handling in Hybrid Machine Translation
H. Sawaf
Conference of the Association for Machine…
2010
Corpus ID: 37464800
In this paper, we describe an extension to a hybrid machine translation system for handling dialect Arabic, using a decoding…
Expand
Highly Cited
2009
Highly Cited
2009
Learning to recognize webpage genres
Ioannis Kanaris
,
E. Stamatatos
Information Processing & Management
2009
Corpus ID: 1471859
2008
2008
Adapting a WSJ-Trained Parser to Grammatically Noisy Text
Jennifer Foster
,
Joachim Wagner
,
Josef van Genabith
Annual Meeting of the Association for…
2008
Corpus ID: 34007
We present a robust parser which is trained on a treebank of ungrammatical sentences. The treebank is created automatically by…
Expand
Review
2003
Review
2003
Automating survey coding by multiclass text categorization techniques
D. Giorgetti
,
F. Sebastiani
J. Assoc. Inf. Sci. Technol.
2003
Corpus ID: 9547996
Survey coding is the task of assigning a symbolic code from a predefined set of such codes to the answer given in response to an…
Expand
1994
1994
Open Problems in "Systems That Learn"
Mark A. Fulk
,
Sanjay Jain
,
D. Osherson
Journal of computer and system sciences (Print)
1994
Corpus ID: 1139305
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE