Noisy text

The noise can be seen as all the differences between the surface form of a coded representation of the text and the intended, correct, or original…
Wikipedia

Papers overview

2017
We address the task of Named Entity Disambiguation (NED) for noisy text. We present WikilinksNED, a large-scale NED dataset of…
2014
We present a new general and language-independent approach to the noisy text correction problem developed and implemented in the…
2012
In this chapter, we will illustrate how to use a combination of knowledge-based natural language processing and…
2011
The amount of data produced in user-generated content continues to grow at a staggering rate. However, the text found in these…
2010
In this paper we look at the problem of cleansing noisy text using a statistical machine translation model. Noisy text is…
Highly Cited
2009
The proliferation of the Internet has not only led to the generation of huge volumes of unstructured information in the form of web…
2008
Identification of named entities such as person, organization and product names from text is an important task in information…
2008
We present a robust parser which is trained on a treebank of ungrammatical sentences. The treebank is created automatically by…
2004
This work presents document clustering experiments performed over noisy texts (i.e. texts that have been extracted through an…
2003
Existing techniques for tokenisation and sentence boundary identification are extremely accurate when the data is perfectly clean…