Noisy text

The noise can be seen as all the differences between the surface form of a coded representation of the text and the intended, correct, or original…

Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.

2018

AutoNet : Automated Network Construction and Exploration System from Domain-Specific Corpora

Jingbo ShangQi Zhu Kaplan
2018
Corpus ID: 52104522

As a collaborative project funded by US Army Research Lab, our goal is to turn massive unstructured text data into structured…

2016

Text Extraction in Document Images: Highlight on Using Corner Points

Vikas YadavN. Ragot
International Workshop on Document Analysis…
2016
Corpus ID: 23866300

During past years, text extraction in document images has been widely studied in the general context of Document Image Analysis…

Review

2015

Review

2015

HOMS: Hindi opinion mining system

Vandana JhaN. ManjunathP. ShenoyK. R. VenugopalL. Patnaik
International Conference on Recent Trends in…
2015
Corpus ID: 3063554

With the increasing popularity of the Web 2.0, we are provided with more documents which express opinions on different issues…

Highly Cited

2015

Highly Cited

2015

ConSent: Context-based sentiment analysis

2013

A LEXICON BASED ALGORITHM FOR NOISY TEXT NORMALIZATION AS PRE-PROCESSING FOR SENTIMENT ANALYSIS

S. RoySourish DharSaprativa BhattacharjeeAnirban DasTriguna Sen
2013
Corpus ID: 17650538

Sentiment analysis in the most general sense refers to the classification of a piece of text into either of the three classes…

2013

Character Recognition Using Conditional Random Field Based Recognition Engine

Anupama RayAnkit ChandawalaS. Chaudhury
IEEE International Conference on Document…
2013
Corpus ID: 26764560

The paper presents a novel script independent CRF based inferencing framework for character recognition. In this framework we…

2012

Entity oriented search and exploration for cultural heritage collections: the EU cultura project

In this paper we describe an entity oriented search and exploration system that we are developing for the EU Cultura project.

2011

Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data

Lipika DeyVenu GovindarajuD. LoprestiP. NatarajanChristoph RinglstetterShourya Roy
2011
Corpus ID: 56864395

It is our great pleasure to welcome all participants to the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy…

2010

TOWARDS A PRE-PROCESSING SYSTEM FOR CASUAL ENGLISH ANNOTATED WITH LINGUISTIC AND CULTURAL INFORMATION

Eleanor ClarkT. RobertsK. ArakiComput Intelligence
2010
Corpus ID: 55834675

We present a preliminary revision of a text processing system, CECS (Casual English Conversion System) the purpose of which is to…

2004

Noisy Text Clustering

David GrangierA. Vinciarelli
2004
Corpus ID: 5714270

This work presents document clustering experiments performed over noisy texts (i.e. text that have been extracted through an…

Noisy text

Related topics

Broader (1)

Papers overview