Document classification

Known as: Topic spotting, Text categorisation, Classification 
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a… 
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2013
With increasing interest in sentiment analysis research and opinionated web content always on the rise, focus on analysis of text… 
Review
2010
In this report, we consider the task of automated assessment of English as a Second Language (ESOL) examination scripts written… 
2009
In text classification (TC) and other tasks involving supervised learning, labelled data may be scarce or expensive to obtain… 
2008
Feature selection plays an important role in text categorization. Many sophisticated feature selection methods such as… 
Highly Cited
2005
In this paper we propose and evaluate a technique to perform semi-supervised learning for Text Categorization. In particular we… 
Highly Cited
2005
In this paper, we compare the performance of three classifiers for Arabic text categorization. In particular, the naïve Bayes, k… 
2004
Finite-state models are used to implement a handwritten text recognition and classification system for a real application… 
2003
We suggest a corpus-independent feature set appropriate for style-based text categorization problems. To achieve this, we… 
2003
Newspapers are one of the most challenging domains for information retrieval systems: new articles appear every day written in…