Skip to search formSkip to main contentSkip to account menu

Document classification

Known as: Topic spotting, Text categorisation, Classification 
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2015
2015
In this work, we address the problem to model all the nodes (words or phrases) in a dependency tree with the dense… 
2013
2013
With increasing interest in sentiment analysis research and opinionated web content always on the rise, focus on analysis of text… 
Review
2010
Review
2010
In this report, we consider the task of automated assessment of English as a Second Language (ESOL) examination scripts written… 
2009
2009
In text classification (TC) and other tasks involving supervised learning, labelled data may be scarce or expensive to obtain… 
Highly Cited
2005
Highly Cited
2005
In this paper we propose and evaluate a technique to perform semi-supervised learning for Text Categorization. In particular we… 
Highly Cited
2005
Highly Cited
2005
In this paper, we compare the performance of three classifiers for Arabic text categorization. In particular, the naïve Bayes, k… 
2003
2003
We suggest a corpus-independent feature set appropriate for style-based text categorization problems. To achieve this, we… 
2003
2003
Newspapers are one of the most challenging domains for information retrieval systems: new articles appear everyday written in… 
1999
1999
This paper proposes an approach to full parsing suitable for Information Extraction from texts. Sequences of cascades of rules…