Skip to search formSkip to main contentSkip to account menu

Document classification

Known as: Topic spotting, Text categorisation, Classification 
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2019
2019
One of the principal tasks of machine learning with major applications is text classification. This paper focuses on the legal… 
Highly Cited
2009
Highly Cited
2009
Stories of people's everyday experiences have long been the focus of psychology and sociology research, and are increasingly… 
Highly Cited
2009
Highly Cited
2009
Text classification is a supervised technique that uses labelled training data to learn the classification system and then… 
Highly Cited
2007
Highly Cited
2007
We consider feature selection for text classification both theoretically and empirically. Our main result is an unsupervised… 
Review
2006
Review
2006
Nowadays,with the development of Internet and information explosion,automated techniques for analyzing author's attitudes towards… 
Highly Cited
2006
Highly Cited
2006
Spam filtering poses a special problem in text categorization, of which the defining characteristic is that filters face an… 
Highly Cited
2004
Highly Cited
2004
The focus of research in text classification has expanded from simple topic identification to more challenging tasks such as… 
Highly Cited
2001
Highly Cited
2001
This paper describes the application of statistical analysis of large corpora to the problem of extracting semantic relations… 
Highly Cited
1999
Highly Cited
1999
Grouping images into (semantically) meaningful categories using low level visual features is a challenging and important problem… 
Highly Cited
1999
Highly Cited
1999
This paper investigates the effect of prior feature selection in Support Vector Machine (SVM) text categorization. The input…