Document classification

Known as: Topic spotting, Text categorisation, Classification 
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a… 
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2016
We propose a hierarchical attention network for document classification. Our model has two distinctive characteristics: (i) it… 
Highly Cited
2014
Many machine learning algorithms require the input to be represented as a fixed-length feature vector. When it comes to texts… 
Review
2010
With the increasing availability of electronic documents and the rapid growth of the World Wide Web, the task of automatic… 
Review
2010
A major assumption in many machine learning and data mining algorithms is that the training and future data must be in the same… 
Highly Cited
2002
We implemented versions of the SVM appropriate for one-class classification in the context of information retrieval. The… 
Highly Cited
2001
Highly Cited
2000
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training… 
Highly Cited
2000
From the publisher: This is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning… 
Highly Cited
2000
In this paper we present a simple linear-time centroid-based document classification algorithm, that despite its simplicity and… 
Highly Cited
1998
We investigate four different classification methods for document classification: the naive Bayes classifier, nearest neighbor…