Skip to search formSkip to main contentSkip to account menu
You are currently offline. Some features of the site may not work correctly.

Document classification

Known as: Topic spotting, Text categorisation, Classification 
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a… Expand
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2016
Highly Cited
2016
We propose a hierarchical attention network for document classification. Our model has two distinctive characteristics: (i) it… Expand
  • figure 2
  • table 1
  • table 2
  • figure 3
  • figure 4
Highly Cited
2014
Highly Cited
2014
Many machine learning algorithms require the input to be represented as a fixed-length feature vector. When it comes to texts… Expand
  • figure 1
  • figure 2
  • figure 3
  • table 1
  • table 2
Highly Cited
2011
Highly Cited
2011
Machine learning approaches to multi-label document classification have to date largely relied on discriminative modeling… Expand
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • table 1
Review
2010
Review
2010
With the increasing availability of electronic documents and the rapid growth of the World Wide Web, the task of automatic… Expand
  • figure 1
  • table 2
  • figure 3
  • figure 5
  • figure 6
Review
2010
Review
2010
A major assumption in many machine learning and data mining algorithms is that the training and future data must be in the same… Expand
  • figure 1
  • table 1
  • table 2
  • figure 2
  • table 4
Highly Cited
2004
Highly Cited
2004
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training… Expand
  • figure 1
  • table 1
  • figure 2
  • figure 3
  • table 3
Highly Cited
2002
Highly Cited
2002
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to… Expand
  • figure 1
  • figure 2
  • table 1
  • table 2
  • figure 3
Highly Cited
2001
Highly Cited
2001
We implemented versions of the SVM appropriate for one-class classification in the context of information retrieval. The… Expand
  • table 1
  • figure 1
  • figure 2
  • figure 3
  • table 2
Highly Cited
2000
Highly Cited
2000
From the publisher: This is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning… Expand
Highly Cited
2000
Highly Cited
2000
In this paper we present a simple linear-time centroid-based document classification algorithm, that despite its simplicity and… Expand
  • table 1
  • table 2
  • table 3
  • table 4
  • figure 1