Skip to search formSkip to main contentSkip to account menu

Document classification

Known as: Topic spotting, Text categorisation, Classification 
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2015
2015
In this work, we address the problem to model all the nodes (words or phrases) in a dependency tree with the dense… 
2010
2010
In this study, we propose a method for encoding documents into string vectors, instead of numerical vectors. A traditional… 
Review
2009
Review
2009
With the proliferation of online reviews and sentiments the Web is becoming more and more useful and important information… 
Highly Cited
2008
Highly Cited
2008
The goal of online event analysis is to detect events and track their associated documents in real time from a continuous stream… 
Highly Cited
2007
Highly Cited
2007
This research proposes a new neural network for text categorization which uses alternative representations of documents to… 
Highly Cited
2004
Highly Cited
2004
This paper presents the clustering algorithm PoBOC (Pole-Based Overlapping Clustering). It has two main characteristics: the… 
Highly Cited
2003
Highly Cited
2003
This paper motivates and presents the Topic-based Vector Space Model (TVSM), a new vector-based approach for document comparison…