Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 219,937,492 papers from all fields of science
Search
Sign In
Create Free Account
Document classification
Known as:
Topic spotting
, Text categorisation
, Classification
Expand
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
48 relations
Artificial neural network
Categorization
Concept mining
Controlled vocabulary
Expand
Broader (2)
Machine learning
Natural language processing
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2019
2019
Long-length Legal Document Classification
Lulu Wan
,
G. Papageorgiou
,
M. Seddon
,
Mirko Bernardoni
arXiv.org
2019
Corpus ID: 209376473
One of the principal tasks of machine learning with major applications is text classification. This paper focuses on the legal…
Expand
Highly Cited
2009
Highly Cited
2009
Identifying Personal Stories in Millions of Weblog Entries
A. Gordon
,
Reid Swanson
International Conference on Web and Social Media
2009
Corpus ID: 5920196
Stories of people's everyday experiences have long been the focus of psychology and sociology research, and are increasingly…
Expand
Highly Cited
2009
Highly Cited
2009
Naïve Bayesian Based on Chi Square to Categorize Arabic Data
F. Thabtah
,
Mohammad Ali
2009
Corpus ID: 17700069
Text classification is a supervised technique that uses labelled training data to learn the classification system and then…
Expand
Highly Cited
2007
Highly Cited
2007
Feature selection methods for text classification
Anirban Dasgupta
,
P. Drineas
,
Boulos Harb
,
V. Josifovski
,
Michael W. Mahoney
Knowledge Discovery and Data Mining
2007
Corpus ID: 7566182
We consider feature selection for text classification both theoretically and empirically. Our main result is an unsupervised…
Expand
Review
2006
Review
2006
Semantic Orientation Computing Based on HowNet
Li-de Wu
2006
Corpus ID: 63010849
Nowadays,with the development of Internet and information explosion,automated techniques for analyzing author's attitudes towards…
Expand
Highly Cited
2006
Highly Cited
2006
Spam Filtering Using Statistical Data Compression Models
Andrej Bratko
,
G. Cormack
,
B. Filipič
,
T. Lynam
,
B. Zupan
Journal of machine learning research
2006
Corpus ID: 916091
Spam filtering poses a special problem in text categorization, of which the defining characteristic is that filters face an…
Expand
Highly Cited
2004
Highly Cited
2004
A Boosting Algorithm for Classification of Semi-Structured Text
Taku Kudo
,
Yuji Matsumoto
Conference on Empirical Methods in Natural…
2004
Corpus ID: 561128
The focus of research in text classification has expanded from simple topic identification to more challenging tasks such as…
Expand
Highly Cited
2001
Highly Cited
2001
Learning Relations Using Collocations
Gerhard Heyer
,
Martin Läuter
,
U. Quasthoff
,
T. Wittig
,
Christian Wolff
Workshop on Ontology Learning
2001
Corpus ID: 5051408
This paper describes the application of statistical analysis of large corpora to the problem of extracting semantic relations…
Expand
Highly Cited
1999
Highly Cited
1999
Content-based hierarchical classification of vacation images
Aditya Vailaya
,
Mário A. T. Figueiredo
,
Anil K. Jain
,
HongJiang Zhang
Proceedings IEEE International Conference on…
1999
Corpus ID: 8262367
Grouping images into (semantically) meaningful categories using low level visual features is a challenging and important problem…
Expand
Highly Cited
1999
Highly Cited
1999
Feature Selection in SVM Text Categorization
Hirotoshi Taira
,
M. Haruno
AAAI/IAAI
1999
Corpus ID: 3142786
This paper investigates the effect of prior feature selection in Support Vector Machine (SVM) text categorization. The input…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE