Document classification
Known as: Topic spotting, Text categorisation, Classification
Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a…
Wikipedia
Related topics
48 relations
Artificial neural network
Categorization
Concept mining
Controlled vocabulary
Broader (2)
Machine learning
Natural language processing
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2013
Finding Opinion Strength Using Rule-Based Parsing for Arabic Sentiment Analysis
Shereen Oraby, Y. El-Sonbaty, M. A. El-Nasr
Mexican International Conference on Artificial…
2013
Corpus ID: 14091437
With increasing interest in sentiment analysis research and opinionated web content always on the rise, focus on analysis of text…
Review
2010
Automated assessment of ESOL free text examinations
Ted Briscoe, Ben Medlock, Øistein E. Andersen
2010
Corpus ID: 16253657
In this report, we consider the task of automated assessment of English as a Second Language (ESOL) examination scripts written…
2009
Training Data Cleaning for Text Classification
Andrea Esuli, F. Sebastiani
International Conference on the Theory of…
2009
Corpus ID: 14123558
In text classification (TC) and other tasks involving supervised learning, labelled data may be scarce or expensive to obtain…
2008
Arabic Text Classification using K-NN and Naive Bayes
Mohammed J. Bawaneh, M. Alkoffash, A. I. A. Rabea
2008
Corpus ID: 60521106
2008
An Extended Document Frequency Metric for Feature Selection in Text Categorization
Yan Xu, Bin Wang, Jintao Li, Hongfang Jing
Asia Information Retrieval Symposium
2008
Corpus ID: 15709610
Feature selection plays an important role in text categorization. Many sophisticated feature selection methods such as…
Highly Cited
2005
Domain Kernels for Text Categorization
A. Gliozzo, C. Strapparava
Conference on Computational Natural Language…
2005
Corpus ID: 6006592
In this paper we propose and evaluate a technique to perform semi-supervised learning for Text Categorization. In particular we…
Highly Cited
2005
Categorization Rehab Duwairi Department of Computer Information Systems, Jordan University of Science and Technology, Jordan
R. Duwairi
2005
Corpus ID: 15302159
In this paper, we compare the performance of three classifiers for Arabic text categorization. In particular, the naïve Bayes, k…
2004
Spontaneous handwriting recognition and classification
A. Rossi, Alfons Juan-Císcar, E. Vidal
Proceedings of the 17th International Conference…
2004
Corpus ID: 13386034
Finite-state models are used to implement a handwritten text recognition and classification system for a real application…
2003
A Corpus-Independent Feature Set for Style-Based Text Categorization
Moshe Koppel, Navot Akiva, Ido Dagan
2003
Corpus ID: 14441055
We suggest a corpus-independent feature set appropriate for style-based text categorization problems. To achieve this, we…
2003
Automatic Keyword Extraction for News Finder
J. Martínez-Fernández, Ana M. García-Serrano, Paloma Martínez, Julio Villena-Román
Adaptive Multimedia Retrieval
2003
Corpus ID: 87937
Newspapers are one of the most challenging domains for information retrieval systems: new articles appear everyday written in…