Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 233,434,874 papers from all fields of science
Search
Sign In
Create Free Account
Document layout analysis
Known as:
Document Segmentation
In computer vision, document layout analysis is the process of identifying and categorizing the regions of interest in the scanned image of a text…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
8 relations
Computer vision
Document processing
Image scanner
Layout (computing)
Expand
Broader (1)
Image processing
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2016
2016
Multi-document Topic Segmentation Using Bayesian Estimation
Pedro Mota
,
M. Eskénazi
,
Luísa Coheur
International Computer Science Conference
2016
Corpus ID: 9537027
This paper proposes the use of lexical similarity across different documents in order to improve a topic segmentation task. Given…
Expand
Review
2015
Review
2015
Web Document Segmentation for Better Extraction of Information: A Review
Hassan F. Eldirdiery
,
A. H. Ahmed
2015
Corpus ID: 10576228
This paper reviews the problem of web page segmentation. According to the recent studies, there exist different approaches used…
Expand
2014
2014
PCNN document segmentation method based on bacterial foraging optimization algorithm
Yanping Liao
,
P. Zhang
,
Qiang Guo
,
Jian Wan
CiiT international journal of digital image…
2014
Corpus ID: 7893338
Pulse Coupled Neural Network(PCNN) is widely used in the field of image processing, but it is a difficult task to define the…
Expand
Review
2012
Review
2012
Generic methods for document layout analysis and preprocessing
S. S. Bukhari
2012
Corpus ID: 28555051
Generic layout analysis--process of decomposing document image into homogeneous regions for a collection of diverse document…
Expand
2008
2008
A New and Efficient Algorithm to Binarize Document Images Removing Back-to-Front Interference
J. G. M. Silva
,
R. Lins
,
Fernando Mário Junqueira Martins
,
R. Wachenchauzer
Journal of universal computer science (Online)
2008
Corpus ID: 1918260
Back-to-front interference", "bleeding" and "show-through" is the name given to the phenomenon found whenever documents are…
Expand
2007
2007
NUS at DUC 2007: Using Evolutionary Models of Text
Ziheng Lin
,
Tat-Seng Chua
,
Min-Yen Kan
,
Wee Sun Lee
,
Long Qiu
,
Shiren Ye
2007
Corpus ID: 1595510
This paper presents our new, querybased multi-document summarization system used in DUC 2007. Current graph-based approaches to…
Expand
2004
2004
A Framework for collaborative writing with recording and post-meeting retrieval capabilities
M. Bouamrane
,
David King
,
S. Luz
,
M. Masoodian
2004
Corpus ID: 5803700
From a HCI perspective, elucidating and supporting the context in which collaboration takes place is key to implementing…
Expand
2003
2003
Learning Logic Programs for Layout Analysis Correction
Margherita Berardi
,
Michelangelo Ceci
,
F. Esposito
,
D. Malerba
2003
Corpus ID: 5424941
Layout analysis is the process of extracting a hierarchical structure describing the layout of a page. In the system WISDOM…
Expand
2001
2001
A fast and efficient method for document segmentation for OCR
Boontee Kiuatrachue
,
Phisetphong Suthaphan
Proceedings of IEEE Region 10 International…
2001
Corpus ID: 62691217
This paper describes fast and efficient method for page segmentation of a document containing a nonrectangular block. The…
Expand
1999
1999
Multiscale document segmentation using wavelet-domain hidden Markov models
Hyeokho Choi
,
Richard Baraniuk
Electronic imaging
1999
Corpus ID: 14926978
We introduce a new document image segmentation algorithm, HMTseg, based on wavelets and the hidden Markov tree (HMT) model. The…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE