Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 222,814,675 papers from all fields of science
Search
Sign In
Create Free Account
OCRopus
Known as:
OCRopus (software)
OCRopus is a free document analysis and optical character recognition (OCR) system released under the Apache License, Version 2.0 with a very modular…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
11 relations
C++
Comparison of optical character recognition software
Document layout analysis
FreeBSD
Expand
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2017
2017
Case Study of a highly automated Layout Analysis and OCR of an incunabulum: 'Der Heiligen Leben' (1488)
Christian Reul
,
M. Dittrich
,
Martin Gruner
Digital Access to Textual Cultural Heritage
2017
Corpus ID: 3448896
This paper provides the first thorough documentation of a high quality digitization process applied to an early printed book from…
Expand
2015
2015
Formalization and Preliminary Evaluation of a Pipeline for Text Extraction From Infographics
Falk Böschen
,
A. Scherp
LWA
2015
Corpus ID: 11452978
We propose a pipeline for text extraction from infographics that makes use of a novel combination of data mining and computer…
Expand
2015
2015
Evaluation of cursive and non-cursive scripts using recurrent neural networks
S. Ahmed
,
S. Naz
,
M. I. Razzak
,
Shiekh Faisal Rashid
,
Muhammad Zeshan Afzal
,
T. Breuel
Neural computing & applications (Print)
2015
Corpus ID: 254026514
Character recognition has been widely used since its inception in applications involved processing of scanned or camera-captured…
Expand
2014
2014
A Method to Provide High Volume Transaction Outputs Accessibility to Vision Impaired Using Layout Analysis
A. Nazemi
,
I. Murray
,
D. McMeekin
2014
Corpus ID: 55177252
The Documents in the financial services, insurance, utilities, and government sectors typically require a high volume of PDF…
Expand
2013
2013
An OCR System with OCRopus for Scientific Documents Containing Mathematical Formulas
Fumihiro Furukori
,
Shinpei Yamazaki
,
T. Miyagishi
,
K. Shirai
,
Masayuki Okamoto
IEEE International Conference on Document…
2013
Corpus ID: 36089285
This paper describes the installation of a mathematical formula recognition module into an open source OCR system: OCRopus. In…
Expand
2012
2012
Semi-automated OCR database generation for Nabataean scripts
A. Ul-Hasan
,
S. S. Bukhari
,
Sheikh Faisal Rashid
,
F. Shafait
,
T. Breuel
International Conference on Pattern Recognition
2012
Corpus ID: 14303031
A large amount of real-world data is required to train and benchmark any character recognition algorithm. Developing a page-level…
Expand
2012
2012
Ocropodium: open source OCR for small-scale historical archives
Tobias Blanke
,
Michael Bryant
,
M. Hedges
Journal of information science
2012
Corpus ID: 206454485
Large-scale digitization projects dealing with text-based historical material face challenges that are not well catered for by…
Expand
2011
2011
Boosting based text and non-text region classification
Bingqing Xie
,
G. Agam
Electronic imaging
2011
Corpus ID: 36257740
Layout analysis is a crucial process for document image understanding and information retrieval. Document layout analysis depends…
Expand
2010
2010
EXTENDING THE PAGE SEGMENTATION ALGORITHMS OF THE OCROPUS DOCUMENTATION LAYOUT ANALYSIS SYSTEM
A. Winder
2010
Corpus ID: 61524358
With the advent of more powerful personal computers, inexpensive memory, and digital cameras, curators around the world are…
Expand
2008
2008
The OCRopus open source OCR system (Proceedings Paper)
T. Breuel
,
B. Yanikoglu
,
K. Berkner
2008
Corpus ID: 62224309
Proceedings Vol. 6815 Document Recognition and Retrieval XV, Berrin A. Yanikoglu; Kathrin Berkner, Editors, 68150F Date: 28…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE