Skip to search formSkip to main contentSkip to account menu

OCRopus

Known as: OCRopus (software) 
OCRopus is a free document analysis and optical character recognition (OCR) system released under the Apache License, Version 2.0 with a very modular… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2017
2017
This paper provides the first thorough documentation of a high quality digitization process applied to an early printed book from… 
2015
2015
We propose a pipeline for text extraction from infographics that makes use of a novel combination of data mining and computer… 
2015
2015
Character recognition has been widely used since its inception in applications involved processing of scanned or camera-captured… 
2014
2014
The Documents in the financial services, insurance, utilities, and government sectors typically require a high volume of PDF… 
2013
2013
This paper describes the installation of a mathematical formula recognition module into an open source OCR system: OCRopus. In… 
2012
2012
A large amount of real-world data is required to train and benchmark any character recognition algorithm. Developing a page-level… 
2012
2012
Large-scale digitization projects dealing with text-based historical material face challenges that are not well catered for by… 
2011
2011
Layout analysis is a crucial process for document image understanding and information retrieval. Document layout analysis depends… 
2010
2010
With the advent of more powerful personal computers, inexpensive memory, and digital cameras, curators around the world are… 
2008
2008
Proceedings Vol. 6815 Document Recognition and Retrieval XV, Berrin A. Yanikoglu; Kathrin Berkner, Editors, 68150F Date: 28…