The OCRopus open source OCR system


OCRopus is a new, open-source OCR system emphasizing modularity, easy extensibility, and reuse, aimed at both the research community and large-scale commercial document conversion. This paper describes the current status of the system, its general architecture, and the major algorithms currently used for layout analysis and text line recognition.

DOI: 10.1117/12.783598

Semantic Scholar estimates that this publication has 132 citations based on the available data.

Cite this paper

@inproceedings{Breuel2008TheOO, title={The OCRopus open source OCR system}, author={Thomas M. Breuel}, booktitle={DRR}, year={2008} }