A Fast Alignment Scheme for Automatic OCR Evaluation of Books

Abstract

This paper aims to evaluate the accuracy of optical character recognition (OCR) systems on real scanned books. The ground truth e-texts are obtained from the Project Gutenberg website and aligned with their corresponding OCR output using a fast recursive text alignment scheme (RETAS). First, unique words in the vocabulary of the book are aligned with unique… (More)
DOI: 10.1109/ICDAR.2011.157

3 Figures and Tables

Cite this paper

@article{Yalniz2011AFA, title={A Fast Alignment Scheme for Automatic OCR Evaluation of Books}, author={Ismet Zeki Yalniz and R. Manmatha}, journal={2011 International Conference on Document Analysis and Recognition}, year={2011}, pages={754-758} }