HMM-Based Alignment of Inaccurate Transcriptions for Historical Documents

Abstract

For historical documents, available transcriptions typically are inaccurate when compared with the scanned document images. Not only the position of the words and sentences are unknown, but also the correct image transcription may not be matched exactly. An error-tolerant alignment is needed to make the document images amenable to browsing and searching in… (More)
DOI: 10.1109/ICDAR.2011.20

Topics

5 Figures and Tables

Slides referencing similar topics