Nearest neighbor based collection OCR

Abstract

Conventional optical character recognition (OCR) systems operate on individual characters and words, and do not normally exploit document or collection context. We describe a Collection OCR which takes advantage of the fact that multiple examples of the same word (often in the same font) may occur in a document or collection. The idea here is that an OCR or… (More)
DOI: 10.1145/1815330.1815357

10 Figures and Tables

Topics

  • Presentations referencing similar topics