Robust text extraction in mixed-type binary documents


Text extraction from documents is an essential preprocessing stage for applications such as OCR (optical character recognition), document image compression, storage and retrieval. Although many different techniques have been proposed to date, they usually assume that text orientation and size are fixed throughout the document image. Our work faces the problem…
DOI: 10.1109/MMSP.2008.4665110

