An Efficient Framework for Searching Text in Noisy Document Images

  title={An Efficient Framework for Searching Text in Noisy Document Images},
  author={Ismet Zeki Yalniz and R. Manmatha},
  journal={2012 10th IAPR International Workshop on Document Analysis Systems},
An efficient word spotting framework is proposed to search text in scanned books. The proposed method allows one to search for words when optical character recognition (OCR) fails due to noise or for languages where there is no OCR. Given a query word image, the aim is to retrieve matching words in the book sorted by the similarity. In the offline stage, SIFT descriptors are extracted over the corner points of each word image. Those features are quantized into visual terms (visterms) using… CONTINUE READING
Highly Cited
This paper has 36 citations. REVIEW CITATIONS
24 Citations
15 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 24 extracted citations


Publications referenced by this paper.

Similar Papers

Loading similar papers…