Large scale document image retrieval by automatic word annotation

  title={Large scale document image retrieval by automatic word annotation},
  author={K. Pramod Sankar and R. Manmatha and C. V. Jawahar},
  journal={International Journal on Document Analysis and Recognition (IJDAR)},
In this paper, we present a practical and scalable retrieval framework for large-scale document image collections, for an Indian language script that does not have a robust OCR. OCR-based methods face difficulties in character segmentation and recognition, especially for the complex Indian language scripts. We realize that character recognition is only an intermediate step toward actually labeling words. Hence, we re-pose the problem as one of directly performing word annotation. This new… CONTINUE READING
6 Citations
59 References
Similar Papers


Publications referenced by this paper.
Showing 1-10 of 59 references

Guide to OCR for Indic Scripts

  • V. Govindaraju, Setlur, S. eds.
  • Springer, Berlin
  • 2009
Highly Influential
3 Excerpts

Similar Papers

Loading similar papers…