Keyword spotting in degraded document using mixed OCR and word shape coding

@article{Xia2010KeywordSI,
  title={Keyword spotting in degraded document using mixed OCR and word shape coding},
  author={Yong Xia and Guangri Quan and Yongdong Xu and Yushan Sun},
  journal={2010 IEEE International Conference on Intelligent Computing and Intelligent Systems},
  year={2010},
  volume={3},
  pages={411-414}
}
This paper presents a new way for keyword spotting in degraded imaged document. Two prevalent word indexing, OCR and word shape coding, are combined compactly based on the recognition confidence evaluation. The basic procedures are as follows. First, OCR candidates are used for OCR indexing. Second, a new stoke feature and convex-concave feature of word are adopted for word shape coding. Furthermore, an intelligent indexing based on recognition confidence is introduced, which is adaptive to… CONTINUE READING