Segmentation Based Urdu Nastalique OCR

  title={Segmentation Based Urdu Nastalique OCR},
  author={Sobia Tariq Javed and Sarmad Hussain},
Urdu Language is written in Nastalique writing style, which is highly cursive, context sensitive and is difficult to process as only the last character in its ligature resides on the baseline. This paper focuses on the development of OCR using Hidden Markov Model and rule based post-processor. The recognizer gets the main body (without diacritics) as input and recognizes the corresponding ligature. Accuracy of the system is 92.73% for printed and then scanned document images at 36 font size. 
14 Citations
24 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 14 extracted citations


Publications referenced by this paper.
Showing 1-10 of 24 references

Investigation into a Segmentation Based OCR for the Nastalique Writing System. Master’s thesis report at National University of Computer and Emerging Sciences, Lahore

  • S. T. Javed, S. Hussain
  • 2007
Highly Influential
3 Excerpts

Urdu Nastalique Optical Character Recognition

  • Z. Ahmad, J. K. Orakzai, I. Shamsher, A. Adnan
  • In the Proceedings of World Academy of Science…
  • 2007
1 Excerpt

Urdu. In A Study on Collation of Languages from Developing Asia, Center for Research in Urdu Language

  • S. Hussain, N. Durrani
  • 2007
1 Excerpt

Context Sensitive Shape-Substitution in Nastaliq Writing system: Analysis and Formulation

  • A. Wali, S. Hussain
  • In the Proceedings of International Joint…
  • 2006
2 Excerpts

Similar Papers

Loading similar papers…