Text Line Segmentation for Unconstrained Handwritten Document Images Using Neighborhood Connected Component Analysis

@inproceedings{Khandelwal2009TextLS,
  title={Text Line Segmentation for Unconstrained Handwritten Document Images Using Neighborhood Connected Component Analysis},
  author={Abhishek Khandelwal and Pritha Choudhury and Ram Sarkar and Subhadip Basu and Mita Nasipuri and Nibaran Das},
  booktitle={PReMI},
  year={2009}
}
Text line extraction is the first and one of the most critical steps in optical character recognition (OCR) of unconstrained handwritten documents. The present work reports a new methodology based on comparison of neighborhood connected components to determine whether they belong to the same text line. Components which are very small or very large compared to the average component height are ignored in the preprocessing step. During post-processing, such components are reconsidered and… CONTINUE READING

Similar Papers

Topics from this paper.

Citations

Publications citing this paper.
SHOWING 1-10 OF 18 CITATIONS

High-Performance OCR on Packing Boxes in Industry Based on Deep Learning

VIEW 10 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

CMATERdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image

  • International Journal on Document Analysis and Recognition (IJDAR)
  • 2011
VIEW 8 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

iLPR: an Indian license plate recognition system

  • Multimedia Tools and Applications
  • 2014
VIEW 1 EXCERPT
CITES BACKGROUND