Table detection in heterogeneous documents

  title={Table detection in heterogeneous documents},
  author={Faisal Shafait and Ray Smith},
  booktitle={Document Analysis Systems},
Detecting tables in document images is important since not only do tables contain important information, but also most of the layout analysis methods fail in the presence of tables in the document image. Existing approaches for table detection mainly focus on detecting tables in single columns of text and do not work reliably on documents with varying layouts. This paper presents a practical algorithm for table detection that works with a high accuracy on documents with varying layouts (company… CONTINUE READING
Highly Cited
This paper has 42 citations. REVIEW CITATIONS
33 Citations
3 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 33 extracted citations


Publications referenced by this paper.
Showing 1-3 of 3 references

An overview of the Tesseract OCR engine

  • R. Smith
  • In Proc. 9th Int. Conf. on Document Analysis and…
  • 2007
Highly Influential
10 Excerpts

Similar Papers

Loading similar papers…