Table extraction using spatial reasoning on the CSS2 visual box model

  title={Table extraction using spatial reasoning on the CSS2 visual box model},
  author={Wolfgang Gatterbauer and Paul Bohunsky},
  booktitle={AAAI 2006},
Tables on web pages contain a huge amount of semantically explicit information, which makes them a worthwhile target for automatic information extraction and knowledge acquisition from the Web. However, the task of table extraction from web pages is difficult, because of HTML’s design purpose to convey visual instead of semantic information. In this paper, we propose a robust technique for table extraction from arbitrary web pages. This technique relies upon the positional information of… CONTINUE READING
Highly Cited
This paper has 63 citations. REVIEW CITATIONS


Publications citing this paper.

63 Citations

Citations per Year
Semantic Scholar estimates that this publication has 63 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…