Automatic Information Extraction from Web Pages

@inproceedings{Yap2001AutomaticIE,
  title={Automatic Information Extraction from Web Pages},
  author={Roland H. C. Yap and Budi Rahardjo},
  booktitle={SIGIR},
  year={2001}
}
Many web pages have implicit structure. In this paper, we show the feasibility of automatically extracting data from web pages by using approximate matching techniques. This can be applied to generate automatic wrappers or to notify/display web page differences, web page change monitoring, etc. 

Figures, Tables, and Topics from this paper.

Explore Further: Topics Discussed in This Paper

Similar Papers

Loading similar papers…