Mining data records in Web pages

  title={Mining data records in Web pages},
  author={Bing Liu and Robert L. Grossman and Yanhong Zhai},
A large amount of information on the Web is contained in regularly structured objects, which we call data records. Such data records are important because they often present the essential information of their host pages, e.g., lists of products or services. It is useful to mine such data records in order to extract information from them to provide value-added services. Existing automatic techniques are not satisfactory because of their poor accuracies. In this paper, we propose a more effective… CONTINUE READING
Highly Influential
This paper has highly influenced 83 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 535 citations. REVIEW CITATIONS
342 Citations
3 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 342 extracted citations

535 Citations

Citations per Year
Semantic Scholar estimates that this publication has 535 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-3 of 3 references

“Mining data records in Web pages.”

  • B. Liu, R. Grossman, Y. Zhai
  • UIC Technical Report,
  • 2003
Highly Influential
3 Excerpts

Similar Papers

Loading similar papers…