Conceptual-Model-Based Data Extraction from Multiple-Record Web Pages

  title={Conceptual-Model-Based Data Extraction from Multiple-Record Web Pages},
  author={David W. Embley and Douglas M. Campbell and Y. S. Jiang and Stephen W. Liddle and Yiu-Kai Ng and Dallan Quass and Randy D. Smith},
  journal={Data Knowl. Eng.},
Electronically available data on the Web is exploding at an ever increasing pace. Much of this data is unstructured, which makes searching hard and traditional database querying impossible. Many Web documents, however, contain an abundance of recognizable constants that together describe the essence of a document’s content. For these kinds of data-rich, multiple-record documents (e.g. advertisements, movie reviews, weather reports, travel information, sports summaries, financial statements… CONTINUE READING
Highly Influential
This paper has highly influenced 13 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 428 citations. REVIEW CITATIONS

8 Figures & Tables



Citations per Year

429 Citations

Semantic Scholar estimates that this publication has 429 citations based on the available data.

See our FAQ for additional information.