STALKER : Learning Extraction Rules for Semistructured , Web-based Information Sources *

@inproceedings{Muslea1998STALKERL,
  title={STALKER : Learning Extraction Rules for Semistructured , Web-based Information Sources *},
  author={Ion Muslea and Steve Minton and Craig Knoblock},
  year={1998}
}
Information mediators are systems capable of providing a unified view of several information sources. Central to any mediator that accesses Web-based sources is a set of wrappers that can extract relevant information from Web pages. In this paper, we present a wrapper-induction algorithm that generates extraction rules for Web-based information sources. We introduce landmark automata, a formalism that describes classes of extraction rules. Our wrapper induction algorithm, STALKER, generates… CONTINUE READING
Highly Cited
This paper has 207 citations. REVIEW CITATIONS

7 Figures & Tables

Topics

Statistics

0102030'99'01'03'05'07'09'11'13'15'17
Citations per Year

208 Citations

Semantic Scholar estimates that this publication has 208 citations based on the available data.

See our FAQ for additional information.