ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data

@article{Abdessalem2010ObjectRunnerLT,
  title={ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data},
  author={Talel Abdessalem and Bogdan Cautis and Nora Derouiche},
  journal={PVLDB},
  year={2010},
  volume={3},
  pages={1585-1588}
}
We present in this paper ObjectRunner, a system for extracting, integrating and querying structured data from the Web. Our system harvests real-world items from template-based HTML pages (the so-called structured Web). It illustrates a two-phase querying of the Web, in which an intentional description of the targeted data is first provided, in a flexible and widely applicable manner. ObjectRunner follows then a lightweight, best-effort approach, leveraging both the input description and the… CONTINUE READING

From This Paper

Figures, tables, and topics from this paper.

Citations

Publications citing this paper.

Similar Papers

Loading similar papers…