Towards Sophisticated Wrapping of Web-based information Repositories

Abstract

Access to on-line information via the Web is exploding. Index and retrieval engines already start to integrate a huge variety of heterogeneous repositories. However, the heterogeneity issue remains, both in terms of the search formats and the formats of the result pages. In this paper we focus on html-based search and result presentations. We discuss our experience in the design, the development and the maintenance of wrappers (in the context of the Knowledge Broker project). We outline different ways to write wrappers, illustrate some of the lessons learned, and conclude by describing a semi-automatic approach for an efficient wrapping of Web-based information repositories. Throughout the paper, we give illustrating examples for hands-on readers.

Extracted Key Phrases

3 Figures and Tables

Cite this paper

@inproceedings{Chidlovskii1997TowardsSW, title={Towards Sophisticated Wrapping of Web-based information Repositories}, author={Boris Chidlovskii and Uwe M. Borghoff and Pierre-Yves Chevalier}, booktitle={RIAO}, year={1997} }