Scaling Access to Heterogeneous Data Sources with Disco (DRAFT: NOT FOR DISTRIBUTION; SEE TKDE 1998 FOR FINAL VERSION)


Anthony Tomasic, Louiqa Raschid and Patrick Valduriez

Abstract: Accessing many data sources aggravates problems for users of heterogeneous distributed databases. Database administrators must deal with fragile mediators, that is, mediators with schemas and views that must be significantly changed to incorporate a new data source. When implementing translators of queries from mediators to data sources, database implementors must deal with weak data sources that do not support all the functionality required by mediators. Application programmers must deal with graceless failures for unavailable data sources: queries simply return failure and no further information when data sources are unavailable for query processing. The Distributed Information Search COmponent (Disco) addresses these problems. Data modeling techniques manage the connections to data sources, and sources can be added transparently to the users and applications. The interface between mediators and data sources flexibly handles different query languages and different data source functionality. Query rewriting and optimization techniques rewrite queries so they are efficiently evaluated by sources. Query processing and evaluation semantics are developed to process queries over unavailable data sources. In this article we describe (a) the distributed mediator architecture of Disco; (b) the data model and its modeling of data source connections; (c) the interface to underlying data sources and the query rewriting process; and (d) query processing semantics. We describe several advantages of our system.

Keywords: Heterogeneous Database, Query Reformulation, Source Capability, Heterogeneous Cost Model, Partial Answer, Partial Evaluation

Cite this paper

@inproceedings{Tomasic1998ScalingAT, title={Scaling Access to Heterogeneous Data Sources with Disco}, author={Anthony Tomasic and Louiqa Raschid and Patrick Valduriez}, year={1998}, note={Draft, not for distribution; see TKDE 1998 for final version} }