Answering queries using views: A survey
- A. Halevy
- Computer Science, The VLDB Journal
- 1 December 2001
The state of the art on the problem of answering queries using views is surveyed, the algorithms proposed to solve it are described, and the disparate works are synthesized into a coherent framework.
Crowdsourcing systems on the World-Wide Web
The practice of crowdsourcing is transforming the Web and giving rise to a new field of inquiry.
Learning to map between ontologies on the semantic web
GLUE is described, a system that employs machine learning techniques to find semantic mappings between ontologies; it is distinguished in that it works with a variety of well-defined similarity notions and efficiently incorporates multiple types of knowledge.
Querying Heterogeneous Information Sources Using Source Descriptions
The Information Manifold is described, an implemented system that provides uniform access to a heterogeneous collection of more than 100 information sources, many of them on the WWW, along with algorithms that use the source descriptions to efficiently prune the set of information sources for a given query.
Reconciling schemas of disparate data sources: a machine-learning approach
LSD is a system that employs and extends current machine-learning techniques to semi-automatically find semantic mappings between the source schemas and the mediated schema, and its architecture is extensible to additional learners that may exploit new kinds of information.
WebTables: exploring the power of tables on the web
- Michael J. Cafarella, A. Halevy, D. Wang, Eugene Wu, Yang Zhang
- Computer Science, Proceedings of the VLDB Endowment
- 1 August 2008
The WEBTABLES system develops new techniques for keyword search over a corpus of tables, and shows that they can achieve substantially higher relevance than solutions based on a traditional search engine.
Similarity Search for Web Services
From databases to dataspaces: a new abstraction for information management
This paper proposes dataspaces and their support systems as a new agenda for data management, which encompasses much of the work going on in data management today, while posing additional research objectives.
ULDBs: databases with uncertainty and lineage
- Omar Benjelloun, A. Sarma, A. Halevy, J. Widom
- Computer Science, Very Large Data Bases Conference
- 1 September 2006
It is shown that the ULDB representation is complete and that it permits straightforward implementation of many relational operations, and it is shown how ULDBs enable a new approach to query processing in probabilistic databases.
MiniCon: A scalable algorithm for answering queries using views
MiniCon is described, a novel algorithm for finding the maximally-contained rewriting of a conjunctive query using a set of conjunctive views, and it is shown that MiniCon scales up well and significantly outperforms the previous algorithms.