• Publications
  • Influence
A Semantic Approach to XML-based Data Integration
TLDR
A prototype tool, named DIXSE, which supports the integration of XML Document Type Definitions (DTDs) into a common conceptual schema, thereby achieving data integration for XML documents. Expand
Messing Up with BART: Error Generation for Evaluating Data-Cleaning Algorithms
TLDR
The error-generation problem is surprisingly challenging, and in fact, NP-complete, and to provide a scalable solution, a correct and efficient greedy algorithm is developed that sacrifices completeness, but succeeds under very reasonable assumptions. Expand
The iBench Integration Metadata Generator
TLDR
iBench is the first metadata generator that can be used to evaluate a wide-range of integration tasks ( data exchange, mapping creation, mapping composition, schema evolution, among many others) and is believed to raise the bar for empirical evaluation and comparison of data integration systems. Expand
Value invention in data exchange
TLDR
Two techniques for understanding when the Skolem functions needed to represent the correct semantics of incomplete information are computationally well-behaved in second-order (SO) mappings have a first-orders semantics and are therefore programmatically and computationally more desirable for use in practice are presented. Expand
Answering Clinical Questions with Role Identification
TLDR
An alternative approach whose organizing principle is the identification of semantic roles in both question and answer texts that correspond to the fields of PICO format is described. Expand
Data Lake Management: Challenges and Opportunities
TLDR
This tutorial considers how data lakes are introducing new problems including dataset discovery and how they are changing the requirements for classic problems including data extraction, data cleaning, data integration, data versioning, and metadata management. Expand
ToX - the Toronto XML Engine
TLDR
This paper describes the architecture and main of ToX, a repository for XML data and metadata, which supports real and virtual XML documents, and presents the indexing and storage strategies. Expand
Composing local-as-view mappings: closure and applications
TLDR
This paper shows the tractability of the problem for LAV mappings, and provides an algorithm to directly compute the composition of LAV tgds, and gives a polynomial-time algorithm to solve it. Expand
Data Sharing in the Hyperion Peer Database System
TLDR
This demo presents Hyperion, a prototype system that supports data sharing for a network of independent Peer Relational Database Management Systems (PDBMSs) and illustrates the following key functionalities of Hyperion: the use of ( data level) mapping tables to infer new metadata as peers dynamically join the network, and the ability to coordinate peers through update propagation. Expand
Semantic models for knowledge management
TLDR
A set of modeling constructs for representing goals, events and actors that are relevant to the work of analysts are presented and a qualitative goal analysis procedure is described which makes it possible to reason about a goal model under different assumptions. Expand
...
1
2
...