• Publications
  • Influence
COMA - A System for Flexible Combination of Schema Matching Approaches
TLDR
This work develops the COMA schema matching system as a platform to combine multiple matchers in a flexible way and uses COMA as a framework to comprehensively evaluate the effectiveness of different matchers and their combinations for real-world schemas. Expand
Schema and ontology matching with COMA++
TLDR
Different match strategies can be applied including various forms of reusing previously determined match results and a so-called fragment-based match approach which decomposes a large match problem into smaller problems. Expand
Data Cleaning: Problems and Current Approaches
TLDR
This work classifies data quality problems that are addressed by data cleaning and provides an overview of the main solution approaches and discusses current tool support for data cleaning. Expand
Matching large schemas: Approaches and evaluation
TLDR
This work has developed a new generic schema matching tool, COMA++, providing a library of individual matchers and a flexible infrastructure to combine the matcher and refine their results, and conducted a comprehensive evaluation of the match strategies using large e-Business standard schemas. Expand
Comparison of Schema Matching Evaluations
TLDR
This paper surveys recently published schema matching evaluations and introduces the major criteria that influence the effectiveness of a schema matching approach and uses these criteria to compare the various systems. Expand
FUNC: a package for detecting significant associations between gene sets and ontological annotations
TLDR
The program package FUNC is developed that includes and expands on currently available methods to identify significant associations between gene sets and ontological annotations and is a versatile and useful tool for the analysis of genome-wide data. Expand
Regional patterns of gene expression in human and chimpanzee brains.
TLDR
A subset of genes that show expression differences between humans and chimpanzees are distributed nonrandomly across the genome and are statistically significantly enriched in regions that are recently duplicated in humans. Expand
Matching large XML schemas
TLDR
A fragment-oriented match approach is proposed to decompose a large match problem into several smaller ones and to reuse previous match results at the level of schema fragments. Expand
Quickmig: automatic schema matching for data migration projects
TLDR
QuickMig is described, a new semi-automatic approach to determining semantic correspondences between schema elements for data migration applications that advances the state of the art with a set of new techniques exploiting sample instances, domain ontologies, and reuse of existing mappings to detect not only element correspondences but also their mapping expressions. Expand
Flexible Integration of Molecular-Biological Annotation Data: The GenMapper Approach
TLDR
The GenMapper system that physically integrates heterogeneous annotation data in a flexible way and supports large-scale analysis on the integrated data and uses a generic data model to uniformly represent different kinds of annotations originating from different data sources. Expand
...
1
2
3
...