OpenII: an open source information integration toolkit
OpenII (openintegration.org) is a collaborative effort to create a suite of open-source tools for information integration (II). The project is leveraging the latest developments in II research to…
Exploring schema similarity at multiple resolutions
Large, dynamic, and ad-hoc organizations must frequently initiate data integration and sharing efforts with insufficient awareness of how organizational data sources are related. Decision makers need…
Efficient Algorithms for Allocation Policies
Recent work [2] proposed extending the OLAP data model to represent data ambiguity. Specifically, one form of ambiguity that work addressed arose from relaxing the assumption that all dimension…
Table extraction and understanding for scientific and enterprise applications
Valuable high-precision data are often published in the form of tables in both scientific and business documents. While humans can easily identify, interpret, and contextualize tables, developing…
Compiling Machine Learning Algorithms with SystemML Extended Abstract
Analytics on big data may range from passenger volume prediction in transportation to customer satisfaction in automotive diagnostic systems, and from correlation analysis in social media data to log…