• Publications
  • Influence
LabBook: Metadata-driven social collaborative data analysis
TLDR
LabBook is an open, social, and collaborative data analysis platform designed explicitly to reduce this friction and accelerate discovery. Expand
  • 37
  • 2
  • PDF
Making Open Data Transparent: Data Discovery on Open Data
TLDR
We present new table join and table union search solutions that provide interactive search speed even over massive collections of millions of attributes with heavily skewed cardinality distributions. Expand
  • 9
  • 1
  • PDF
Barriers to adoption of information technology in healthcare
TLDR
In this paper, we take a systems thinking perspective to identify barriers to the application of information technology in healthcare and explore solutions for overcoming them. Expand
  • 9
  • PDF
VizCurator: A Visual Tool for Curating Open Data
TLDR
Vizcurator permits the exploration, understanding and curation of open RDF data, its schema, and how it has been linked to other sources. Expand
  • 8
  • PDF
VIQS: Visual Interactive Exploration of Query Semantics
TLDR
Analytics platforms such as IBM Watson Analytics TM are collecting metadata about their use, including user queries on uploaded datasets. Expand
  • 6
  • PDF
VoidWiz: Resolving incompleteness using network effects
TLDR
We introduce a principled way of performing value imputation on missing values, allowing a user to choose a correct value after viewing possible values and why they were inferred. Expand
  • 2
  • PDF
Automated Conceptual Abstraction of Large Semantic Diagrams
The design and development of applications and systems is a multistep process involving developers and stakeholders. During the development lifecycle, numerous diagrams are often created in order toExpand
Pytheas
CSV is a popular Open Data format widely used in a variety of domains for its simplicity and effectiveness in storing and disseminating data. Unfortunately, data published in this format often doesExpand
Pytheas: Pattern-based Table Discovery in CSV Files
TLDR
Pytheas is a principled method for automatically classifying lines in a CSV file and discovering tables within it based on the intuition that tables maintain a coherency of values in each column. Expand
Towards a Storage Stack for the Data Center
Towards a Storage Stack for the Data Center Ioan Alexandru Stefanovici Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2016 The storage stack in a data centerExpand