Interactive Navigation of Open Data Linkages

  • Erkang Zhu, Ken Q. Pu, F. Nargesian, R. Miller
  • Published 2017
  • Computer Science
  • Proc. VLDB Endow.
  • We developed Toronto Open Data Search to support the ad hoc, interactive discovery of connections, or linkages, between datasets. It can be used to navigate the open data cloud efficiently. The system consists of three parts: a user interface provided by a Web application; a scalable backend infrastructure that supports navigational queries; and a dynamic repository of open data tables. It uses LSH Ensemble, an efficient index structure, to compute linkages (attributes in two…
    9 Citations

    Optimizing Organizations for Navigating Data Lakes
    • 3
    Data Lake Organization
    • 1
    Organizing Data Lakes for Navigation
    • 3
    Open Data Integration
    • R. Miller
    • Computer Science
    • Proc. VLDB Endow.
    • 2018
    • 20
    Loki: Streamlining Integration and Enrichment
    Top-k Queries over Digital Traces
    Making Open Data Transparent: Data Discovery on Open Data
    • 9
    Data Lake Management: Challenges and Opportunities
    • 17
    A Smart City Dashboard for Combining and Analysing Multi-source Data Streams
    • A. Gledson, Thamer Ba Dhafari, N. Paton, J. Keane
    • Computer Science
    • 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS)
    • 2018
    • 2


    Discovering Linkage Points over Web Data
    • 33
    The Mannheim Search Join Engine
    • 48
    Finding related tables
    • 133
    A Large Public Corpus of Web Tables containing Time and Context Metadata
    • 76
    Mining of Massive Datasets
    • 1,620
    LSH Ensemble: Internet-Scale Domain Search
    • 39
    Approximate nearest neighbors: towards removing the curse of dimensionality
    • 3,894
    On the resemblance and containment of documents
    • A. Broder
    • Mathematics, History
    • Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171)
    • 1997
    • 1,644