A Framework for Clustering Uncertain Data

  title={A Framework for Clustering Uncertain Data},
  author={Erich Schubert and Alexander Koos and Tobias Emrich and Andreas Z{\"u}fle and K. Schmid and A. Zimek},
  journal={Proc. VLDB Endow.},
  • Erich Schubert, Alexander Koos, +3 authors A. Zimek
  • Published 2015
  • Computer Science
  • Proc. VLDB Endow.
  • The challenges associated with handling uncertain data, in particular with querying and mining, are finding increasing attention in the research community. Here we focus on clustering uncertain data and describe a general framework for this purpose that also allows to visualize and understand the impact of uncertainty---using different uncertainty models---on the data mining results. Our framework constitutes release 0.7 of ELKI (http://elki.dbs.ifi.lmu.de/) and thus comes along with a plethora… CONTINUE READING

    Figures and Topics from this paper.

    DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN
    • 260
    • Open Access
    The (black) art of runtime evaluation: Are we comparing algorithms or implementations?
    • 96
    • Open Access
    dbscan : Fast Density-based Clustering with R
    • 37
    • Highly Influenced
    • Open Access
    An information-theoretic approach to hierarchical clustering of uncertain data
    • 14
    • Open Access
    Pivot-based Metric Indexing
    • 23
    • Open Access
    There and back again: Outlier detection between statistical reasoning and data mining algorithms
    • 17
    • Open Access


    Publications referenced by this paper.
    Efficient query evaluation on probabilistic databases
    • 1,082
    • Open Access
    Well-Separated Clusters and Optimal Fuzzy Partitions
    • 1,806
    A Method for Comparing Two Hierarchical Clusterings
    • 1,025
    • Open Access
    ULDBs: databases with uncertainty and lineage
    • 545
    • Open Access
    Managing and Mining Uncertain Data
    • 264
    • Open Access
    On comparing partitions
    • 1,614
    Trio: a system for data, uncertainty, and lineage
    • 411
    • Open Access
    Efficient Clustering of Uncertain Data
    • 245
    • Open Access
    Uncertain Data Mining: An Example in Clustering Location Data
    • 196
    • Open Access
    Fast and Simple Relational Processing of Uncertain Data
    • 280
    • Open Access