• Corpus ID: 33057031

Exploration of Heterogeneous Data Using Robust Similarity

  title={Exploration of Heterogeneous Data Using Robust Similarity},
  author={Mahsa Mirzargar and Ross T. Whitaker and Robert Michael Kirby},
Heterogeneous data pose serious challenges to data analysis tasks, including exploration and visualization. Current techniques often utilize dimensionality reductions, aggregation, or conversion to numerical values to analyze heterogeneous data. However, the effectiveness of such techniques to find subtle structures such as the presence of multiple modes or detection of outliers is hindered by the challenge to find the proper subspaces or prior knowledge to reveal the structures. In this paper… 
1 Citations
Visualizing Multidimensional Data with Order Statistics
A novel method to project data onto a lower dimensional space by taking into account the order statistics of the individual data points, which are quantified by their depth or centrality in the overall set.


The Data Context Map: Fusing Data and Attributes into a Unified Display
The resulting layout places the data objects in direct context of the attributes and hence it is called the data context map, which enables the map's application in selection tasks where users seek to identify one or more data objects that best fit a certain configuration of factors, using the map to visually balance the tradeoffs.
A Structure-Based Distance Metric for High-Dimensional Space Exploration with Multidimensional Scaling
This work was inspired by the perceptual processes evoked in the method of parallel coordinates which enables users to visually aggregate the data by the patterns the polylines exhibit across the dimension axes and suggests a metric that captures this structure directly in high-dimensional space.
Open-Box Spectral Clustering: Applications to Medical Image Analysis
  • T. Schultz, G. Kindlmann
  • Computer Science, Medicine
    IEEE Transactions on Visualization and Computer Graphics
  • 2013
This framework focuses on applications in 3D image analysis, and links the abstract high-dimensional feature space used in spectral clustering to the three-dimensional data space, which provides a better understanding of the technique and helps the analyst predict how well specific parameter settings will generalize to similar tasks.
Steerable, Progressive Multidimensional Scaling
This work presents MDSteer, a steerable MDS computation engine and visualization tool that progressively computes an MDS layout and handles datasets of over one million points.
Evaluation of Cluster Identification Performance for Different PCP Variants
A user study is performed to evaluate cluster identification performance – with respect to response time and correctness – of nine PCP variations, including standard PCPs, and finds that a fair number of the seemingly valid improvements do not result in significant performance gains.
On the Concept of Depth for Functional Data
The statistical analysis of functional data is a growing need in many research areas. In particular, a robust methodology is important to study curves, which are the output of many experiments in
A Survey of Clustering Data Mining Techniques
  • P. Berkhin
  • Computer Science
    Grouping Multidimensional Data
  • 2006
This survey concentrates on clustering algorithms from a data mining perspective as a data modeling technique that provides for concise summaries of the data.
Integrating statistics and visualization: case studies of gaining clarity during exploratory data analysis
It is demonstrated that the tight integration of statistics and visualizations improves exploratory data analysis, and that the evaluation methodology for long-term case studies captures the research strategies of data analysts.
StratomeX: Visual Analysis of Large‐Scale Heterogeneous Genomics Data for Cancer Subtype Characterization
StratomeX is an integrative visualization tool that allows investigators to explore the relationships of candidate subtypes across multiple genomic data types such as gene expression, DNA methylation, or copy number data and proposes a meta visualization and configuration interface for dataset dependencies and data‐view relationships.
Evaluation of Parallel Coordinates: Overview, Categorization and Guidelines for Future Research
  • J. Johansson, C. Forsell
  • Computer Science, Medicine
    IEEE Transactions on Visualization and Computer Graphics
  • 2016
A thorough literature survey of what has been done in the area of user-centred evaluation of parallel coordinates is contributed and a set of guidelines for future research directions is proposed.