Corpus ID: 33057031

Exploration of Heterogeneous Data Using Robust Similarity

@article{Mirzargar2017ExplorationOH,
  title={Exploration of Heterogeneous Data Using Robust Similarity},
  author={Mahsa Mirzargar and Ross T. Whitaker and Robert Michael Kirby},
  journal={ArXiv},
  year={2017},
  volume={abs/1710.02862}
}
Heterogeneous data pose serious challenges to data analysis tasks, including exploration and visualization. Current techniques often utilize dimensionality reductions, aggregation, or conversion to numerical values to analyze heterogeneous data. However, the effectiveness of such techniques to find subtle structures such as the presence of multiple modes or detection of outliers is hindered by the challenge to find the proper subspaces or prior knowledge to reveal the structures. In this paper… Expand
1 Citations
Visualizing Multidimensional Data with Order Statistics
TLDR
A novel method to project data onto a lower dimensional space by taking into account the order statistics of the individual data points, which are quantified by their depth or centrality in the overall set. Expand

References

SHOWING 1-10 OF 39 REFERENCES
The Data Context Map: Fusing Data and Attributes into a Unified Display
TLDR
The resulting layout places the data objects in direct context of the attributes and hence it is called the data context map, which enables the map's application in selection tasks where users seek to identify one or more data objects that best fit a certain configuration of factors, using the map to visually balance the tradeoffs. Expand
A Structure-Based Distance Metric for High-Dimensional Space Exploration with Multidimensional Scaling
TLDR
This work was inspired by the perceptual processes evoked in the method of parallel coordinates which enables users to visually aggregate the data by the patterns the polylines exhibit across the dimension axes and suggests a metric that captures this structure directly in high-dimensional space. Expand
Open-Box Spectral Clustering: Applications to Medical Image Analysis
  • T. Schultz, G. Kindlmann
  • Computer Science, Medicine
  • IEEE Transactions on Visualization and Computer Graphics
  • 2013
TLDR
This framework focuses on applications in 3D image analysis, and links the abstract high-dimensional feature space used in spectral clustering to the three-dimensional data space, which provides a better understanding of the technique and helps the analyst predict how well specific parameter settings will generalize to similar tasks. Expand
Steerable, Progressive Multidimensional Scaling
TLDR
This work presents MDSteer, a steerable MDS computation engine and visualization tool that progressively computes an MDS layout and handles datasets of over one million points. Expand
Evaluation of Cluster Identification Performance for Different PCP Variants
TLDR
A user study is performed to evaluate cluster identification performance – with respect to response time and correctness – of nine PCP variations, including standard PCPs, and finds that a fair number of the seemingly valid improvements do not result in significant performance gains. Expand
On the Concept of Depth for Functional Data
The statistical analysis of functional data is a growing need in many research areas. In particular, a robust methodology is important to study curves, which are the output of many experiments inExpand
A Survey of Clustering Data Mining Techniques
  • P. Berkhin
  • Computer Science
  • Grouping Multidimensional Data
  • 2006
TLDR
This survey concentrates on clustering algorithms from a data mining perspective as a data modeling technique that provides for concise summaries of the data. Expand
Integrating statistics and visualization: case studies of gaining clarity during exploratory data analysis
TLDR
It is demonstrated that the tight integration of statistics and visualizations improves exploratory data analysis, and that the evaluation methodology for long-term case studies captures the research strategies of data analysts. Expand
StratomeX: Visual Analysis of Large‐Scale Heterogeneous Genomics Data for Cancer Subtype Characterization
TLDR
StratomeX is an integrative visualization tool that allows investigators to explore the relationships of candidate subtypes across multiple genomic data types such as gene expression, DNA methylation, or copy number data and proposes a meta visualization and configuration interface for dataset dependencies and data‐view relationships. Expand
Evaluation of Parallel Coordinates: Overview, Categorization and Guidelines for Future Research
  • J. Johansson, C. Forsell
  • Computer Science, Medicine
  • IEEE Transactions on Visualization and Computer Graphics
  • 2016
TLDR
A thorough literature survey of what has been done in the area of user-centred evaluation of parallel coordinates is contributed and a set of guidelines for future research directions is proposed. Expand
...
1
2
3
4
...