Taxonomizer: Interactive Construction of Fully Labeled Hierarchical Groupings from Attributes of Multivariate Data

  title={Taxonomizer: Interactive Construction of Fully Labeled Hierarchical Groupings from Attributes of Multivariate Data},
  author={Salman Mahmood and Klaus Mueller},
  journal={IEEE Transactions on Visualization and Computer Graphics},
  • Salman Mahmood, K. Mueller
  • Published 2020
  • Computer Science, Medicine
  • IEEE Transactions on Visualization and Computer Graphics
Organizing multivariate data spaces by their dimensions or attributes can be a rather difficult task. Most of the work in this area focuses on the statistical aspects such as correlation clustering, dimension reduction, and the like. These methods typically produce hierarchies in which the leaf nodes are labeled by the attribute names while the inner nodes are often represented by just a statistical measure and criterion, such as a threshold. This makes them difficult to understand for… Expand


Large-scale taxonomy induction using entity and word embeddings
TIEmb is proposed, an approach for automatic unsupervised class subsumption axiom extraction from knowledge bases using entity and text embeddings, and applied on the WebIsA database, a database of subsumption relations extracted from the large portion of the World Wide Web, to extract class hierarchies in the Person and Place domain. Expand
Treemaps: Visualizing Hierarchical and Categorical Data
Generating existing diagrams via treemap transformations is an exercise meant to show the power, ease, and generality with which alternative presentations can be generated from the basic treemaps algorithms. Expand
Learning Semantic Hierarchies via Word Embeddings
This paper proposes a novel and effective method for the construction of semantic hierarchies based on word embeddings, which can be used to measure the semantic relationship between words. Expand
Learning Syntactic Patterns for Automatic Hypernym Discovery
This paper presents a new algorithm for automatically learning hypernym (is-a) relations from text, using "dependency path" features extracted from parse trees and introduces a general-purpose formalization and generalization of these patterns. Expand
RDF2Vec: RDF Graph Embeddings for Data Mining
RDF2Vec is presented, an approach that uses language modeling approaches for unsupervised feature extraction from sequences of words, and adapts them to RDF graphs, and shows that feature vector representations of general knowledge graphs such as DBpedia and Wikidata can be easily reused for different tasks. Expand
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
A Sentiment Treebank that includes fine grained sentiment labels for 215,154 phrases in the parse trees of 11,855 sentences and presents new challenges for sentiment compositionality, and introduces the Recursive Neural Tensor Network. Expand
Distributed Representations of Sentences and Documents
Paragraph Vector is an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents, and its construction gives the algorithm the potential to overcome the weaknesses of bag-of-words models. Expand
Automatic Acquisition of Hyponyms from Large Text Corpora
A set of lexico-syntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of interest are identified. Expand
Entity Hierarchy Embedding
This work proposes a principled framework of embedding entities that integrates hierarchical information from large-scale knowledge bases and shows that both the entity vectors and category distance metrics encode meaningful semantics. Expand
Just-in-time annotation of clusters, outliers, and trends in point-based data visualizations
  • E. Kandogan
  • Computer Science
  • 2012 IEEE Conference on Visual Analytics Science and Technology (VAST)
  • 2012
It is argued that just-in-time descriptive analytics applied to a point-based multi-dimensional visualization technique to identify and describe clusters, outliers, and trends provides a novel user experience of computational techniques working alongside of users allowing them to build faster qualitative mental models of data. Expand