• Corpus ID: 245385933

Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows

  title={Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows},
  author={Doris Jung Lin Lee and Dixin Tang and Kunal Agarwal and Thyne Boonmark and Caitlyn Chen and Jake Tan Jun Kang and Ujjaini Mukhopadhyay and Jerry Song and Micah Yong and Marti A. Hearst and Aditya G. Parameswaran},
Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data. Yet, visually exploring data in dataframes remains tedious, requiring substantial programming effort for visualization and mental effort to determine what analysis to perform next. We propose Lux, an always-on framework for accelerating visual insight discovery in dataframe workflows. When users print a dataframe in their… 


SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics
This work proposes SeeDB, a visualization recommendation engine to facilitate fast visual analysis: given a subset of data to be studied, SeeDB intelligently explores the space of visualizations, evaluates promising visualizations for trends, and recommends those it deems most “useful” or “interesting”.
The Case for a Visual Discovery Assistant: A Holistic Solution for Accelerating Visual Data Exploration
It is argued that typical navigation tasks can be organized across two dimensions—overall goal and precision of specification and together, addressing these challenges underlying VIDA can help pave the way for a comprehensive solution for removing the pain points in visual data exploration.
Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System
Data visualization is by far the most commonly used mechanism to explore data, especially by novice data analysts and data scientists. And yet, current visual analytics tools are rather limited in
DIVE: A Mixed-Initiative System Supporting Integrated Data Exploration Workflows
DIVE contributes a mixed-initiative interaction scheme that combines recommendation with point-and-click manual specification, and a consistent visual language that unifies different stages of the data exploration workflow, finding that DIVE users were significantly more successful and faster than Excel users at completing predefined data visualization and analysis tasks.
How Information Visualization Novices Construct Visualizations
An exploratory laboratory study in which information visualization novices explored fictitious sales data by communicating visualization specifications to a human mediator, who rapidly constructed the visualizations using commercial visualization software found that three activities were central to the iterative visualization construction process: data attribute selection, visual template selection, and visual mapping specification.
Foresight: Rapid Data Exploration Through Guideposts
Foresight, a visualization recommender system that helps the user rapidly explore large high-dimensional datasets through "guideposts", facilitates interactive exploration of large datasets using fast, approximate sketching to compute ranking metrics.
Deconstructing Categorization in Visualization Recommendation: A Taxonomy and Comparative Study
This paper explores the efficacy of recommendation categories by formalizing a taxonomy of common categories and developing a system, Frontier, that implements these categories and evaluating workflow strategies adopted by users and how categories influence those strategies.
The Interactive Visualization Gap in Initial Exploratory Data Analysis
This paper explores areas where visualization would be beneficial in applied research by conducting a design study using a novel variation on contextual inquiry conducted with professional data analysts and proposes a set of interactive initial exploratory visualization guidelines which it believes will promote adoption by this type of user.
Rapid Sampling for Visualizations with Ordering Guarantees
This paper formally shows that its sampling algorithms are generally applicable and provably optimal in theory, in that they do not take more samples than necessary to generate the visualizations with ordering guarantees and work well in practice, correctly ordering output groups while taking orders of magnitude fewer samples and much less time than conventional sampling schemes.
Dziban: Balancing Agency & Automation in Visualization Design via Anchored Recommendations
Dziban is a visualization API that supports both ambiguous specification and a novel anchoring mechanism for conveying desired context, providing a more concise and flexible authoring experience through automated design, while preserving predictability and control through anchored recommendations.