Lux: always-on visualization recommendations for exploratory dataframe workflows

Lee, Doris Jung-Lin; Tang, Dixin; Agarwal, Kunal; Boonmark, Thyne; Chen, Caitlyn; Kang, Jake; Mukhopadhyay, Ujjaini; Song, Jerry; Yong, Micah; Hearst, Marti A.; Parameswaran, Aditya G.

doi:10.14778/3494124.3494151

Citation Details

Lux: always-on visualization recommendations for exploratory dataframe workflows

Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data. Yet, visually exploring data in dataframes remains tedious, requiring substantial programming effort for visualization and mental effort to determine what analysis to perform next. We propose Lux, an always-on framework for accelerating visual insight discovery in dataframe workflows. When users print a dataframe in their notebooks, Lux recommends visualizations to provide a quick overview of the patterns and trends and suggests promising analysis directions. Lux features a high-level language for generating visualizations on demand to encourage rapid visual experimentation with data. We demonstrate that through the use of a careful design and three system optimizations, Lux adds no more than two seconds of overhead on top of pandas for over 98% of datasets in the UCI repository. We evaluate Lux in terms of usability via interviews with early adopters, finding that Lux helps fulfill the needs of data scientists for visualization support within their dataframe workflows. Lux has already been embraced by data science practitioners, with over 3.1k stars on Github. more »

Award ID(s):: 1940757

PAR ID:: 10324482

Author(s) / Creator(s):: Lee, Doris Jung-Lin; Tang, Dixin; Agarwal, Kunal; Boonmark, Thyne; Chen, Caitlyn; Kang, Jake; Mukhopadhyay, Ujjaini; Song, Jerry; Yong, Micah; Hearst, Marti A.; Parameswaran, Aditya G.

Date Published:: 2021-11-01

Journal Name:: Proceedings of the VLDB Endowment

Volume:: 15

Issue:: 3

ISSN:: 2150-8097

Page Range / eLocation ID:: 727 to 738

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.14778/3494124.3494151

More Like this