NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Physical Visualization Design: Decoupling Interface and System Design

Chen, Yiru; Li, Xupeng; Tao, Jeffrey; Ramjit, Lana; Mitra, Subrata; Ghaderi, Javad; Netravali, Ravi; Parameswaran, Aditya; Rubenstein, Dan; Wu, Eugene (June 2025, ACM)

Interactive visualization interfaces enable users to efficiently explore, analyze, and make sense of their datasets. However, as data grows in size, it becomes increasingly challenging to build data interfaces that meet the interface designer’s desired latency expectations and resource constraints. Cloud DBMSs, while optimized for scalability, often fail to meet latency expectations, necessitating complex, bespoke query execution and optimization techniques for data interfaces. This involves manually navigating a huge optimization space that is sensitive to interface design and resource constraints, such as client vs server data and compute placement, choosing which computations are done offline vs online, and selecting from a large library of visualization-optimized data structures. This paper advocates for a Physical Visualization Design (PVD) tool that decouples interface design from system design to provide design independence. Given an interfaces underlying data flow, interactions with latency expectations, and resource constraints, PVD checks if the interface is feasible and, if so, proposes and instantiates a middleware architecture spanning the client, server, and cloud DBMS that meets the expectations. To this end, this paper presents Jade, the first prototype PVD tool that enables design independence. Jade proposes an intermediate representation called Diffplans to represent the data flows, develops cost estimation models that trade off between latency guarantees and plan feasibility, and implements an optimization framework to search for the middleware architecture that meets the guarantees. We evaluate Jade on six representative data interfaces as compared to Mosaic and Azure SQL database. We find Jade supports a wider range of interfaces, makes better use of available resources, and can meet a wider range of data, latency, and resource conditions.
more » « less
Free, publicly-accessible full text available June 20, 2026
DIG: The Data Interface Grammar

https://doi.org/10.1145/3597465.3605223

Chen, Yiru; Tao, Jeffrey; Wu, Eugene (June 2023, ACM)

Building interactive data interfaces is hard because the design of an interface depends on the data processing needs for the underlying analysis task, yet we do not have a good representation for analysis tasks. To fill this gap, this paper advocates for a Data Interface Grammar (DIG) as an intermediate representation of analysis tasks. We show that DIG is compatible with existing data engineering practices, compact to represent any analysis, simple to translate into an interface design, and amenable to offline analysis. We further illustrate the potential benefits of this abstraction, such as automatic interface generation, automatic interface backend optimization, tutorial generation, and workload generation.
more » « less
Full Text Available
OM3: An Ordered Multi-level Min-Max Representation for Interactive Progressive Visualization of Time Series

https://doi.org/10.1145/3589290

Wang, Yunhai; Wang, Yuchun; Chen, Xin; Zhao, Yue; Zhang, Fan; Wu, Eugene; Fu, Chi-Wing; Yu, Xiaohui (June 2023, Proceedings of the ACM on Management of Data)

We present a novel multi-level representation of time series called OM3 that facilitates efficient interactive progressive visualization of large data stored in a database and supports various interactions such as resizing, panning, zooming, and visual query. Based on our proposed line-segment aggregation, this representation can produce error-free line visualizations that preserve the shape of a time series in windows of arbitrary sizes. To reduce the interaction latency, we develop an incremental tree-based query strategy to support progressive visualizations, allowing a finer control on the accuracy-time tradeoff. We quantitatively compare OM3 with state-of-the-art methods, including a method implemented on a leading time-series database InfluxDB, in two settings with databases residing either in the local area network or on the cloud. Results show that OM^3 maintains a low latency within 300~ms on the web browser and a high data reduction ratio regardless of the data size (ranging from millions to billions of records), achieving around 1,000 times faster than the state-of-the-art methods on the largest dataset experimented with.
more » « less
Full Text Available
Reptile: Aggregation-level Explanations for Hierarchical Data

https://doi.org/10.1145/3514221.3517854

Huang, Zezhou; Wu, Eugene (June 2022, SIGMOD)

Full Text Available
How Do Captions Affect Visualization Reading?

Hanxiu 'Hazel' Zhu, Shelly Shiying (January 2022, VISComm)

Full Text Available
Demonstration of PI2: Interactive Visualization Interface Generation for SQL Analysis in Notebook

Jeffrey Tao, Yiru Chen (January 2022, SIGMOD Demo)

Full Text Available
NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries

Yiru Chen, Ryan Li (January 2022, NL2Viz)

Full Text Available
Facilitating Exploration with Interaction Snapshots under High Latency

https://doi.org/10.1109/VIS47514.2020.00034

Wu, Yifan; Chang, Remco; Hellerstein, Joseph M.; Wu, Eugene (February 2021, 2020 IEEE Visualization Conference (VIS))
null (Ed.)
Latency is, unfortunately, a reality when working with large data sets. Guaranteeing imperceptible latency for interactivity is often prohibitively expensive: the application developer may be forced to migrate data processing engines or deal with complex error bounds on samples, and to limit the application to users with high network bandwidth. Instead of relying on the backend, we propose a simple UX design-interaction snapshots. Responses of requests from the interactions are asynchronously loaded in "snapshots". With interaction snapshots, users can interact concurrently while the snapshots load. Our user study participants found it useful not to have to wait for each result and easily navigate to prior snapshots. For latency up to 5 seconds, participants were able to complete extrema, threshold, and trend identification tasks with little negative impact.
more » « less
Full Text Available
Impact of Cognitive Biases on Progressive Visualization

https://doi.org/10.1109/TVCG.2021.3051013

Procopio, Marianne; Mosca, Ab; Scheidegger, Carlos E.; Wu, Eugene; Chang, Remco (January 2021, IEEE Transactions on Visualization and Computer Graphics)
null (Ed.)
Full Text Available
DIEL: Interactive Visualization Beyond the Here and Now

Wu, Yifan; Chang, Remco; Hellerstein, Joseph; Satyanarayan, Arvind; Wu, Eugene (January 2021, IEEE Vis)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records