

Title: More of what? Dissociating effects of conceptual and numeric mappings on interpreting colormap data visualizations
Abstract

In visual communication, people glean insights about patterns of data by observing visual representations of datasets. Colormap data visualizations (“colormaps”) show patterns in datasets by mapping variations in color to variations in magnitude. When people interpret colormaps, they have expectations about how colors map to magnitude, and they are better at interpreting visualizations that align with those expectations. For example, they infer that darker colors map to larger quantities (dark-is-more bias) and colors that are higher on vertically oriented legends map to larger quantities (high-is-more bias). In previous studies, the notion of quantity was straightforward because more of the concept represented (conceptual magnitude) corresponded to larger numeric values (numeric magnitude). However, conceptual and numeric magnitude can conflict, such as using rank order to quantify health—smaller numbers correspond to greater health. Under conflicts, are inferred mappings formed based on the numeric level, the conceptual level, or a combination of both? We addressed this question across five experiments, spanning data domains: alien animals, antibiotic discovery, and public health. Across experiments, the high-is-more bias operated at the conceptual level: colormaps were easier to interpret when larger conceptual magnitude was represented higher on the legend, regardless of numeric magnitude. The dark-is-more bias tended to operate at the conceptual level, but numeric magnitude could interfere, or even dominate, if conceptual magnitude was less salient. These results elucidate factors influencing meanings inferred from visual features and emphasize the need to consider data meaning, not just numbers, when designing visualizations aimed to facilitate visual communication.

 
NSF-PAR ID:
10423803
Author(s) / Creator(s):
; ;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
Cognitive Research: Principles and Implications
Volume:
8
Issue:
1
ISSN:
2365-7464
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. People have expectations about how colors map to concepts in visualizations, and they are better at interpreting visualizations that match their expectations. Traditionally, studies on these expectations (inferred mappings) distinguished distinct factors relevant for visualizations of categorical vs. continuous information. Studies on categorical information focused on direct associations (e.g., mangos are associated with yellows) whereas studies on continuous information focused on relational associations (e.g., darker colors map to larger quantities; dark-is-more bias). We unite these two areas within a single framework of assignment inference. Assignment inference is the process by which people infer mappings between perceptual features and concepts represented in encoding systems. Observers infer globally optimal assignments by maximizing the “merit,” or “goodness,” of each possible assignment. Previous work on assignment inference focused on visualizations of categorical information. We extend this approach to visualizations of continuous data by (a) broadening the notion of merit to include relational associations and (b) developing a method for combining multiple (sometimes conflicting) sources of merit to predict people's inferred mappings. We developed and tested our model on data from experiments in which participants interpreted colormap data visualizations, representing fictitious data about environmental concepts (sunshine, shade, wild fire, ocean water, glacial ice). We found both direct and relational associations contribute independently to inferred mappings. These results can be used to optimize visualization design to facilitate visual communication.
  2. Abstract

    Visual representations of data are widely used for communication and understanding, particularly in science, technology, engineering, and mathematics (STEM). However, despite their importance, many people have difficulty understanding data-based visualizations. This work presents a series of three studies that examine how understanding of time-based Earth-science data visualizations is influenced by scale and by the different directions in which time can be represented (e.g., the Geologic Time Scale represents time moving from bottom-to-top, whereas many calendars represent time moving left-to-right). In Study 1, 316 visualizations from two top scholarly geoscience journals were analyzed for how time was represented. These expert-made graphs represented time in a range of ways, with smaller timescales more likely to be represented as moving left-to-right and larger scales more likely to be represented in other directions. In Study 2, 47 STEM novices were recruited from an undergraduate psychology experiment pool and asked to construct four separate graphs representing change over two scales of time (Earth’s history or a single day) and two phenomena (temperature or sea level). Novices overwhelmingly represented time moving from left-to-right, regardless of scale. In Study 3, 40 STEM novices were shown expert-made graphs where the direction of time varied. Novices had difficulty interpreting the expert-made graphs when time was represented moving in directions other than left-to-right. The study highlights the importance of considering representations of time and scale in STEM education and offers insights into how experts and novices approach visualizations. The findings inform the development of educational resources and strategies to improve students’ understanding of scientific concepts where time and space are intrinsically related.

     
  3. Abstract

    Communicating and interpreting uncertainty in ecological model predictions is notoriously challenging, motivating the need for new educational tools that introduce ecology students to core concepts in uncertainty communication. Ecological forecasting, an emerging approach to estimate future states of ecological systems with uncertainty, provides a relevant and engaging framework for introducing uncertainty communication to undergraduate students, as forecasts can be used as decision support tools for addressing real‐world ecological problems and are inherently uncertain. To provide critical training on uncertainty communication and introduce undergraduate students to the use of ecological forecasts for guiding decision‐making, we developed a hands‐on teaching module within the Macrosystems Environmental Data‐Driven Inquiry and Exploration (EDDIE; MacrosystemsEDDIE.org) educational program. Our module used an active learning approach by embedding forecasting activities in an R Shiny application to engage ecology students in introductory data science, ecological modeling, and forecasting concepts without needing advanced computational or programming skills. Pre‐ and post‐module assessment data from more than 250 undergraduate students enrolled in ecology, freshwater ecology, and zoology courses indicate that the module significantly increased students' ability to interpret forecast visualizations with uncertainty, identify different ways to communicate forecast uncertainty for diverse users, and correctly define ecological forecasting terms. Specifically, students were more likely to describe visual, numeric, and probabilistic methods of uncertainty communication following module completion. Students were also able to identify more benefits of ecological forecasting following module completion, with the key benefits of using forecasts for prediction and decision‐making most commonly described. These results show promise for introducing ecological model uncertainty, data visualizations, and forecasting into undergraduate ecology curricula via software‐based learning, which can increase students' ability to engage with and understand complex ecological concepts.

     
  4. Data visualization provides a powerful way for analysts to explore and make data-driven discoveries. However, current visual analytic tools provide only limited support for hypothesis-driven inquiry, as their built-in interactions and workflows are primarily intended for exploratory analysis. Visualization tools notably lack capabilities that would allow users to visually and incrementally test the fit of their conceptual models and provisional hypotheses against the data. This imbalance could bias users to overly rely on exploratory analysis as the principal mode of inquiry, which can be detrimental to discovery. In this paper, we introduce Visual (dis)Confirmation, a tool for conducting confirmatory, hypothesis-driven analyses with visualizations. Users interact by framing hypotheses and data expectations in natural language. The system then selects conceptually relevant data features and automatically generates visualizations to validate the underlying expectations. Distinctively, the resulting visualizations also highlight places where one's mental model disagrees with the data, so as to stimulate reflection. The proposed tool represents a new class of interactive data systems capable of supporting confirmatory visual analysis, and responding more intelligently by spotlighting gaps between one's knowledge and the data. We describe the algorithmic techniques behind this workflow. We also demonstrate the utility of the tool through a case study.
  5. Context. Inferences about dark matter, dark energy, and the missing baryons all depend on the accuracy of our model of large-scale structure evolution. In particular, with cosmological simulations in our model of the Universe, we trace the growth of structure, and visualize the build-up of bigger structures from smaller ones and of gaseous filaments connecting galaxy clusters. Aims. Here we aim to reveal the complexity of the large-scale structure assembly process in great detail and on scales from tens of kiloparsecs up to more than 10 Mpc with new sensitive large-scale observations from the latest generation of instruments. We also aim to compare our findings with expectations from our cosmological model. Methods. We used dedicated SRG/eROSITA performance verification (PV) X-ray, ASKAP/EMU Early Science radio, and DECam optical observations of a ~15 deg² region around the nearby interacting galaxy cluster system A3391/95 to study the warm-hot gas in cluster outskirts and filaments, the surrounding large-scale structure and its formation process, the morphological complexity in the inner parts of the clusters, and the (re-)acceleration of plasma. We also used complementary Sunyaev-Zeldovich (SZ) effect data from the Planck survey and custom-made Galactic total (neutral plus molecular) hydrogen column density maps based on the HI4PI and IRAS surveys. We relate the observations to expectations from cosmological hydrodynamic simulations from the Magneticum suite. Results. We trace the irregular morphology of warm and hot gas of the main clusters from their centers out to well beyond their characteristic radii, r₂₀₀. Between the two main cluster systems, we observe an emission bridge on large scale and with good spatial resolution. This bridge includes a known galaxy group but this can only partially explain the emission. Most gas in the bridge appears hot, but thanks to eROSITA’s unique soft response and large field of view, we discover some tantalizing hints for warm, truly primordial filamentary gas connecting the clusters. Several matter clumps physically surrounding the system are detected. For the “Northern Clump,” we provide evidence that it is falling towards A3391 from the X-ray hot gas morphology and radio lobe structure of its central AGN. Moreover, the shapes of these X-ray and radio structures appear to be formed by gas well beyond the virial radius, r₁₀₀, of A3391, thereby providing an indirect way of probing the gas in this elusive environment. Many of the extended sources in the field detected by eROSITA are also known clusters or new clusters in the background, including a known SZ cluster at redshift z = 1. We find roughly an order of magnitude more cluster candidates than the SPT and ACT surveys together in the same area. We discover an emission filament north of the virial radius of A3391 connecting to the Northern Clump. Furthermore, the absorption-corrected eROSITA surface brightness map shows that this emission filament extends south of A3395 and beyond an extended X-ray-emitting object (the “Little Southern Clump”) towards another galaxy cluster, all at the same redshift. The total projected length of this continuous warm-hot emission filament is 15 Mpc, running almost 4 degrees across the entire eROSITA PV observation field. The Northern and Southern Filament are each detected at >4σ. The Planck SZ map additionally appears to support the presence of both new filaments. Furthermore, the DECam galaxy density map shows galaxy overdensities in the same regions. Overall, the new datasets provide impressive confirmation of the theoretically expected structure formation processes on the individual system level, including the surrounding warm-hot intergalactic medium distribution; the similarities of features found in a similar system in the Magneticum simulation are striking. Our spatially resolved findings show that baryons indeed reside in large-scale warm-hot gas filaments with a clumpy structure.