Title: Question-Generating Datasets: Facilitating Data Transformation of Official Statistics for Broad Citizenry Decision-Making
Citizenry decision-making relies on data for informed actions, and official statistics provide much of the data needed for these decisions. However, the wide, distributed, and diverse datasets available from official statistics remain hard to access, scrutinise and manipulate, especially for non-experts. As a result, the complexity of official statistical databases creates barriers to broader access, often rendering the data non-actionable or irrelevant at the speed at which decisions are made in social and public life. To address this problem, this paper proposes an approach to automatically generating basic, factual questions from an existing dataset of official statistics. The question-generating process, currently instantiated for geospatial data, starts from a raw dataset and gradually builds toward formulating and presenting users with examples of questions that the dataset can answer, and for which geographic units. This approach exemplifies a novel paradigm of question-first data rendering, where questions, rather than data tables, are used as human-centred and relevant access points to explore, manipulate, navigate and cross-link data to support decision making. This approach can automate time-consuming aspects of data transformation and facilitate broader access to data.
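The question-first idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the sample table, the column names, the `generate_questions` helper and the question template are all hypothetical, and the sketch only assumes a table with one geographic-unit column and one or more measure columns.

```python
import csv
import io

# Hypothetical sample data; the column names and values are illustrative only.
SAMPLE_CSV = """county,population,median_income
Alpha County,1000,50000
Beta County,2500,
"""


def generate_questions(csv_text, geo_column):
    """Return (question, answer) pairs for every non-empty measure cell.

    geo_column names the column holding the geographic unit; every other
    column is treated as a measure that the dataset can answer for that unit.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    questions = []
    for row in reader:
        unit = row[geo_column]
        for column, value in row.items():
            # Skip the geographic column itself and cells with no data,
            # so that only answerable questions are generated.
            if column == geo_column or value is None or value.strip() == "":
                continue
            label = column.replace("_", " ")
            questions.append((f"What is the {label} in {unit}?", value.strip()))
    return questions


if __name__ == "__main__":
    for question, answer in generate_questions(SAMPLE_CSV, "county"):
        print(question, "->", answer)
```

Because empty cells are skipped, each geographic unit is only offered the questions the dataset can actually answer for it, which mirrors the abstract's point about showing "for which geographic units" a question applies.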
Award ID(s):
1934942
PAR ID:
10213584
Author(s) / Creator(s):
; ;
Editor(s):
Domenech, Josep; Vicente, María Rosalía
Date Published:
Journal Name:
International Conference on Advanced Research Methods and Analytics
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    In the past decade, there has been a surge of interest in using games derived from experimental economics to test decision-making behaviour across species. In most cases, researchers are using the games as a tool, for instance, to understand what factors influence decision-making, how decision-making differs across species or contexts, or to ask broader questions about species’ propensities to cooperate or compete. These games have been quite successful in this regard. To what degree, however, do these games tap into species' economic decision-making? For the purpose of understanding the evolution of economic systems in humans, this is the key question. To study this, we can break economic decision-making down into smaller components, each of which is a potential step in the evolution of human economic behaviour. We can then use data from economic games, which are simplified, highly structured models of decision-making and therefore ideal for the comparative approach, to directly compare these components across species and contexts, as well as in relation to more naturalistic behaviours, to better understand the evolution of economic behaviour and the social and ecological contexts that influenced it. The comparative approach has successfully informed us about the evolution of other complex traits, such as language and morality, and should help us more deeply understand why and how human economic systems evolved. This article is part of the theme issue ‘Existence and prevalence of economic behaviours among non-human primates’. 
  2. We hypothesize that for disaster risk mitigation, many households, despite being aware of their risk and possible mitigation actions, never seriously consider doing anything about them. In mitigation-focused decisions, since there is no equivalent to warning messages, the decision process is likely to evolve over an extended time. We explore what activates hurricane mitigation protective action decisions through three research questions: (1) to what extent are homeowners unengaged in protective action decision making? (2) what homeowner characteristics are associated with lack of engagement? and (3) to what extent do different life events trigger engagement in the decision-making process? We use the Precaution Adoption Process Model to conceptualize engagement as distinct from decision making; the broader protective action decision-making literature to explore drivers of engagement; and Life Course Theory to examine potential transitions from unengaged to engaged. We use survey data from homeowners in North Carolina to examine these questions empirically. Findings suggest that one-third of respondents had never engaged in protective action decisions, that life experiences differ in their occurrence frequency and effect on households’ mitigation decisions, and that some events, such as renovating, reroofing, or purchasing a home, may offer critical moments that could be leveraged to encourage greater engagement in mitigation decision making.
  3. Contemporary wildlife disease management is complex because managers need to respond to a wide range of stakeholders, multiple uncertainties, and difficult trade‐offs that characterize the interconnected challenges of today. Despite general acknowledgment of these complexities, managing wildlife disease tends to be framed as a scientific problem, in which the major challenge is lack of knowledge. The complex and multifactorial process of decision‐making is collapsed into a scientific endeavor to reduce uncertainty. As a result, contemporary decision‐making may be oversimplified, rely on simple heuristics, and fail to account for the broader legal, social, and economic context in which the decisions are made. Concurrently, scientific research on wildlife disease may be distant from this decision context, resulting in information that may not be directly relevant to the pertinent management questions. We propose reframing wildlife disease management challenges as decision problems and addressing them with decision analytical tools to divide the complex problems into more cognitively manageable elements. In particular, structured decision‐making has the potential to improve the quality, rigor, and transparency of decisions about wildlife disease in a variety of systems. Examples of management of severe acute respiratory syndrome coronavirus 2, white‐nose syndrome, avian influenza, and chytridiomycosis illustrate the most common impediments to decision‐making, including competing objectives, risks, prediction uncertainty, and limited resources.
  4. The security of printed circuit boards (PCBs) has become increasingly vital as supply chain vulnerabilities, including tampering, present significant risks to electronic systems. While detecting tampering on a PCB is the first step for verification, forensics is also needed to identify the modified component. One non-invasive and reliable PCB tamper detection technique with global coverage is the impedance characterization of the PCB's power delivery network (PDN). However, it is an open question whether one can use the two-dimensional impedance signatures for forensics purposes. In this work, we introduce a novel PCB forensics approach, using explainable AI (XAI) on impedance signatures. Through extensive experiments, we replicate various PCB tamper events, generating a dataset used to develop an XAI algorithm capable of not only detecting tampering but also explaining why the algorithm makes a decision about whether a tamper event has happened. At the core of our XAI algorithm is a random forest classifier with an accuracy of 96.7%, sufficient to explain the algorithm's decisions. To understand the behavior of the classifier in the decision-making process, we used SHAP values as an XAI tool to determine which frequency component most influences the classifier's decision for a particular class. This approach enhances detection capabilities and advances the verifier's ability to reverse-engineer and analyze two-dimensional impedance signatures for forensics.
  5. Domenech, Josep; Vicente, María Rosalía (Ed.)
    The accessibility of official statistics to non-expert users could be improved by applying natural language processing and deep learning models to dataset lexicons. Specifically, the semantic structure of FIPS codes offers a relatively standardized data dictionary of column names and string variable structure: two digits for states, followed by three digits for counties. The technical, methodological contribution of this paper is a bibliometric analysis of scientific publications based on FIPS code analysis, which indicated that between 27,954 and 1,970,000 publications attend to this geo-identifier. Within a single dataset reporting nationally representative, longitudinal survey data, 141 publications utilize FIPS data. This high incidence shows the research impact. Yet the low proportion, only 2.0 percent of all publications utilizing this dataset, also shows a gap even among expert users. A data use case drawn from public health data implies that cracking the code of geo-identifiers could advance access by helping everyday users formulate data inquiries in intuitive language.
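The FIPS structure mentioned in item 5 (two digits for the state, followed by three digits for the county) can be sketched as a small parsing helper. This is an illustrative assumption rather than code from the paper: the `split_county_fips` name is hypothetical, and the zero-padding step assumes the common situation where a code was read as an integer and lost its leading zero.

```python
def split_county_fips(code):
    """Split a five-digit county FIPS code into (state, county) strings.

    Accepts a string or an integer. Codes shorter than five characters are
    left-padded with zeros, on the assumption that leading zeros were lost
    when the column was read as a number.
    """
    text = str(code).strip().zfill(5)
    if len(text) != 5 or not text.isdigit():
        raise ValueError(f"not a five-digit county FIPS code: {code!r}")
    # First two digits identify the state, the last three the county.
    return text[:2], text[2:]


if __name__ == "__main__":
    print(split_county_fips("37183"))
    print(split_county_fips(1001))
```

Keeping both parts as strings rather than integers preserves the leading zeros, which matters when the pieces are later joined to other tables keyed on the same identifier.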