skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Operationalizing science literacy: an experimental analysis of measurement
Inequalities in scientific knowledge are the subject of increasing attention, so how factual science knowledge is measured, and any inconsistencies in said measurement, is extremely relevant to the field of science communication. Different operationalizations of factual science knowledge are used interchangeably in research, potentially resulting in artificially comparable knowledge levels among respondents. Here, we present data from an experiment embedded in an online survey conducted in the United States (N = 1,530) that examined the distribution of factual science knowledge responses on a 3- vs. 5-point response scale. Though the scale did not impact a summative knowledge index, significant differences emerged when knowledge items were analyzed individually or grouped based on whether the correct response was “true” or “false.” Our findings emphasize the necessity for communicators to consider the goals of knowledge assessment when making operationalization decisions.  more » « less
Award ID(s):
1906864
PAR ID:
10231799
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Journal of Science Communication
Volume:
19
Issue:
04
ISSN:
1824-2049
Page Range / eLocation ID:
A03
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Outreach and communication with the public have substantial value in polar research, in which studies often find changes of global importance that are happening far out of sight from the majority of people living at lower latitudes. Seeking evidence on the effectiveness of outreach programs, the U.S. National Science Foundation sponsored large-scale survey assessments before and after the International Polar Year in 2007/2008. Polar-knowledge questions have subsequently been tested and refined through other nationwide and regional surveys. More than a decade of such work has established that basic but fairly specific knowledge questions, with all answer choices sounding plausible but one being uniquely correct, can yield highly replicable results. Those results, however, paint a mixed picture of knowledge. Some factual questions seem to be interpreted by many respondents as if they had been asked for their personal beliefs about climate change, so their responses reflect sociopolitical identity rather than physical-world knowledge. Other factual questions, by design, do not link in obvious ways to climate-change beliefs—so responses have simpler interpretations in terms of knowledge gaps, and education needs. 
    more » « less
  2. null (Ed.)
    Science communicators have been encouraged to use humor in their online engagement efforts. Yet, humor’s effectiveness for engaging people with science remains an open question. We report the results of an experiment designed to elicit varied levels of mirth in respondents, which was positively associated with perceived likability of the communicator and motivation to follow more science on social media. Furthermore, mirth and perceived likability serially mediated the effect of the experimental manipulation on motivation and factual science knowledge served as a moderator. This indicates that, while humor might be an effective means of reaching audiences, downstream effects are likely to vary depending on individuals’ knowledge. 
    more » « less
  3. The capability to generate responses with diversity and faithfulness using factual knowledge is paramount for creating a human-like, trustworthy dialogue system. Common strategies either adopt a two-step paradigm, which optimizes knowledge selection and response generation separately and may overlook the inherent correlation between these two tasks, or leverage conditional variational method to jointly optimize knowledge selection and response generation by employing an inference network. In this paper, we present an end-to-end learning framework, termed Sequential Posterior Inference (SPI), capable of se- lecting knowledge and generating dialogues by approximately sampling from the posterior distribution. Unlike other methods, SPI does not require the inference network or assume a simple geometry of the posterior distribution. This straightforward and intuitive inference procedure of SPI directly queries the response generation model, allowing for accurate knowledge selection and generation of faithful responses. In addition to modeling contributions, our experimental results on two common dialogue datasets (Wizard of Wikipedia and Holl-E) demonstrate that SPI outperforms previous strong baselines according to both automatic and human evaluation metrics. 
    more » « less
  4. By design, large language models (LLMs) are static general-purpose models, expensive to retrain or update frequently. As they are increasingly adopted for knowledge-intensive tasks, it becomes evident that these design choices lead to failures to generate factual, relevant, and up-to-date knowledge. To this end, we propose Knowledge Card, a modular framework to plug in new factual and relevant knowledge into general-purpose LLMs. We first introduce knowledge cards---specialized language models trained on corpora from specific domains and sources. Knowledge cards serve as parametric repositories that are selected at inference time to generate background knowledge for the base LLM. We then propose three content selectors to dynamically select and retain information in documents generated by knowledge cards, specifically controlling for relevance, brevity, and factuality of outputs. Finally, we propose two complementary integration approaches to augment the base LLM with the (relevant, factual) knowledge curated from the specialized LMs. Through extensive experiments, we demonstrate that Knowledge Card achieves state-of-the-art performance on six benchmark datasets. Ultimately, Knowledge Card framework enables dynamic synthesis and updates of knowledge from diverse domains. Its modularity will ensure that relevant knowledge can be continuously updated through the collective efforts of the research community. 
    more » « less
  5. Abstract There is strong agreement in science teacher education of the importance of teachers' content knowledge for teaching (CKT), which includes their subject matter knowledge and their pedagogical content knowledge. However, there are limited instruments that can be easily administered and scored on a large scale to assess and study elementary science teachers' CKT. Such measures would support strategic monitoring of large groups of science teachers' CKT and the investigation of comparative questions about science teachers' CKT longitudinally across the professional continuum or across teacher education or professional development sites. To address this gap, this study focused on designing an automatically scorable summative assessment that can be used to measure preservice elementary teachers' (PSETs') CKT in one high‐leverage science content area: matter and its interactions. We conducted a field test of this CKT instrument with 822 PSETs from across the United States and used the response data to examine how this instrument functions as a potential tool for measuring PSETs' CKT in this science content area. Results suggest this instrument is reliable and can be used on large scale to support valid inferences about PSETs' CKT in this content area. In addition, the dimensionality analysis showed that all items measure a single construct of CKT about matter and its interactions, as participants did not show any differential performance by content topic or work of teaching science instructional tool categories. Implications for progressing the field's understanding of the nature of CKT and approaches to developing summative instruments to assess science teachers' CKT are discussed. 
    more » « less