skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on July 17, 2025

Title: New viruses are inevitable; pandemics are optional—Lessons for and from statistics
Abstract We explore ways in which statistics can be used to understand disease spread and support decision‐making by governments. “Past performance does not guarantee future results”—we hope. We discuss and show examples from the National Science Foundation (NSF)‐funded COVID‐Inspired Data Science Education through Epidemiology (CIDSEE) project. Throughout, the emphasis is on the relationships between evidence, modeling and theorizing, and appropriate action. Statistics should be an essential element in all these aspects. We point to some “big statistical ideas” that underpin the whole process of modeling, which can be illustrated vividly in the context of pandemics. We argue that statistics education should emphasize the application of statistics in practical situations, and that many curricula do not equip students to use their understandings of statistics outside the classroom. We offer a framework for curriculum analysis and point to some rich teaching resources.  more » « less
Award ID(s):
2313212
PAR ID:
10524345
Author(s) / Creator(s):
 ;  
Publisher / Repository:
Wiley-Blackwell
Date Published:
Journal Name:
Teaching Statistics
Volume:
46
Issue:
3
ISSN:
0141-982X
Format(s):
Medium: X Size: p. 132-140
Size(s):
p. 132-140
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The ever increasing popularity of machine learning methods in virtually all areas of science, engineering and beyond is poised to put established statistical modeling approaches into question. Environmental statistics is no exception, as popular constructs such as neural networks and decision trees are now routinely used to provide forecasts of physical processes ranging from air pollution to meteorology. This presents both challenges and opportunities to the statistical community, which could contribute to the machine learning literature with a model‐based approach with formal uncertainty quantification. Should, however, classical statistical methodologies be discarded altogether in environmental statistics, and should our contribution be focused on formalizing machine learning constructs? This work aims at providing some answers to this thought‐provoking question with two time series case studies where selected models from both the statistical and machine learning literature are compared in terms of forecasting skills, uncertainty quantification and computational time. Relative merits of both class of approaches are discussed, and broad open questions are formulated as a baseline for a discussion on the topic. 
    more » « less
  2. Abstract Machine learning (ML) has become commonplace in educational research and science education research, especially to support assessment efforts. Such applications of machine learning have shown their promise in replicating and scaling human‐driven codes of students' work. Despite this promise, we and other scholars argue that machine learning has not yet achieved its transformational potential. We argue that this is because our field is currently lacking frameworks for supporting creative, principled, and critical endeavors to use machine learning in science education research. To offer considerations for science education researchers' use of ML, we present a framework, Distributing Epistemic Functions and Tasks (DEFT), that highlights the functions and tasks that pertain to generating knowledge that can be carried out by either trained researchers or machine learning algorithms. Such considerations are critical decisions that should occur alongside those about, for instance, the type of data or algorithm used. We apply this framework to two cases, one that exemplifies the cutting‐edge use of machine learning in science education research and another that offers a wholly different means of using machine learning and human‐driven inquiry together. We conclude with strategies for researchers to adopt machine learning and call for the field to rethink how we prepare science education researchers in an era of great advances in computational power and access to machine learning methods. 
    more » « less
  3. The emphasis on an equitable vision of science learning in current science education reform efforts sees students as contributing to knowledge-building through drawing on their rich cultural and linguistic backgrounds while engaging in the three dimensions to make sense of compelling, relevant phenomena. However, this vision will not be fully realized without coherence between curriculum, instruction, and assessment. As a majority of states have now adopted standards aligned to or adapted from the Framework, we see an urgent need for assessments that can support rather than conflict with equitable science learning. In this study, we seek to understand the current state of Framework-aligned assessment tasks. We have amassed 352 middle school tasks, originating from state-level assessment banks and assessment developers at universities or research organizations. Our preliminary findings from characterizing 104 tasks revealed that the majority of tasks target dimensions of the NGSS or Framework-based standards and include a phenomenon. However, there are challenges in framing phenomena that attend to students’ interests and identities and engage students in three-dimensional sensemaking. Additionally, some phenomena are not based in real-world observations and are not authentic from students’ perspectives, which makes it difficult for students to see connections of local or global relevance. 
    more » « less
  4. With a call for schools to infuse data across the curriculum, many are creating curricula and examining students’ thinking in data-intensive problems. As the discipline of statistics education broadens to data science education, there is a need to examine how practices in data science can inform work in K-12. We synthesize literature about statistics investigation processes, data science as a field and practices of data scientists. Further, we provide results from an ethnographic and interview study of the work of data scientists. Together, these inform a new framework to support data investigation processes. We explicate the practices and dispositions needed and offer a glimpse of how the framework can be used to move the discipline of data science education forward. 
    more » « less
  5. Abstract We develop a new market‐making model, from the ground up, which is tailored toward high‐frequency trading under a limit order book (LOB), based on the well‐known classification of order types in market microstructure. Our flexible framework allows arbitrary order volume, price jump, and bid‐ask spread distributions as well as the use of market orders. It also honors the consistency of price movements upon arrivals of different order types. For example, it is apparent that prices should never go down on buy market orders. In addition, it respects the price‐time priority of LOB. In contrast to the approach of regular control on diffusion as in the classical Avellaneda and Stoikov (Quantitative Finance, 8, 217, 2008) market‐making framework, we exploit the techniques of optimal switching and impulse control on marked point processes, which have proven to be very effective in modeling the order book features. The Hamilton‐Jacobi‐Bellman quasi‐variational inequality (HJBQVI) associated with the control problem can be solved numerically via finite‐difference method. We illustrate our optimal trading strategy with a full numerical analysis, calibrated to the order book statistics of a popular exchanged‐traded fund (ETF). Our simulation shows that the profit of market‐making can be severely overstated under LOBs with inconsistent price movements. 
    more » « less