With a call for schools to infuse data across the curriculum, many are creating curricula and examining students’ thinking in data-intensive problems. As the discipline of statistics education broadens to data science education, there is a need to examine how practices in data science can inform work in K-12. We synthesize literature about statistics investigation processes, data science as a field and practices of data scientists. Further, we provide results from an ethnographic and interview study of the work of data scientists. Together, these inform a new framework to support data investigation processes. We explicate the practices and dispositions needed and offer a glimpse of how the framework can be used to move the discipline of data science education forward.
more »
« less
What to consider when we consider data
Abstract The data sets used in statistics education have changed over time, from mathematically “well‐behaved” ones that facilitated computation, to more context‐rich sources and now, with the increasing influence of data science practices, to “found” data, often from open data sites. As data sources change, it is important for educators to take a fresh look at the ways we engage students in thinking about the processes that generated the data they encounter. The use of already collected data requires particular attention because many of the decisions that went into the processes of obtaining the data are hidden. Students need to learn to ask “Who, When, How, Where, and Why?” data were collected and to wonder if the data really measure what needs to be measured. Our advocacy in this paper is to deepen the educational treatment of data production to better reflect the current and future practice of statistics and data science.
more »
« less
- PAR ID:
- 10388961
- Publisher / Repository:
- Wiley-Blackwell
- Date Published:
- Journal Name:
- Teaching Statistics
- Volume:
- 43
- Issue:
- S1
- ISSN:
- 0141-982X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Data science is revolutionizing academia and industry, creating a high demand for a workforce fluent in this field. While the availability of data science courses has increased recently, few curricula rigorously build on mathematical logic. The LogicDS Project addresses this gap by engaging high school students from rural communities in an online data science course integrating mathematics, statistics, and programming into a unified framework based on logic and reasoning. A one-week course, consisting of six lessons, was developed and 110 participants were recruited. Pre- and post-intervention data, along with students' LMS activity logs, were collected to analyze engagement. Results indicate that the Logic-Based framework effectively engages students from diverse backgrounds, with participants finding the course valuable for learning data science skills. Notably, entropy analysis of student activity logs correlated with other mixed methods analyses, providing insights into engaging K-12 students in data science education.more » « less
-
Abstract BackgroundWhile previous work in higher education documents the impact of high tuition costs of attending graduate school as a key motivator in attrition decisions, in engineering, most graduate students are fully funded on research fellowships, indicating there are different issues causing individuals to consider departure. There has been little work characterizing nonfinancial costs for students in engineering graduate programs and the impact these costs may have on persistence or attrition. Purpose/HypothesisFramed through the lens of cost as a component of the expectancy–value theory framework and the graduate attrition decisions (GrAD) model conceptual framework specific to engineering attrition, the purpose of this article is to characterize the costs engineering graduate students associate with attending graduate school and document how costs affect students' decisions to persist or depart. Design/MethodData were collected through semistructured interviews with 42 engineering graduate students from R1 engineering doctoral programs across the United States who have considered, are currently considering, or have chosen to depart from their engineering PhD programs with a master's degree. ResultsIn addition to time and money, which are costs previously captured in research, participants identified costs to life balance, costs to well‐being, and identify‐informed opportunity costs framed in terms of what “could have been” if they had chosen to not go to graduate school. As these costs relate to persistence, students primarily identified their expended effort and already‐incurred costs as the primary motivator for persistence, rather than any expected benefits of a graduate degree. ConclusionThe findings of this work expand the cost component of the GrAD model conceptual framework, providing a deeper understanding of the costs that graduate students relate to their persistence in engineering graduate programs. It evidences that motivation to persist may not be due to particularly strong goals but may result from costs already incurred. Through this research, the scholarly community, students, advisors, and university policymakers can better understand the needs of engineering graduate students as they navigate graduate study.more » « less
-
Abstract We explore ways in which statistics can be used to understand disease spread and support decision‐making by governments. “Past performance does not guarantee future results”—we hope. We discuss and show examples from the National Science Foundation (NSF)‐funded COVID‐Inspired Data Science Education through Epidemiology (CIDSEE) project. Throughout, the emphasis is on the relationships between evidence, modeling and theorizing, and appropriate action. Statistics should be an essential element in all these aspects. We point to some “big statistical ideas” that underpin the whole process of modeling, which can be illustrated vividly in the context of pandemics. We argue that statistics education should emphasize the application of statistics in practical situations, and that many curricula do not equip students to use their understandings of statistics outside the classroom. We offer a framework for curriculum analysis and point to some rich teaching resources.more » « less
-
Abstract Teachers must know how to use language to support students in knowledge generation environments that align to the Next Generation Science Standards. To measure this knowledge, this study refines a survey on teachers’ knowledge of language as an epistemic tool. Rasch modelling was used to examine 15 items’ fit statistics and the functioning of a previously-designed questionnaire’s response categories. Cronbach’s alpha reliability was also examined. Additionally, interviews were used to investigate teachers’ interpretations of each item to identify ambiguous items. The results indicated that three ambiguous items were deleted based on qualitative data and three more items were deleted because of negative correlation and mismatched fit statistics. Finally, we present a revised language questionnaire with nine items and acceptable correlation and good fit statistics, with utility for science education researchers and teacher educators. This research contributes a revised questionnaire to measure teachers’ knowledge of language that could inform professional development efforts. This research also describes instrument refinement processes that could be applied elsewhere.more » « less
An official website of the United States government
