Despite growing calls to develop data science students’ ethical awareness and expand human-centered approaches to data science education, introductory courses in the field remain largely technical. A new interdisciplinary data science program aims to merge STEM and humanities perspectives starting at the very beginning of the data science curriculum. Existing literature suggests that humanities integration can make STEM courses more appealing to a wider range of students, including women and students of color, and enhance student learning of essential concepts and foundational reasoning skills, such as those collectively known as data acumen. Cultivating students’ data acumen requires a more inclusive vision of how the knowledge and insights generated through computational methods and statistical analysis relates to other ways of knowing. 
                        more » 
                        « less   
                    
                            
                            Collections Education: The Extended Specimen and Data Acumen
                        
                    
    
            Abstract Biodiversity scientists must be fluent across disciplines; they must possess the quantitative, computational, and data skills necessary for working with large, complex data sets, and they must have foundational skills and content knowledge from ecology, evolution, taxonomy, and systematics. To effectively train the emerging workforce, we must teach science as we conduct science and embrace emerging concepts of data acumen alongside the knowledge, tools, and techniques foundational to organismal biology. We present an open education resource that updates the traditional plant collection exercise to incorporate best practices in twenty-first century collecting and to contextualize the activities that build data acumen. Students exposed to this resource gained skills and content knowledge in plant taxonomy and systematics, as well as a nuanced understanding of collections-based data resources. We discuss the importance of the extended specimen in fostering scientific discovery and reinforcing foundational concepts in biodiversity science, taxonomy, and systematics. 
        more » 
        « less   
        
    
    
                            - PAR ID:
- 10336284
- Date Published:
- Journal Name:
- BioScience
- Volume:
- 72
- Issue:
- 2
- ISSN:
- 0006-3568
- Page Range / eLocation ID:
- 177 to 188
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            As demand grows for job-ready data science professionals, there is increasing recognition that traditional training often falls short in cultivating the higher-order reasoning and real-world problem-solving skills essential to the field. A foundational step toward addressing this gap is the identification and organization of knowledge components (KCs) that underlie data science problem solving (DSPS). KCs represent conditional knowledge—knowing about appropriate actions given particular contexts or conditions—and correspond to the critical decisions data scientists must make throughout the problem-solving process. While existing taxonomies in data science education support curriculum development, they often lack the granularity and focus needed to support the assessment and development of DSPS skills. In this paper, we present a novel framework that combines the strengths of large language models (LLMs) and human expertise to identify, define, and organize KCs specific to DSPS. We treat LLMs as ``knowledge engineering assistants" capable of generating candidate KCs by drawing on their extensive training data, which includes a vast amount of domain knowledge and diverse sets of real-world DSPS cases. Our process involves prompting multiple LLMs to generate decision points, synthesizing and refining KC definitions across models, and using sentence-embedding models to infer the underlying structure of the resulting taxonomy. Human experts then review and iteratively refine the taxonomy to ensure validity. This human-AI collaborative workflow offers a scalable and efficient proof-of-concept for LLM-assisted knowledge engineering. The resulting KC taxonomy lays the groundwork for developing fine-grained assessment tools and adaptive learning systems that support deliberate practice in DSPS. Furthermore, the framework illustrates the potential of LLMs not just as content generators but as partners in structuring domain knowledge to inform instructional design. Future work will involve extending the framework by generating a directed graph of KCs based on their input-output dependencies and validating the taxonomy through expert consensus and learner studies. This approach contributes to both the practical advancement of DSPS coaching in data science education and the broader methodological toolkit for AI-supported knowledge engineering.more » « less
- 
            Societal Impact StatementThe practice of writing science blogs benefits both the scientist and society alike by providing professional development opportunities and delivering information in a format that is accessible to large and diverse audiences. By designing a project that introduced upper‐level undergraduate students to science blog writing with a focus on plant biology, we piqued students' interest in science writing and the content of a popular plant science blog website. If adopted more widely, this work could broaden the scope of science education and promote the development of effective science communication skills for the next generation of scientists. SummarySuccessful scientists must communicate their research to broad audiences, including distilling key scientific concepts for the general public. Students pursuing careers in Science, Technology, Engineering, and Mathematics (STEM) fields benefit from developing public communication skills early in their careers, but opportunities are limited in traditional biology curricula.We created the “Plant Science Blogging Project” for a Plant Biology undergraduate course at the University of Pittsburgh in Fall 2018 and 2019. Students wrote blog posts merging personal connections with plants with plant biology concepts for the popular science blogsPlant Love StoriesandEvoBites. By weaving biology into their narratives, students learned how to share botanical knowledge with the general public.The project had positive impacts on student learning and public engagement. In post‐assignment surveys, the majority of students reported that they enjoyed the assignment, felt it improved their understanding of plant biology, and piqued their interest in reading and writing science blogs in the future. Approximately one‐third of the student‐authored blogs were published, including two that rose to the top 10 most‐read posts on Plant Love Stories. Some dominant themes in student blogs, including medicine and culture, differed from common story themes published on the web, indicating the potential for students to diversify science blog content.Overall, the Plant Science Blogging Project allows undergraduate students to engage with plant biology topics in a new way, sharpen their scientific communication skills in accordance with today's world of mass information sharing, and contribute to the spread of scientific knowledge for public benefit.more » « less
- 
            Systematics provides the foundational knowledge about the units of biodiversity, i.e., species, and how we classify them. The results of this discipline extend across Biology and can have important impacts on conservation. Here we review the systematic and taxonomic practices within Theraphosidae over the last 260 years. We examine the rate of newly described species and investigate the contemporary practices being used in the description of new genera and species. There have been two large waves of theraphosid taxonomy, with an explosive growth of newly described species and author combinations in the last 60 years. We look back and find that during 2010–2024 contemporary practices in theraphosid systematics and taxonomy have remained largely static, being dominated by morphology-based approaches. Over this period, only 10% of newly described species incorporated DNA data or explicitly stated the species concept used. Similarly for genera, only five of the 37 newly described genera over that time were supported as distinct and monophyletic by DNA. We highlight the taxonomic movement of species among Theraphosidae, Barychelidae, and Paratropididae; however, given the limited molecular sampling for the two latter families, the boundaries of these families remain a significant area of needed research. To promote inclusivity, we provide a copy of this paper in Spanish as supplementary material.more » « less
- 
            ACM (Ed.)Early computer science courses (CS1, CS2) are the cornerstone of student understanding of computer science. These courses introduce the foundational knowledge of computer science needed to understand more complex topics and to be successful in follow-on courses. It is thus important to introduce CS concepts in an engaging and easy-to-understand manner to increase student interest and retention. This paper presents a new approach to teaching the Computer Science 1 (CS1) course through our BRIDGES system. This approach aims to increase student engagement and improve learning outcomes by using audio-based assignments that they can manipulate and process audio signal information, as well as visualize and play them. We explain how to design and implement audiobased assignments and connect them to fundamental programming constructs such as variables, control flow, and simple data structures, such as arrays. These assignments encourage and engage students by using audio data they are interested in to write code, promoting problem-solving and improvements in their critical thinking skills.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    