-
Benjamin Paaßen; Carrie Demmans Epp (Eds.) This paper was written with the help of ChatGPT. Recent advancements in the development and deployment of large generative language models to power generative AI tools, including OpenAI's ChatGPT, have led to their broad usage across virtually all fields of study. While these tools have been trained to generate human-like dialogue in response to questions or prompts, they are similarly used to compose larger, more complex artifacts, including social media posts, essays, and even research articles. Although this abstract has been written entirely by a human without any input, consultation, or revision from a generative language model, it would likely be difficult for a reader to discern any difference. In light of this, there is growing debate and concern regarding the use of these models to aid the writing process, particularly concerning publication. Aside from some notable risks, including the unintentional generation of false information, the citation of non-existent research articles, and plagiarism through generating text sampled from another source without proper citation, there are additional questions pertaining to the originality of ideas expressed in a work that has been partially written or revised by a generative language model. We present this paper both as a case study of the usage of generative models to aid in the writing of academic research articles and as an example of how transparency and open science practices may help address several issues that have been raised in other contexts and communities. While this paper neither attempts to promote nor contest the use of these language models in any writing task, the goal of this work is to provide insight and potential guidance into the ethical and effective usage of these models within this domain.
Free, publicly accessible full text available July 14, 2025.
-
We introduce a working approach that fine-tunes large language models (LLMs) to create augmented data for regression-based predictive models aimed at detecting at-risk students in online learning communities. This approach has the potential to leverage scarce data to improve urgency detection, and it also illustrates the role of artificial intelligence in enhancing the resilience of educational communities and ensuring timely interventions within online learning settings.
Free, publicly accessible full text available June 10, 2025.
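A minimal sketch of the general idea: a placeholder stands in for the fine-tuned generator, synthetic posts inherit their seed posts' labels, and the augmented corpus trains a regression model. The urgency scale, seed posts, and label-inheritance scheme are illustrative assumptions, not the authors' actual pipeline.

```python
# Sketch: augmenting scarce urgency-labeled posts with synthetic examples
# from a fine-tuned LLM, then training a regression model on the result.
# generate_synthetic_posts and the urgency scores are hypothetical.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge

def generate_synthetic_posts(seed_posts, n_samples):
    """Placeholder for sampling from an LLM fine-tuned on urgent posts."""
    # In practice: prompt the fine-tuned model and collect its completions.
    return [f"(synthetic variant of) {p}" for p in seed_posts][:n_samples]

# Scarce real data: forum posts with human-coded urgency scores.
real_posts = ["I still don't understand assignment 2", "When is the exam?"]
real_scores = [3.5, 1.0]

# Augment: synthetic posts inherit the labels of their seed posts.
synth_posts = generate_synthetic_posts(real_posts, n_samples=2)
posts = real_posts + synth_posts
scores = real_scores + real_scores[: len(synth_posts)]

# Train the urgency regressor on the augmented corpus.
vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(posts)
model = Ridge().fit(X, scores)
print(model.predict(vectorizer.transform(["Please help, I am completely lost"])))
```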
-
Benjamin Paaßen; Carrie Demmans Epp (Eds.) K-12 Computer Science (CS) education has seen remarkable growth recently, driven by the increasing focus on CS and Computational Thinking (CT) integration. Despite the abundance of professional development (PD) programs designed to prepare future CS teachers with the required knowledge and skills, there is a lack of research on how teachers' perceptions and attitudes toward CS and CT evolve before and after participating in these programs. To address this gap, our exploratory study examines the dynamics of pre- and in-service teachers' experiences, attitudes, and perceptions toward CS and CT through their participation in a K-12 CS education micro-credential program. In this study, we employed topic modeling to identify topics that emerged from teachers' written pre- and post-CS autobiographies, conducted statistical analysis to explore how these topics evolve over time, and applied regression analysis to investigate the factors influencing these dynamics. We observed a shift in teachers' initial feelings of fear, intimidation, and stress toward confidence, fun, and feeling competent in basic CS, reflecting a positive transformation. Regression analysis revealed that features such as experienced-teacher status and CT conceptual understanding correlate with participants' evolving views. These observed relationships highlight the micro-credential's role in not only enhancing technical competency but also fostering an adaptive, integrative pedagogical mindset, providing new insights for course design.
Free, publicly accessible full text available July 14, 2025.
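As a rough illustration of the analysis pipeline described above, the sketch below runs LDA over toy pre/post autobiographies and regresses one topic's shift on teacher features. The corpus, topic count, single-topic shift measure, and feature values are assumptions for demonstration only.

```python
# Sketch: topic modeling on teachers' pre/post CS autobiographies, then
# regressing topic shifts on teacher features. All data are toy values.
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.linear_model import LinearRegression

pre_texts = ["I feel intimidated by programming", "CS was stressful in college"]
post_texts = ["Coding is fun and I feel confident", "I enjoy teaching basic CS"]

# Fit one topic model over all autobiographies so topics are comparable.
vectorizer = CountVectorizer(stop_words="english")
X = vectorizer.fit_transform(pre_texts + post_texts)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)

# Per-teacher topic proportions before and after the micro-credential.
theta = lda.transform(X)
pre_theta, post_theta = theta[: len(pre_texts)], theta[len(pre_texts):]
shift = post_theta[:, 0] - pre_theta[:, 0]  # change in one topic's weight

# Regress the shift on teacher features (e.g., experienced-teacher status,
# CT conceptual understanding) -- toy values here.
features = np.array([[1, 0.8], [0, 0.4]])
reg = LinearRegression().fit(features, shift)
print(reg.coef_)
```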
-
Benjamin Paaßen; Carrie Demmans Epp (Eds.) With the support of digital learning platforms, synchronous and collaborative learning has become a prominent learning paradigm in mathematics education. Computer-Supported Collaborative Learning (CSCL) has emerged as a valuable tool for enhancing mathematical discourse, problem solving, and ultimately learning outcomes. This paper presents an innovative examination of Graspable Math (GM), a dynamic mathematics notation and online learning platform, to enable synchronous, collaborative learning between pairs of students. By analyzing students' online log data, we adopt a data-driven method to better understand the intricate dynamics of collaborative learning in mathematics as it happens. Specifically, we apply frequency distributions and cluster analysis to characterize students' dynamic interaction patterns and identify distinctive profiles of collaboration. Our findings reveal several collaboration profiles that emerge through these analyses. This research not only bridges the gap in current CSCL tools for mathematics, but also provides empirical insights into the effective design and implementation of such tools. The insights gained from this research offer implications for the design of digital learning tools that support effective and engaging collaborative learning experiences.
Free, publicly accessible full text available July 14, 2025.
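A hedged sketch of what such a data-driven profiling step might look like: k-means over per-pair interaction features derived from logs. The feature set and cluster count are illustrative assumptions, not the paper's actual analysis.

```python
# Sketch: clustering per-pair interaction features from collaboration logs
# to surface collaboration profiles. Features and data are hypothetical.
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans

# One row per student pair: [actions per minute, share of turn-taking
# events, ratio of partner A's actions to the pair total].
pair_features = np.array([
    [12.0, 0.60, 0.52],
    [3.5, 0.10, 0.95],
    [11.0, 0.55, 0.48],
    [4.0, 0.15, 0.90],
])

# Standardize so no single feature dominates the distance metric.
X = StandardScaler().fit_transform(pair_features)

# k = 2 is chosen for illustration; in practice it would be selected with,
# e.g., silhouette scores across candidate values.
profiles = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print(profiles)  # e.g., balanced-collaborative vs. one-sided pairs
```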
-
Prior work analyzing tutoring sessions provided evidence that highly effective tutors, through their interaction with students and their experience, can perceptively recognize incorrect processes or “bugs” when students incorrectly answer problems. Researchers have studied these tutoring interactions, examining instructional approaches to address incorrect processes, and observed that the format of the feedback can influence learning outcomes. In this work, we recognize the incorrect answers caused by these buggy processes as Common Wrong Answers (CWAs). We examine the ability of teachers and instructional designers to identify CWAs proactively. Because teachers and instructional designers deeply understand the common approaches and mistakes students make when solving mathematical problems, we examine the feasibility of proactively identifying CWAs and generating Common Wrong Answer Feedback (CWAFs) as a formative feedback intervention for addressing student learning needs. We analyze CWAFs in three sets of analyses. We first report on the accuracy of the CWAs predicted by the teachers and instructional designers on the problems across two activities. We then measure the effectiveness of the CWAFs using an intent-to-treat analysis. Finally, we explore the existence of personalization effects of the CWAFs for the students working on the two mathematics activities.
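For readers unfamiliar with intent-to-treat analysis, the sketch below shows the basic shape of such a comparison on toy data: students are analyzed by their assigned condition regardless of whether the feedback actually fired. All values and column meanings are illustrative assumptions, not the study's data.

```python
# Sketch: an intent-to-treat (ITT) comparison of next-problem correctness
# for students assigned CWAF feedback vs. control. Toy data throughout.
import numpy as np
from scipy import stats

# 1 = assigned to receive CWAFs on the activity, 0 = control.
assigned = np.array([1, 1, 1, 0, 0, 0])
# Outcome: next-problem correctness (1 = correct).
correct = np.array([1, 1, 0, 1, 0, 0])

# ITT: compare by assignment, not by whether feedback was triggered.
treat, control = correct[assigned == 1], correct[assigned == 0]
itt_effect = treat.mean() - control.mean()
t, p = stats.ttest_ind(treat, control)
print(f"ITT effect estimate: {itt_effect:.2f} (p = {p:.2f})")
```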
-
Randomized controlled trials (RCTs) admit unconfounded design-based inference – randomization largely justifies the assumptions underlying statistical effect estimates – but often have limited sample sizes. However, researchers may have access to big observational data on covariates and outcomes from RCT nonparticipants. For example, data from A/B tests conducted within an educational technology platform exist alongside historical observational data drawn from student logs. We outline a design-based approach to using such observational data for variance reduction in RCTs. First, we use the observational data to train a machine learning algorithm predicting potential outcomes using covariates and then use that algorithm to generate predictions for RCT participants. Then, we use those predictions, perhaps alongside other covariates, to adjust causal effect estimates with a flexible, design-based covariate-adjustment routine. In this way, there is no danger of biases from the observational data leaking into the experimental estimates, which are guaranteed to be exactly unbiased regardless of whether the machine learning models are “correct” in any sense or whether the observational samples closely resemble RCT samples. We demonstrate the method in analyzing 33 randomized A/B tests and show that it decreases standard errors relative to other estimators, sometimes substantially.
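A minimal sketch of the estimator described in the abstract: fit a predictor on observational data only, residualize RCT outcomes against its predictions, and difference the residual means by arm. Because the predictor never sees RCT outcomes, the adjustment cannot leak bias into the estimate. The data, model choice (gradient boosting), and effect size are assumptions for illustration.

```python
# Sketch: design-based variance reduction using predictions trained on
# observational data from RCT nonparticipants. All data are simulated.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(0)

# Large observational sample: covariates and outcomes from non-participants.
X_obs = rng.normal(size=(5000, 4))
y_obs = X_obs @ np.array([1.0, 0.5, -0.3, 0.2]) + rng.normal(size=5000)

# Small RCT: covariates, random assignment, outcomes with true effect 0.4.
X_rct = rng.normal(size=(200, 4))
z = rng.integers(0, 2, size=200)
y_rct = X_rct @ np.array([1.0, 0.5, -0.3, 0.2]) + 0.4 * z + rng.normal(size=200)

# Step 1: learn to predict outcomes from covariates, outside the RCT.
model = GradientBoostingRegressor().fit(X_obs, y_obs)

# Step 2: residualize RCT outcomes against the predictions, then
# difference the residual means across arms.
resid = y_rct - model.predict(X_rct)
effect = resid[z == 1].mean() - resid[z == 0].mean()
print(f"adjusted effect estimate: {effect:.2f}")
```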
-
The development and application of deep learning methodologies has grown within educational contexts in recent years. Perhaps attributable, in part, to the large amount of data made available through the adoption of computer-based learning systems in classrooms and larger-scale MOOC platforms, many educational researchers are leveraging a wide range of emerging deep learning approaches to study learning and student behavior in various capacities. Variations of recurrent neural networks, for example, have been used not only to predict learning outcomes but also to study sequential and temporal trends in student data; it is commonly believed that they are able to learn high-dimensional representations of learning and behavioral constructs over time, such as the evolution of a student's knowledge state while working through assigned content. Recent works, however, have started to dispute this belief, instead finding that it may be the model's complexity that leads to improved performance in many prediction tasks and that these methods may not inherently learn these temporal representations through model training. In this work, we explore these claims further in the context of detectors of student affect, as well as expanding on existing work that explored benchmarks in knowledge tracing. Specifically, we observe how well fully trained models perform compared to deep learning networks where training is applied only to the output layer. While the highest results of prior works utilizing trained recurrent models are found to be superior, our untrained versions perform comparably well, outperforming even previous non-deep-learning approaches.
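A hedged sketch of the comparison condition described above, in PyTorch: the recurrent layer keeps its random initialization and only the output layer is fit. Dimensions, data, and the binary affect label are illustrative assumptions, not the study's setup.

```python
# Sketch: an "untrained" recurrent model -- LSTM weights stay at their
# random initialization; only the output layer receives gradient updates.
import torch
import torch.nn as nn

torch.manual_seed(0)

lstm = nn.LSTM(input_size=8, hidden_size=32, batch_first=True)
for p in lstm.parameters():
    p.requires_grad = False  # freeze: the recurrent layer is never trained

head = nn.Linear(32, 1)  # the only trained component
opt = torch.optim.Adam(head.parameters(), lr=1e-2)
loss_fn = nn.BCEWithLogitsLoss()

# Toy data: 64 students x 20 time steps x 8 interaction features,
# with a binary affect label per student.
X = torch.randn(64, 20, 8)
y = torch.randint(0, 2, (64, 1)).float()

for _ in range(100):
    with torch.no_grad():
        out, _ = lstm(X)          # fixed, random temporal encoding
    logits = head(out[:, -1, :])  # read out from the final hidden state
    loss = loss_fn(logits, y)
    opt.zero_grad()
    loss.backward()
    opt.step()
print(float(loss))
```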
-
As computer-based learning platforms have become ubiquitous, there is a growing need to better support teachers. Particularly in mathematics, teachers often rely on open-ended questions to assess students' understanding. While prior works focusing on the development of automated open-ended work assessments have demonstrated their potential, many of those methods require large amounts of student data to make reliable estimates. We explore whether a problem-specific automated scoring model could benefit from auxiliary data collected from similar problems to address this “cold start” problem. We examine factors such as sample size and the magnitude of similarity of the utilized problem data. We find that the use of data from similar problems not only provides benefits by increasing sample size to improve predictive performance, but also leads to greater overall model performance than using data solely from the original problem when sample size is held constant.
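One plausible reading of this setup, sketched below: pool labeled responses from similar problems with the target problem's scarce data, down-weighting the auxiliary examples. The weighting scheme and data are assumptions, not the paper's method.

```python
# Sketch: addressing the scoring "cold start" by pooling labeled responses
# from similar problems with the target problem's scarce data.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

# Scarce labeled responses for the target problem (1 = correct).
target_texts = ["add the two numerators", "multiply across"]
target_labels = [1, 0]

# Auxiliary labeled responses drawn from similar problems.
aux_texts = ["sum the numerators, keep the denominator", "cross multiply"]
aux_labels = [1, 0]

texts = target_texts + aux_texts
labels = target_labels + aux_labels
# Down-weight auxiliary examples relative to target-problem examples.
weights = [1.0] * len(target_texts) + [0.5] * len(aux_texts)

vec = TfidfVectorizer()
X = vec.fit_transform(texts)
clf = LogisticRegression().fit(X, labels, sample_weight=weights)
print(clf.predict(vec.transform(["add numerators and keep denominator"])))
```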
-
Prior works have led to the development and application of automated assessment methods that leverage machine learning and natural language processing. The performance of these methods has often been reported as positive, but other prior works have identified aspects in which they may be improved. Particularly in the context of mathematics, the presence of non-linguistic characters and expressions has been identified as contributing to observed model error. In this paper, we build upon this prior work by examining a developed automated assessment model for open-response questions in mathematics. We develop a new approach, which we call the “Math Term Frequency” (MTF) model, to address the issue caused by the presence of non-linguistic terms, and ensemble it with the previously developed assessment model. We observe that the inclusion of this approach notably improves model performance, and we present an example of how error analyses can be leveraged in practice to address model limitations.
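A speculative sketch of an MTF-style component and a simple probability-averaging ensemble; the math-token pattern, toy data, and averaging scheme are assumptions rather than the paper's implementation.

```python
# Sketch: featurize responses by the frequency of non-linguistic math
# tokens, then average the resulting model's class probabilities with a
# text-based assessment model. Data and token pattern are hypothetical.
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression

responses = ["x + 2 = 5 so x = 3", "the answer is three", "x = 4 since x + 2 = 5"]
labels = [1, 1, 0]

# MTF-style model: count only non-linguistic tokens
# (numbers, operators, lone variable letters).
math_vec = CountVectorizer(token_pattern=r"\b[0-9]+\b|[+\-*/=^()]|\b[a-z]\b")
X_math = math_vec.fit_transform(responses)
mtf = LogisticRegression().fit(X_math, labels)

# Baseline text model over ordinary word features.
text_vec = TfidfVectorizer()
X_text = text_vec.fit_transform(responses)
text_clf = LogisticRegression().fit(X_text, labels)

def ensemble_proba(texts):
    """Average the two models' class probabilities."""
    p1 = mtf.predict_proba(math_vec.transform(texts))
    p2 = text_clf.predict_proba(text_vec.transform(texts))
    return (p1 + p2) / 2.0

print(ensemble_proba(["x = 3 because x + 2 = 5"]))
```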