NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

9th Educational Data Mining in Computer Science Education (CSEDM) Workshop

https://doi.org/10.5281/zenodo.15870308

Akram, Bita; Shi, Yang; Brusilovsky, Peter; Price, Thomas; Koedinger, Ken; Carvalho, Paulo; Zhang, Shan; Lan, Andrew; Leinonen, Juho (July 2025, Proceedings of 18th International Conference on Educational Data Mining (EDM 2025), International Educational Data Mining Society)
Mills, Caitlin; Alexandron, Giora; Taibi, Davide; Lo_Bosco, Giosuè; Paquette, Luc (Ed.)
There is a growing community of researchers at the intersection- tion of data mining, AI, and computing education research. The objective of the CSEDM workshop is to facilitate a dis- Discussion among this research community, with a focus on how data mining can be uniquely applied in computing ed- ucation research. For example, what new techniques are needed to analyze program code and CS log data? How do results from CS education inform our analysis of this data? The workshop is meant to be an interdisciplinary event at the intersection of EDM and Computing Education Research. Researchers, faculty, and students are encouraged to share their AI- and data-driven approaches, methodological- gies, and experiences where data transforms how students learn Computer Science (CS) skills. This full-day workshop will feature paper presentations and discussions to promote collaboration.
more » « less
Free, publicly-accessible full text available July 20, 2026
Example Explorers and Persistent Finishers: Exploring Student Practice Behaviors in a Python Practice System

Poh, Alison; Hridi, Anurata; Barria-Pineda, Jordan; Brusilovsky, Peter; Akram, Bita (July 2025, Proceedings of 9th Educational Data Mining in Computer Science Education (CSEDM) Workshop at EDM 2025)

Understanding student practice behavior and its connection to their learning is essential for effective recommender systems that provide personalized learning support. In this study, we apply a sequential pattern mining approach to analyze student practice behavior in a practice system for introductory Python programming. Our goal is to identify different types of practice behavior and connect them to student performance. We examine two types of practice sequences: (1) by login session and (2) by learning topic. For each sequence type, we use SPAM (Sequential PAttern Mining) to identify the most frequent micro-patterns and build behavior profiles of individual learners as vectors of micro-pattern frequencies observed in their behavior. We confirm that these vectors are stable for both sequence types (p < 0.03 for session sequences and p < 0.003 for topic sequences). Using the vectors, we perform K-means clustering where we identify two practice behaviors: example explorers and persistent finishers. We repeat this experiment using different coding approaches for student sequences and obtain similar clusters. Our results suggest that example explorers and persistent finishers might represent two typical types of divergent student behaviors in a programming practice system. Finally, to better understand the relationship between students' background knowledge, learning outcomes, and practice behavior, we perform statistical analyses to assess the significance of the associations among pre-test scores, cluster assignments, and final course grades.
more » « less
Free, publicly-accessible full text available July 20, 2026
Can Motivated Students Do More Activities?

Lekshmi-Narayanan, Arun-Balajiee; Asher, Michael W; Brusilovsky, Peter; Carvalho, Paulo (July 2025, 9th Educational Data Mining in Computer Science Education (CSEDM) Workshop at EDM2025)
Akram, Bita; Shi, Yang; Brusilovsky, Peter; Price, Thomas; Koedinger, Ken; Carvalho, Paulo; Zhang, Shan; Lan, Andrew; Leinonen, Juho (Ed.)
The “Doer Effect” is the empirical phenomenon observed as a stronger correlational relationship between students who complete more activities and their course learning outcomes compared to those who complete fewer activities or watch fewer videos. In this paper, we extended prior evidence of a “Doer Effect” to investigate how doing more can be related not only to better learning outcomes but also to motivational ones. Specifically, we investigated persistence as the student’s willingness to continue working on course activities. We used secondary analyses of data from MOOC that taught Advanced Placement (AP) Introductory Java Programming to high school students using the digital textbook platform RuneStone. Although we failed to identify a doer effect in learning outcomes, our analyses do suggest that completing more activities is related to longer persistence in the course than reading more pages or watching more videos. This effect does not appear to be limited to highly motivated students.
more » « less
Free, publicly-accessible full text available July 19, 2026
Integrating Expert Knowledge With Automated Knowledge Component Extraction for Student Modeling

https://doi.org/10.1145/3699682.3728348

Sampaio_de_Alencar, Rafaella; Demirtas, Mehmet Arif; Saha, Adittya Soukarjya; Shi, Yang; Brusilovsky, Peter (June 2025, 33rd ACM Conference on User Modeling, Adaptation and Personalization)

Knowledge tracing is a method to model students’ knowledge and enable personalized education in many STEM disciplines such as mathematics and physics, but has so far still been a challenging task in computing disciplines. One key obstacle to successful knowledge tracing in computing education lies in the accurate extraction of knowledge components (KCs), since multiple intertwined KCs are practiced at the same time for programming problems. In this paper, we address the limitations of current methods and explore a hybrid approach for KC extraction, which combines automated code parsing with an expert-built ontology. We use an introductory (CS1) Java benchmark dataset to compare its KC extraction performance with the traditional extraction methods using a state-of-the-art evaluation approach based on learning curves. Our preliminary results show considerable improvement over traditional methods of student modeling. The results indicate the opportunity to improve automated KC extraction in CS education by incorporating expert knowledge into the process.
more » « less
Free, publicly-accessible full text available June 13, 2026
An Automated Approach to Recommending Relevant Worked Examples for Programming Problems

https://doi.org/10.1145/3641554.3701951

Hoq, M; Patil, A; Akhuseyinoglu, K; Brusilovsky, P; Akram, B (February 2025, ACM Technical Symposium on Computer Science Education)

Novice programmers can greatly improve their understanding of challenging programming concepts by studying worked examples that demonstrate the implementation of these concepts. Despite the extensive repositories of effective worked examples created by CS education experts, a key challenge remains: identifying the most relevant worked example for a given programming problem and the specific difficulties a student faces solving the problem. Previous studies have explored similar example recommendation approaches. Our research introduces a novel method by utilizing deep learning code representation models to generate code vectors, capturing both syntactic and semantic similarities among programming examples. Driven by the need to provide relevant and personalized examples to programming students, our approach emphasizes similarity assessment and clustering techniques to identify similar code problems, examples, and challenges. This method aims to deliver more accurate and contextually relevant recommendations based on individual learning needs. Providing tailored support to students in real-time facilitates better problem-solving strategies and enhances students' learning experiences, contributing to the advancement of programming education.
more » « less
Free, publicly-accessible full text available February 12, 2026
Using Self-regulated Learning Theory to Inform the Design of Educational Recommender Systems for Introductory Programming

https://doi.org/10.1007/978-3-031-98462-4_35

Barria-Pineda, Jordan; Sonmez_Unal, Deniz; Akhuseyinoglu, Kamil; Brusilovsky, Peter; Walker, Erin (January 2025, Proceedings of 26th International Conference on Artificial Intelligence in Education, AIED 2025, Part 4, Springer Nature Switzerland)

Many educational recommender systems (EdRecSys) rely on commercial recommendation strategies that emphasize content relevance while neglecting learners’ views on recommendation effectiveness. To address this, we conducted a co-design study with computer science students in an introductory programming course to explore their vision of an ideal EdRecSys. The subjects shared preferences and concerns related to three areas: recommendation approaches, transparency, and control. We used Zimmerman’s model of self-regulated learning to contextualize their expectations within a broader educational framework. Findings offer actionable insights for designing learner-centered AIED systems that foster engagement, agency, and self-regulation.
more » « less
Free, publicly-accessible full text available January 1, 2026
Explaining Explainability: Early Performance Prediction with Student Programming Pattern Profiling

Hoq, Muntasir; Brusilovsky, Peter; Akram, Bita (November 2024, Journal of Educational Data Mining)

The ability to predict student performance in introductory programming courses is important to help struggling students and enhance their persistence. However, for this prediction to be impactful, it is crucial that it remains transparent and accessible for both instructors and students, ensuring effective utilization of the predicted results. Machine learning models with explainable features provide an effective means for students and instructors to comprehend students' diverse programming behaviors and problem-solving strategies, elucidating the factors contributing to both successful and suboptimal performance. This study develops an explainable model that predicts student performance based on programming assignment submission information in different stages of the course to enable early explainable predictions. We extract data-driven features from student programming submissions and utilize a stacked ensemble model for predicting final exam grades. The experimental results suggest that our model successfully predicts student performance based on their programming submissions earlier in the semester. Employing SHAP, a game-theory-based framework, we explain the model's predictions, aiding stakeholders in understanding the influence of diverse programming behaviors on students' success. Additionally, we analyze crucial features, employing a mix of descriptive statistics and mixture models to identify distinct student profiles based on their problem-solving patterns, enhancing overall explainability. Furthermore, we dive deeper and analyze the profiles using different programming patterns of the students to elucidate the characteristics of different students where SHAP explanations are not comprehensible. Our explainable early prediction model elucidates common problem-solving patterns in students relative to their expertise, facilitating effective intervention and adaptive support.
more » « less
Free, publicly-accessible full text available November 29, 2025
The Impact of Connecting Worked Examples and Completion Problems for Introductory Programming Practice

Akhuseyinoglu, Kamil; Klašnja-Milicevic, Aleksandra; Brusilovsky, Peter (September 2024, European Conference on Technology Enhanced Learning, EC-TEL 2025)

Worked examples have consistently demonstrated their value in education, serving as the model solutions for solving specific problem types. Past studies indicate that combining worked examples with practice problems is more effective than providing either problems or examples in isolation. Despite these findings, the exploration of the effects of grouping worked examples and problems for programming practice is limited, especially in learning environments designed for practice. This paper compares two content organization approaches in a practice system. The first one is explicitly connecting worked examples and completion problems, allowing students to access them in smaller bundles. The other one is delivering the same set of activities separately but keeping an implicit connection by grouping them under a topic. We examined the effects of these two approaches on student engagement and performance in a semester-long classroom experiment conducted in a CS1 programming course. The results indicate that explicitly connecting worked examples and completion problems increased engagement with the completion problems and supported problem-solving performance by leading to higher success rates and persistence.
more » « less
Full Text Available
Intelligent Technologies for Personalized Practice Systems

Brusilovsky, Peter (June 2024, Information and technology in education and learning)

Personalized practice systems focus on supporting self-organized learning in a free practice mode. Adapting to the learners’ knowledge and goals, these systems help them navigate the increasing volumes of smart learning content, guide them to practice opportunities that are most appropriate to their level of knowledge and increase their motivation to practice. In this paper, we distill the experience generated by 20 years of research on personalized practice systems into a set of AI-based technologies that make these systems efficient, engaging, and user-friendly.
more » « less
Full Text Available

Search for: All records