SelfCode 2.0: An Annotated Corpus of Student and Expert Line-by-Line Explanations of Code Examples for Automated Assessment

Chapagain, Jeevan; Lekshmi, Arun Balajiee; Akhuseyinoglu, Kamil; Brusilovsky, Peter; Rus, Vasile

doi:10.32473/flairs.38.1.138727

Citation Details

SelfCode 2.0: An Annotated Corpus of Student and Expert Line-by-Line Explanations of Code Examples for Automated Assessment

Assessing student responses is a critical task in adaptive educational systems. More specifically, automatically evaluating students' self-explanations contributes to understanding their knowledge state which is needed for personalized instruction, the crux of adaptive educational systems. To facilitate the development of Artificial Intelligence (AI) and Machine Learning models for automated assessment of learners' self-explanations, annotated datasets are essential. In response to this need, we developed the SelfCode2.0 corpus, which consists of 3,019 pairs of student and expert explanations of Java code snippets, each annotated with semantic similarity, correctness, and completeness scores provided by experts. Alongside the dataset, we also provide performance results obtained with several baseline models based on TF-IDF and Sentence-BERT vectorial representations. This work aims to enhance the effectiveness of automated assessment tools in programming education and contribute to a better understanding and supporting student learning of programming. more »

Award ID(s):: 2213789 1918751

PAR ID:: 10613654

Author(s) / Creator(s):: Chapagain, Jeevan; Lekshmi, Arun Balajiee; Akhuseyinoglu, Kamil; Brusilovsky, Peter; Rus, Vasile

Publisher / Repository:: Florida Online Journals

Date Published:: 2025-05-14

Journal Name:: The International FLAIRS Conference Proceedings

Volume:: 38

ISSN:: 2334-0754

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Journal Article:
https://doi.org/10.32473/flairs.38.1.138727

More Like this