Generating Effective Distractors for Introductory Programming Challenges: LLMs vs Humans

Hassany, Mohammad; Brusilovsky, Peter; Savelka, Jaromir; Lekshmi_Narayanan, Arun Balajiee; Akhuseyinoglu, Kamil; Agarwal, Arav; Hendrawan, Rully Agus

doi:10.1145/3706468.3706529

This content will become publicly available on March 3, 2026

As large language models (LLMs) show great promise in generating a wide spectrum of educational materials, robust yet cost-effective assessment of the quality and effectiveness of such materials becomes an important challenge. Traditional approaches, including expert-based quality assessment and student-centered evaluation, are resource-intensive and do not scale efficiently. In this work, we explored the use of pre-existing student learning data as a promising approach to evaluating LLM-generated learning materials. Specifically, to evaluate the quality of LLM-generated distractors, we used a dataset in which students completed program construction challenges by picking the correct answers from among human-authored distractors; the LLM-generated distractors were produced for the same challenges. The dataset included responses from 1,071 students across 22 classes taught from Fall 2017 to Spring 2023. We evaluated five prominent LLMs (OpenAI-o1, GPT-4, GPT-4o, GPT-4o-mini, and Llama-3.1-8b) across three different prompts to see which combinations result in more effective distractors, i.e., those that are plausible (often picked by students) and potentially based on common misconceptions. Our results suggest that GPT-4o was the most effective model, matching close to 50% of the functional distractors originally authored by humans. At the same time, all of the evaluated LLMs generated many novel distractors, i.e., those that did not match the pre-existing human-authored ones. Our preliminary analysis suggests that these novel distractors are promising. Establishing their effectiveness in real-world classroom settings is left for future work.

Award ID(s): 2213789

PAR ID: 10613695

Author(s) / Creator(s): Hassany, Mohammad; Brusilovsky, Peter; Savelka, Jaromir; Lekshmi_Narayanan, Arun Balajiee; Akhuseyinoglu, Kamil; Agarwal, Arav; Hendrawan, Rully Agus

Publisher / Repository: ACM

Date Published: 2025-03-03

ISBN: 9798400707018

Page Range / eLocation ID: 484 to 493

Location: Dublin, Ireland

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Conference Paper:
https://doi.org/10.1145/3706468.3706529