How Beginning Programmers and Code LLMs (Mis)read Each Other

Nguyen, Sydney; Babe, Hannah McLean; Zi, Yangtian; Guha, Arjun; Anderson, Carolyn Jane; Feldman, Molly Q

doi:10.1145/3613904.3642706

Citation Details

How Beginning Programmers and Code LLMs (Mis)read Each Other

Generative AI models, specifically large language models (LLMs), have made strides towards the long-standing goal of text-to-code generation. This progress has invited numerous studies of user interaction. However, less is known about the struggles and strategies of non-experts, for whom each step of the text-to-code problem presents challenges: describing their intent in natural language, evaluating the correctness of generated code, and editing prompts when the generated code is incorrect. This paper presents a large-scale controlled study of how 120 beginning coders across three academic institutions approach writing and editing prompts. A novel experimental design allows us to target specific steps in the text-to-code process and reveals that beginners struggle with writ- ing and editing prompts, even for problems at their skill level and when correctness is automatically determined. Our mixed-methods evaluation provides insight into student processes and perceptions with key implications for non-expert Code LLM use within and outside of education. more »

Award ID(s):: 2326175 2326173 2326174

PAR ID:: 10509619

Author(s) / Creator(s):: Nguyen, Sydney; Babe, Hannah McLean; Zi, Yangtian; Guha, Arjun; Anderson, Carolyn Jane; Feldman, Molly Q

Publisher / Repository:: ACM

Date Published:: 2024-05-11

ISBN:: 9798400703300

Page Range / eLocation ID:: 1 to 26

Format(s):: Medium: X

Location:: Honolulu HI USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3613904.3642706

More Like this