Title: A Tale of Two Comprehensions? Analyzing Student Programmer Attention during Code Summarization
Code summarization is the task of creating short, natural language descriptions of source code. It is an important part of code comprehension and a powerful method of documentation. Previous work has made progress in identifying where programmers focus in code as they write their own summaries (i.e., Writing). However, there is currently a gap in studying programmers' attention as they read code with pre-written summaries (i.e., Reading). As a result, it is currently unknown how these two forms of code comprehension, Reading and Writing, compare. There is also limited understanding of programmer attention with respect to program semantics. We address these shortcomings with a human eye-tracking study (n = 27) comparing Reading and Writing. We examined programmers' attention with respect to fine-grained program semantics, including their attention sequences (i.e., scan paths). We find distinctions in programmer attention across the comprehension tasks, similarities in reading patterns between them, and differences mediated by demographic factors. These findings can help guide code comprehension in both computer science education and automated code summarization. Furthermore, we mapped programmers' gaze data onto the Abstract Syntax Tree (AST) to explore another representation of human attention. We find that visual behavior on this structure is not always consistent with that on source code.
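As a rough illustration of the gaze-to-AST mapping described in the abstract, the sketch below resolves each fixation (already expressed as a source line/column position and a duration) to the smallest enclosing AST node and totals attention per node type. This is a minimal sketch only: it uses Python's built-in ast module on a toy Python snippet, whereas the study's actual stimuli (Java methods) and its screen-to-source alignment pipeline are not reproduced here, and the fixation values are placeholders.

```python
import ast
from collections import Counter

# Toy stand-in for a study stimulus. The study itself used Java methods; Python's
# built-in ast module is used here only to keep the sketch self-contained.
SOURCE = """\
def total(prices, tax_rate):
    subtotal = sum(prices)
    return subtotal * (1 + tax_rate)
"""

# Hypothetical fixations as (line, column, duration_ms), assumed to be already
# mapped from screen coordinates to source positions by the eye-tracking pipeline.
FIXATIONS = [(2, 16, 240), (2, 20, 180), (3, 11, 310), (3, 20, 200), (3, 27, 150)]

def enclosing_node(tree, line, col):
    """Return the smallest AST node whose source span contains (line, col)."""
    best_span, best_node = None, None
    for node in ast.walk(tree):
        if getattr(node, "lineno", None) is None or node.end_lineno is None:
            continue
        starts = (node.lineno, node.col_offset) <= (line, col)
        ends = (line, col) <= (node.end_lineno, node.end_col_offset)
        if starts and ends:
            span = (node.end_lineno - node.lineno,
                    node.end_col_offset - node.col_offset)
            if best_span is None or span < best_span:
                best_span, best_node = span, node
    return best_node

tree = ast.parse(SOURCE)
attention = Counter()
for line, col, duration in FIXATIONS:
    node = enclosing_node(tree, line, col)
    if node is not None:
        attention[type(node).__name__] += duration

# Aggregate fixation time per AST node type, most-attended first.
for node_type, total_ms in attention.most_common():
    print(f"{node_type:8s} {total_ms} ms")
```

Comparing such per-node attention totals across the Reading and Writing conditions is one way to contrast visual behavior on the AST with visual behavior on the raw source text.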
Award ID(s):
2211428 2211429
PAR ID:
10576277
Author(s) / Creator(s):
Publisher / Repository:
ACM
Date Published:
Journal Name:
ACM Transactions on Software Engineering and Methodology
Volume:
33
Issue:
7
ISSN:
1049-331X
Page Range / eLocation ID:
1 to 37
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Source code is a form of human communication, albeit one where the information shared between the programmers reading and writing the code is constrained by the requirement that the code executes correctly. Programming languages are more syntactically constrained than natural languages, but they are also very expressive, allowing a great many different ways to express even very simple computations. Still, code written by developers is highly predictable, and many programming tools have taken advantage of this phenomenon, relying on language model surprisal as a guiding mechanism. While surprisal has been validated as a measure of cognitive load in natural language, its relation to human cognitive processes in code is still poorly understood. In this paper, we explore the relationship between surprisal and programmer preference at a small granularity: do programmers prefer more predictable expressions in code? Using meaning-preserving transformations, we produce equivalent alternatives to developer-written code expressions and run a corpus study on Java and Python projects. In general, language models rate the code expressions developers choose to write as more predictable than these transformed alternatives. Then, we perform two human subject studies asking participants to choose between two equivalent snippets of Java code with different surprisal scores (one original and one transformed). We find that programmers do prefer more predictable variants, and that stronger language models like the transformer align more often and more consistently with these preferences.
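Surprisal here is the negative log-probability a language model assigns to each token given its prefix. As a minimal sketch of how such scores can be computed for a pair of meaning-equivalent expressions, the code below uses the Hugging Face transformers library with GPT-2 as a stand-in model; the specific models, transformation tooling, and example expressions from the paper are not reproduced here, and the Java pair shown is a hypothetical operand-swapped example.

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# GPT-2 is only a stand-in; the study compared several language models.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def mean_surprisal(snippet: str) -> float:
    """Average per-token surprisal (in bits) of the snippet under the model."""
    ids = tokenizer(snippet, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits
    # Predict token t from tokens < t; the first token has no prediction.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    token_lp = log_probs.gather(1, ids[0, 1:].unsqueeze(1)).squeeze(1)
    return (-token_lp / torch.log(torch.tensor(2.0))).mean().item()

# Hypothetical meaning-equivalent Java expressions (comparison operands swapped).
original    = "if (count > 0 && total / count > limit)"
transformed = "if (0 < count && limit < total / count)"
print(mean_surprisal(original), mean_surprisal(transformed))
```

Under the corpus-study finding above, the developer-written form would typically receive the lower mean surprisal of the pair.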
  2. Program comprehension is a vital skill in software development. This work investigates program comprehension by examining the eye movements of novice programmers as they gain programming experience over the duration of a Java course. Their eye movement behavior is compared to that of expert programmers. Eye movement studies of natural text show that word frequency and length influence fixation duration and act as indicators of reading skill. The study uses an existing longitudinal eye-tracking dataset with 20 novice and experienced readers of source code. The work investigates the acquisition of token frequency and token length effects in source code reading as an indication of program reading skill. The results show evidence of the frequency and length effects in reading source code and of the acquisition of these effects by novices. These results are then leveraged in a machine learning model demonstrating how eye movements can be used to estimate programming proficiency and classify novices versus experts with 72% accuracy.
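The 72% figure above comes from the study's own model and data. Purely as an illustrative sketch of how eye-movement features might feed such a classifier, the code below trains a logistic regression on hypothetical per-participant features (mean fixation duration plus frequency-effect and length-effect slopes); the feature names and the synthetic values are placeholders, not the paper's dataset, model, or result.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Placeholder per-participant features: [mean fixation duration (ms),
# token-frequency slope, token-length slope]. The real study derives these
# from the longitudinal eye-tracking data; values here are synthetic.
n_novices, n_experts = 10, 10
novices = rng.normal(loc=[260.0, -0.05, 0.9], scale=[30.0, 0.02, 0.2],
                     size=(n_novices, 3))
experts = rng.normal(loc=[220.0, -0.12, 0.6], scale=[30.0, 0.02, 0.2],
                     size=(n_experts, 3))

X = np.vstack([novices, experts])
y = np.array([0] * n_novices + [1] * n_experts)   # 0 = novice, 1 = expert

# Standardize features, then fit a simple linear classifier.
clf = make_pipeline(StandardScaler(), LogisticRegression())
scores = cross_val_score(clf, X, y, cv=5)
print(f"cross-validated accuracy: {scores.mean():.2f}")
```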
  3. Program comprehension is an important but hard-to-measure cognitive process. This makes it difficult to provide suitable programming languages, tools, or coding conventions to support developers in their everyday work. Here, we explore whether functional magnetic resonance imaging (fMRI) is feasible for soundly measuring program comprehension. To this end, we observed 17 participants inside an fMRI scanner while they were comprehending source code. The results show clear, distinct activation of five brain regions related to working memory, attention, and language processing, all of which fit well with our understanding of program comprehension. Furthermore, we found reduced activity in the default mode network, indicating the cognitive effort necessary for program comprehension. We also observed that familiarity with Java as the underlying programming language reduced cognitive effort during program comprehension. To gain confidence in the results and the method, we replicated the study with 11 new participants and largely confirmed our findings. Our results encourage us and, hopefully, others to use fMRI to observe programmers and, in the long run, answer questions such as: How should we train programmers? Can we train someone to become an excellent programmer? How effective are new languages and tools for program comprehension?
  4. Neural code summarization leverages deep learning models to automatically generate brief natural language summaries of code snippets. The development of Transformer models has led to extensive use of attention during model design. While existing work has focused almost exclusively on static properties of source code and related structural representations like the Abstract Syntax Tree (AST), few studies have considered human attention, that is, where programmers focus while examining and comprehending code. In this paper, we develop a method for incorporating human attention into machine attention to enhance neural code summarization. To facilitate this incorporation and validate this hypothesis, we introduce EyeTrans, which consists of three steps: (1) we conduct an extensive eye-tracking human study to collect and pre-analyze data for model training; (2) we devise a data-centric approach to integrate human attention with machine attention in the Transformer architecture; and (3) we conduct comprehensive experiments on two code summarization tasks to demonstrate the effectiveness of incorporating human attention into Transformers. Integrating human attention leads to an improvement of up to 29.91% in Functional Summarization and up to 6.39% in General Code Summarization performance, demonstrating the substantial benefits of this combination. We further explore performance in terms of robustness and efficiency by creating challenging summarization scenarios in which EyeTrans exhibits interesting properties. We also visualize the attention map to depict the simplifying effect that incorporating human attention has on machine attention in the Transformer. This work has the potential to propel AI research in software engineering by introducing more human-centered approaches and data.
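EyeTrans itself integrates human attention through a data-centric approach whose details are not given on this page. Purely as an illustration of the general idea of biasing machine attention with gaze data, the sketch below adds a fixation-derived weight as a bias on self-attention scores in a single-head PyTorch layer; the module name, tensor shapes, and weighting scheme are assumptions for this sketch, not the EyeTrans implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HumanBiasedSelfAttention(nn.Module):
    """Single-head self-attention whose scores are nudged toward tokens that
    received more human visual attention. Illustrative only."""
    def __init__(self, dim: int):
        super().__init__()
        self.qkv = nn.Linear(dim, 3 * dim)
        self.out = nn.Linear(dim, dim)
        self.scale = dim ** -0.5

    def forward(self, x, human_attn):
        # x: (batch, seq, dim); human_attn: (batch, seq) normalized gaze weights.
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        scores = torch.matmul(q, k.transpose(-2, -1)) * self.scale
        # Add the log of the human-attention weights as a bias over the keys.
        bias = torch.log(human_attn + 1e-6).unsqueeze(1)      # (batch, 1, seq)
        weights = F.softmax(scores + bias, dim=-1)
        return self.out(torch.matmul(weights, v))

# Usage with random placeholder tensors standing in for code-token embeddings
# and per-token fixation durations.
layer = HumanBiasedSelfAttention(dim=64)
tokens = torch.randn(2, 10, 64)
gaze = torch.rand(2, 10)
out = layer(tokens, gaze / gaze.sum(dim=1, keepdim=True))
print(out.shape)  # torch.Size([2, 10, 64])
```

The bias term makes heavily fixated tokens receive proportionally more attention mass, which captures the spirit of combining human and machine attention without claiming to reproduce the paper's architecture.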
  5. Debugging is a vital and time-consuming process in software engineering. Recently, researchers have begun using neuroimaging to understand the cognitive bases of programming tasks by measuring patterns of neural activity. While exciting, prior studies have only examined small sub-steps in isolation, such as comprehending a method without writing any code or writing a method from scratch without reading any already-existing code. We propose a simple multi-stage debugging model in which programmers transition between Task Comprehension, Fault Localization, Code Editing, Compiling, and Output Comprehension activities. We conduct a human study of n=28 participants using a combination of functional near-infrared spectroscopy and standard coding measurements (e.g., time taken, tests passed, etc.). Critically, we find that our proposed debugging stages are both neurally and behaviorally distinct. To the best of our knowledge, this is the first neurally-justified cognitive model of debugging. At the same time, there is significant interest in understanding how programmers from different backgrounds, such as those grappling with challenges in English prose comprehension, are impacted by code features when debugging. We use our cognitive model of debugging to investigate the role of one such feature: identifier construction. Specifically, we investigate how features of identifier construction impact neural activity while debugging by participants with and without reading difficulties. While we find significant differences in cognitive load as a function of morphology and expertise, we do not find significant differences in end-to-end programming outcomes (e.g., time, correctness, etc.). This nuanced result suggests that prior findings on the cognitive importance of identifier naming in isolated sub-steps may not generalize to end-to-end debugging. Finally, in a result relevant to broadening participation in computing, we find no behavioral outcome differences for participants with reading difficulties. 