A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization

Chu, Yucheng; Li, Hang; Yang, Kaiqi; Shomer, Harry; Copur-Gencturk, Yasemin; Kaldaras, Leonora; Haudek, Kevin; Krajcik, Joseph; Shin, Namsoo; Liu, Hui; Tang, Jiliang

doi:10.5281/zenodo.15870201

Citation Details

This content will become publicly available on July 12, 2026

A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization

Open-text responses provide researchers and educators with rich, nuanced insights that multiple-choice questions cannot capture. When reliably assessed, such responses have the potential to enhance teaching and learning. However, scaling and consistently capturing these nuances remain significant challenges, limiting the widespread use of open-text questions in educational research and assessments. In this paper, we introduce and evaluate GradeOpt, a unified multiagent automatic short-answer grading (ASAG) framework that leverages large language models (LLMs) as graders for short-answer responses. More importantly, GradeOpt incorporates two additional LLM-based agents—the reflector and the refiner—into the multi-agent system. This enables GradeOpt to automatically optimize the original grading guidelines by performing self-reflection on its errors. To assess GradeOpt's effectiveness, we conducted experiments on two representative ASAG datasets, which include items designed to capture key aspects of teachers' pedagogical knowledge and students' learning progress. Our results demonstrate that GradeOpt consistently outperforms representative baselines in both grading accuracy and alignment with human evaluators across different knowledge domains. Finally, comprehensive ablation studies validate the contributions of GradeOpt's individual components, confirming their impact on overall performance. more »

Award ID(s):: 2200757

PAR ID:: 10621106

Author(s) / Creator(s):: Chu, Yucheng; Li, Hang; Yang, Kaiqi; Shomer, Harry; Copur-Gencturk, Yasemin; Kaldaras, Leonora; Haudek, Kevin; Krajcik, Joseph; Shin, Namsoo; Liu, Hui; Tang, Jiliang

Editor(s):: Mills, Caitlin; Alexandron, Giora; Taibi, Davide; Lo_Bosco, Giosuè; Paquette, Luc

Publisher / Repository:: International Educational Data Mining Society

Date Published:: 2025-07-12

ISBN:: 978-1-7336736-6-2

Format(s):: Medium: X

Right(s):: Creative Commons Attribution 4.0 International

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on July 12, 2026
Conference Paper:
https://doi.org/10.5281/zenodo.15870201

More Like this