Reductive Analysis with Compiler-Guided Large Language Models for Input-Centric Code Optimizations

Wang, Xiangwei (ORCID:0009000136490852); Hui, Xinning (ORCID:0000000170785454); Liao, Chunhua (ORCID:0000000164770547); Shen, Xipeng (ORCID:0000000335998010)

doi:10.1145/3729282

Citation Details

Reductive Analysis with Compiler-Guided Large Language Models for Input-Centric Code Optimizations

Input-centric program optimization aims to optimize code by considering the relations between program inputs and program behaviors. Despite its promise, a long-standing barrier for its adoption is the difficulty of automatically identifying critical features of complex inputs. This paper introduces a novel technique,reductive analysis through compiler-guided Large Language Models (LLMs), to solve the problem through a synergy between compilers and LLMs. It uses a reductive approach to overcome the scalability and other limitations of LLMs in program code analysis. The solution, for the first time, automates the identification of critical input features without heavy instrumentation or profiling, cutting the time needed for input identification by 44× (or 450× for local LLMs), reduced from 9.6 hours to 13 minutes (with remote LLMs) or 77 seconds (with local LLMs) on average, making input characterization possible to be integrated into the workflow of program compilations. Optimizations on those identified input features show similar or even better results than those identified by previous profiling-based methods, leading to optimizations that yield 92.6% accuracy in selecting the appropriate adaptive OpenMP parallelization decisions, and 20–30% performance improvement of serverless computing while reducing resource usage by 50–60%. more »

Award ID(s):: 2417850 2312207

PAR ID:: 10650639

Author(s) / Creator(s):: Wang, Xiangwei; Hui, Xinning; Liao, Chunhua; Shen, Xipeng

Publisher / Repository:: ACM Digital Library

Date Published:: 2025-06-10

Journal Name:: Proceedings of the ACM on Programming Languages

Volume:: 9

Issue:: PLDI

ISSN:: 2475-1421

Page Range / eLocation ID:: 797 to 821

Subject(s) / Keyword(s):: Large Language Models, Program Optimization, Input-Centric Optimization, Seminal Behavior Identification, Predictive Modeling

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Journal Article:
https://doi.org/10.1145/3729282

More Like this