Evolutionary Large Language Model for Automated Feature Transformation

Gong, Nanxu; Reddy, Chandan K; Ying, Wangyang; Chen, Haifeng; Fu, Yanjie

doi:10.1609/aaai.v39i16.33851

Citation Details

This content will become publicly available on April 11, 2026

Evolutionary Large Language Model for Automated Feature Transformation

Feature transformation aims to reconstruct the feature space of raw features to enhance the performance of downstream models. However, the exponential growth in the combinations of features and operations poses a challenge, making it difficult for existing methods to efficiently explore a wide space. Additionally, their optimization is solely driven by the accuracy of downstream models in specific domains, neglecting the acquisition of general feature knowledge. To fill this research gap, we propose an evolutionary LLM framework for automated feature transformation. This framework consists of two parts: 1) constructing a multi-population database through an RL data collector while utilizing evolutionary algorithm strategies for database maintenance, and 2) utilizing the ability of Large Language Model (LLM) in sequence understanding, we employ few-shot prompts to guide LLM in generating superior samples based on feature transformation sequence distinction. Leveraging the multi-population database initially provides a wide search scope to discover excellent populations. Through culling and evolution, high-quality populations are given greater opportunities, thereby furthering the pursuit of optimal individuals. By integrating LLMs with evolutionary algorithms, we achieve efficient exploration within a vast space, while harnessing feature knowledge to propel optimization, thus realizing a more adaptable search paradigm. Finally, we empirically demonstrate the effectiveness and generality of our proposed method. more »

Award ID(s):: 2416728

PAR ID:: 10621217

Author(s) / Creator(s):: Gong, Nanxu; Reddy, Chandan K; Ying, Wangyang; Chen, Haifeng; Fu, Yanjie

Publisher / Repository:: AAAI

Date Published:: 2025-04-11

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 39

Issue:: 16

ISSN:: 2159-5399

Page Range / eLocation ID:: 16844 to 16852

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on April 11, 2026
Journal Article:
https://doi.org/10.1609/aaai.v39i16.33851

More Like this