NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

DF$^2$: Distribution-Free Decision-Focused Learning

Kong, Lingkai; Mu, Wenhao; Cui, Jiaming; Zhuang, Yuchen; Prakash, B Aditya; Dai, Bo; Zhang, Chao (July 2025, AUAI Press)

Free, publicly-accessible full text available July 21, 2026
Efficient Evolutionary Search Over Chemical Space with Large Language Models

Wang, Haorui; Skreta, Marta; Ser, Cher_Tian; Gao, Wenhao; Kong, Lingkai; Strieth-Kalthoff, Felix; Duan, Chenru; Zhuang, Yuchen; Yu, Yue; Zhu, Yanqiao; et al (April 2025, International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available April 24, 2026
HYDRA: Model Factorization Framework for Black-Box LLM Personalization

Zhuang, Yuchen; Sun, Haotian; Yu, Yue; Qiang, Rushi; Wang, Qifan; Zhang, Chao; Dai, Bo (December 2024, Curran Associates, Inc.)

Personalization has emerged as a critical research area in modern intelligent systems, focusing on mining users' behavioral history and adapting to their preferences for delivering tailored experiences. Despite the remarkable few-shot capabilities exhibited by black-box large language models (LLMs), the inherent opacity of their model parameters presents significant challenges in aligning the generated output with individual expectations. Existing solutions have primarily focused on prompt design to incorporate user-specific profiles and behaviors; however, such approaches often struggle to generalize effectively due to their inability to capture shared knowledge among all users. To address these challenges, we propose HYDRA, a model factorization framework that captures both user-specific behavior patterns from historical data and shared general knowledge among all users to deliver personalized generation. In order to capture user-specific behavior patterns, we first train a reranker to prioritize the most useful information from top-retrieved relevant historical records. By combining the prioritized history with the corresponding query, we train an adapter to align the output with individual user-specific preferences, eliminating the reliance on access to inherent model parameters of black-box LLMs. Both the reranker and the adapter can be decomposed into a base model with multiple user-specific heads, resembling a hydra. The base model maintains shared knowledge across users, while the multiple personal heads capture user-specific preferences. Experimental results demonstrate that \method outperforms existing state-of-the-art prompt-based methods by an average relative improvement of 9.01% across five diverse personalization tasks in the LaMP benchmark.
more » « less
Full Text Available
Aligning Large Language Models with Representation Editing: A Control Perspective

Kong, Lingkai; Wang, Haorui; Mu, Wenhao; Du, Yuanqi; Zhuang, Yuchen; Zhou, Yifei; Song, Yue; Zhang, Rongzhi; Wang, Kai; Zhang, Chao (December 2024, NeurIPS 2024)

Aligning large language models (LLMs) with human objectives is crucial for real-world applications. However, fine-tuning LLMs for alignment often suffers from unstable training and requires substantial computing resources. Test-time alignment techniques, such as prompting and guided decoding, do not modify the underlying model, and their performance remains dependent on the original model's capabilities. To address these challenges, we propose aligning LLMs through representation editing. The core of our method is to view a pre-trained autoregressive LLM as a discrete-time stochastic dynamical system. To achieve alignment for specific objectives, we introduce external control signals into the state space of this language dynamical system. We train a value function directly on the hidden states according to the Bellman equation, enabling gradient-based optimization to obtain the optimal control signals at test time. Our experiments demonstrate that our method outperforms existing test-time alignment techniques while requiring significantly fewer resources compared to fine-tuning methods. Our code is available at https://github.com/Lingkai-Kong/RE-Control.
more » « less
Full Text Available
Two Birds with One Stone: Enhancing Calibration and Interpretability with Graph Functional Neural Process

Kong, Lingkai; Sun, Haotian; Zhuang, Yuchen; Wang, Haorui; Zhang, Chao (May 2024, Proceedings of Machine Learning Research)

Full Text Available
POLYIE: A Dataset of Information Extraction from Polymer Material Scientific Literature

Cheung, Jerry_Junyang; Zhuang, Yuchen; Li, Yinghao; Shetty, Pranav; Zhao, Wantian; Grampurohit, Sanjeev; Ramprasad, Rampi; Zhang, Chao (June 2024, Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL))

Full Text Available
RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records

https://doi.org/10.18653/v1/2024.acl-short.68

Xu, Ran; Shi, Wenqi; Yu, Yue; Zhuang, Yuchen; Jin, Bowen; Wang, May Dongmei; Ho, Joyce; Yang, Carl (January 2024, Association for Computational Linguistics)

Full Text Available
DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling

https://doi.org/10.1145/3580305.3599318

Zhuang, Yuchen; Yu, Yue; Kong, Lingkai; Chen, Xiang; Zhang, Chao (August 2023, ACM)

Full Text Available
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers

https://doi.org/10.18653/v1/2024.emnlp-main.1241

Xu, Ran; Shi, Wenqi; Yu, Yue; Zhuang, Yuchen; Zhu, Yanqiao; Wang, May Dongmei; Ho, Joyce C; Zhang, Chao; Yang, Carl (January 2024, Association for Computational Linguistics)

Full Text Available
Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models

https://doi.org/10.18653/v1/2024.findings-acl.916

Xu, Ran; Cui, Hejie; Yu, Yue; Kan, Xuan; Shi, Wenqi; Zhuang, Yuchen; Wang, May Dongmei; Jin, Wei; Ho, Joyce; Yang, Carl (January 2024, Association for Computational Linguistics)

Full Text Available

« Prev Next »

Search for: All records