NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints

Kong, Lingkai; Du, Yuanqi; Mu, Wenhao; Neklyudov, Kirill; De_Bortoli, Valentin; Wu, Dongxia; Wang, Haorui; Ferber, Aaron M; Ma, Yian; Gomes, Carla_P; et al (May 2025, Proceedings of Machine Learning Research)

Free, publicly-accessible full text available May 1, 2026
Graph Generative Pre-trained Transformer

Chen, Xiaohui; Wang, Yinkai; He, Jiaxing; Du, Yuanqi; Hassoun, Soha; Xu, Xiaolin; Liu, Liping (January 2025, arxiv.org)

Full Text Available
Efficient Evolutionary Search Over Chemical Space with Large Language Models

Wang, Haorui; Skreta, Marta; Ser, Cher_Tian; Gao, Wenhao; Kong, Lingkai; Strieth-Kalthoff, Felix; Duan, Chenru; Zhuang, Yuchen; Yu, Yue; Zhu, Yanqiao; et al (April 2025, International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available April 24, 2026
Controllable Data Generation by Deep Learning: A Review

https://doi.org/10.1145/3648609

Wang, Shiyu; Du, Yuanqi; Guo, Xiaojie; Pan, Bo; Qin, Zhaohui; Zhao, Liang (October 2024, ACM Computing Surveys)

Designing and generating new data under targeted properties has been attracting various critical applications such as molecule design, image editing and speech synthesis. Traditional hand-crafted approaches heavily rely on expertise experience and intensive human efforts, yet still suffer from the insufficiency of scientific knowledge and low throughput to support effective and efficient data generation. Recently, the advancement of deep learning has created the opportunity for expressive methods to learn the underlying representation and properties of data. Such capability provides new ways of determining the mutual relationship between the structural patterns and functional properties of the data and leveraging such relationships to generate structural data, given the desired properties. This article is a systematic review that explains this promising research area, commonly known as controllable deep data generation. First, the article raises the potential challenges and provides preliminaries. Then the article formally defines controllable deep data generation, proposes a taxonomy on various techniques and summarizes the evaluation metrics in this specific domain. After that, the article introduces exciting applications of controllable deep data generation, experimentally analyzes and compares existing works. Finally, this article highlights the promising future directions of controllable deep data generation and identifies five potential challenges.
more » « less
Full Text Available
Aligning Large Language Models with Representation Editing: A Control Perspective

Kong, Lingkai; Wang, Haorui; Mu, Wenhao; Du, Yuanqi; Zhuang, Yuchen; Zhou, Yifei; Song, Yue; Zhang, Rongzhi; Wang, Kai; Zhang, Chao (December 2024, NeurIPS 2024)

Aligning large language models (LLMs) with human objectives is crucial for real-world applications. However, fine-tuning LLMs for alignment often suffers from unstable training and requires substantial computing resources. Test-time alignment techniques, such as prompting and guided decoding, do not modify the underlying model, and their performance remains dependent on the original model's capabilities. To address these challenges, we propose aligning LLMs through representation editing. The core of our method is to view a pre-trained autoregressive LLM as a discrete-time stochastic dynamical system. To achieve alignment for specific objectives, we introduce external control signals into the state space of this language dynamical system. We train a value function directly on the hidden states according to the Bellman equation, enabling gradient-based optimization to obtain the optimal control signals at test time. Our experiments demonstrate that our method outperforms existing test-time alignment techniques while requiring significantly fewer resources compared to fine-tuning methods. Our code is available at https://github.com/Lingkai-Kong/RE-Control.
more » « less
Full Text Available
Accurate transition state generation with an object-aware equivariant elementary reaction diffusion model

https://doi.org/10.1038/s43588-023-00563-7

Duan, Chenru; Du, Yuanqi; Jia, Haojun; Kulik, Heather J. (December 2023, Nature Computational Science)

Full Text Available
On Separate Normalization in Self-supervised Transformers

Chen, Xiaohui; Wang, Yinkai; Du, Yuanqi; Hassoun, Soha; Liu, Li-Ping (December 2023, Advances in Neural Information Processing Systems 36)

Self-supervised training methods for transformers have demonstrated remarkable performance across various domains. Previous transformer-based models, such as masked autoencoders (MAE), typically utilize a single normalization layer for both the [CLS] symbol and the tokens. We propose in this paper a simple modification that employs separate normalization layers for the tokens and the [CLS] symbol to better capture their distinct characteristics and enhance downstream task performance. Our method aims to alleviate the potential negative effects of using the same normalization statistics for both token types, which may not be optimally aligned with their individual roles. We empirically show that by utilizing a separate normalization layer, the [CLS] embeddings can better encode the global contextual information and are distributed more uniformly in its anisotropic space. When replacing the conventional normalization layer with the two separate layers, we observe an average 2.7% performance improvement over the image, natural language, and graph domains.
more » « less
Interpretable Molecular Graph Generation via Monotonic Constraints

https://doi.org/10.1137/1.9781611977172.9

Du, Yuanqi; Guo, Xiaojie; Shehu, Amarda; Zhao, Liang (July 2022, Proceedings of the 2022 SIAM International Conference on Data Mining (SDM))

Full Text Available
Disentangled Spatiotemporal Graph Generative Models

https://doi.org/10.1609/aaai.v36i6.20607

Du, Yuanqi; Guo, Xiaojie; Cao, Hengning; Ye, Yanfang; Zhao, Liang (July 2022, Proceedings of the AAAI Conference on Artificial Intelligence)

Full Text Available
Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

https://doi.org/10.1561/2200000115

Zhang, Xuan; Wang, Limei; Helwig, Jacob; Luo, Youzhi; Fu, Cong; Xie, Yaochen; Liu, Meng; Lin, Yuchao; Xu, Zhao; Yan, Keqiang; et al (January 2025, Foundations and Trends® in Machine Learning)

Full Text Available

« Prev Next »

Search for: All records