NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MADGEN: Mass-Spec attends to De Novo Molecular generation

Wang, Yinkai; Chen, Xiaohui; Liu, Liping; Hassoun, Soha (April 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available April 28, 2026
Graph Generative Pre-trained Transformer

Chen, Xiaohui; Wang, Yinkai; He, Jiaxing; Du, Yuanqi; Hassoun, Soha; Xu, Xiaolin; Liu, Liping (January 2025, arxiv.org)

Full Text Available
MassSpecGym: A benchmark for the discovery and identification of molecules

Bushuiev, Roman; Bushuiev, Anton; Jonge, Niek F; Young, Adamo; Kretschmer, Fleming; Samusevich, Raman; Heirman, Janne; Wang, Fei; Zhang, Luke; Dührkop, Kai; et al (December 2024, The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track)

Full Text Available
Incorporating Inductive Biases to Energy-based Generative Models

Li, Yukun; Liu, Li-Ping (October 2024, Transactions on machine learning research)
Larochelle, Hugo; Murray, Naila; Kamath, Gautam; Shah, Nihar B (Ed.)
Full Text Available
Graph Pruning for Enumeration of Minimal Unsatisfiable Subsets

Lymperopoulos, Panagiotis; Liu, Liping (May 2024, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics)
Dasgupta, Sanjoy; Mandt, Stephan; Li, Yingzhen (Ed.)
Full Text Available
Unifying Predictions of Deterministic and Stochastic Physics in Mesh-reduced Space with Sequential Flow Generative Model

Sun, Luning; Han, Xu; Gao, Han; Wang, Jian-Xun; Liu, Li-Ping (December 2023, Advances in Neural Information Processing Systems 36)

Accurate prediction of dynamical systems in unstructured meshes has recently shown successes in scientific simulations. Many dynamical systems have a nonnegligible level of stochasticity introduced by various factors (e.g. chaoticity), so there is a need for a unified framework that captures both deterministic and stochastic components in the rollouts of these systems. Inspired by regeneration learning, we propose a new model that combines generative and sequential networks to model dynamical systems. Specifically, we use an autoencoder to learn compact representations of full-space physical variables in a low-dimensional space. We then integrate a transformer with a conditional normalizing flow model to model the temporal sequence of latent representations. We evaluate the new model in both deterministic and stochastic systems. The model outperforms several competitive baseline models and makes more accurate predictions of deterministic systems. Its own prediction error is also reflected in its uncertainty estimations. When predicting stochastic systems, the proposed model generates high-quality rollout samples. The mean and variance of these samples well match the statistics of samples computed from expensive numerical simulations.
more » « less
On Separate Normalization in Self-supervised Transformers

Chen, Xiaohui; Wang, Yinkai; Du, Yuanqi; Hassoun, Soha; Liu, Li-Ping (December 2023, Advances in Neural Information Processing Systems 36)

Self-supervised training methods for transformers have demonstrated remarkable performance across various domains. Previous transformer-based models, such as masked autoencoders (MAE), typically utilize a single normalization layer for both the [CLS] symbol and the tokens. We propose in this paper a simple modification that employs separate normalization layers for the tokens and the [CLS] symbol to better capture their distinct characteristics and enhance downstream task performance. Our method aims to alleviate the potential negative effects of using the same normalization statistics for both token types, which may not be optimally aligned with their individual roles. We empirically show that by utilizing a separate normalization layer, the [CLS] embeddings can better encode the global contextual information and are distributed more uniformly in its anisotropic space. When replacing the conventional normalization layer with the two separate layers, we observe an average 2.7% performance improvement over the image, natural language, and graph domains.
more » « less
Efficient and Degree-Guided Graph Generation via Discrete Diffusion Modeling

Chen, Xiaohui; He, Jiaxing; Han, Xu; Liu, Li-Ping (July 2023, Proceedings of Machine Learning Research)

Diffusion-based graph generative models are effective in generating high-quality small graphs. However, it is hard to scale them to large graphs that contain thousands of nodes. In this work, we propose EDGE, a new diffusion-based graph generative model that addresses generative tasks for large graphs. The model is developed by reversing a discrete diffusion process that randomly removes edges until obtaining an empty graph. It leverages graph sparsity in the diffusion process to improve computational efficiency. In particular, EDGE only focuses on a small portion of graph nodes and only adds edges between these nodes. Without compromising modeling ability, it makes much fewer edge predictions than previous diffusion-based generative models. Furthermore, EDGE can explicitly model the node degrees of training graphs and then gain performance improvement in capturing graph statistics. The empirical study shows that EDGE is much more efficient than competing methods and can generate large graphs with thousands of nodes. It also outperforms baseline models in generation quality: graphs generated by the proposed model have graph statistics more similar to those of training graphs.
more » « less
Full Text Available
Efficient and Degree-Guided Graph Generation via Discrete Diffusion Modeling

Chen, Xiaohui; He, Jiaxing; Han, Xu; Liu, Li-Ping (July 2023, International Conference on Machine Learning)

Diffusion-based graph generative models are effective in generating high-quality small graphs. However, it is hard to scale them to large graphs that contain thousands of nodes. In this work, we propose EDGE, a new diffusion-based graph generative model that addresses generative tasks for large graphs. The model is developed by reversing a discrete diffusion process that randomly removes edges until obtaining an empty graph. It leverages graph sparsity in the diffusion process to improve computational efficiency. In particular, EDGE only focuses on a small portion of graph nodes and only adds edges between these nodes. Without compromising modeling ability, it makes much fewer edge predictions than previous diffusion-based generative models. Furthermore, EDGE can explicitly model the node degrees of training graphs and then gain performance improvement in capturing graph statistics. The empirical study shows that EDGE is much more efficient than competing methods and can generate large graphs with thousands of nodes. It also outperforms baseline models in generation quality: graphs generated by the proposed model have graph statistics more similar to those of training graphs.
more » « less
Full Text Available
Fitting Autoregressive Graph Generative Models through Maximum Likelihood Estimation

Han, Xu; Chen, Xiaohui; Ruiz, Francisco J.; Liu, Li-Ping (March 2023, Journal of machine learning research)
Zhou, Mingyuan (Ed.)
We consider the problem of fitting autoregressive graph generative models via maximum likelihood estimation (MLE). MLE is intractable for graph autoregressive models because the nodes in a graph can be arbitrarily reordered; thus the exact likelihood involves a sum over all possible node orders leading to the same graph. In this work, we fit the graph models by maximizing a variational bound, which is built by first deriving the joint probability over the graph and the node order of the autoregressive process. This approach avoids the need to specify ad-hoc node orders, since an inference network learns the most likely node sequences that have generated a given graph. We improve the approach by developing a graph generative model based on attention mechanisms and an inference network based on routing search. We demonstrate empirically that fitting autoregressive graph models via variational inference improves their qualitative and quantitative performance, and the improved model and inference network further boost the performance. The implementation of the proposed model is publicly available at https://github.com/tufts-ml/Graph-Generation-MLE.
more » « less
Full Text Available

Search for: All records