NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

Mao Ye, Bo Liu (September 2022, arXivorg)

Full Text Available
Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems

Mao Ye, Ruichen Jiang (April 2022, Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022))

Full Text Available
Pareto navigation gradient descent: a first-order algorithm for optimization in pareto set

Mao Ye, Qiang Liu (January 2022, Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence)

Full Text Available
Argmax Centroids: with Applications to Multi-domain Learning

Chengyue Gong, Mao Ye (December 2021, NeurIPS 2021)

Full Text Available
First Hitting Diffusion Models for Generating Manifold, Graph and Categorical Data

Mao Ye; Lemeng Wu; Qiang Liu (January 2022, Advances in neural information processing systems)

We propose a family of First Hitting Diffusion Models (FHDM), deep generative models that generate data with a diffusion process that terminates at a random first hitting time. This yields an extension of the standard fixed-time diffusion models that terminate at a pre-specified deterministic time. Although standard diffusion models are designed for continuous unconstrained data, FHDM is natu- rally designed to learn distributions on continuous as well as a range of discrete and structure domains. Moreover, FHDM enables instance-dependent terminate time and accelerates the diffusion process to sample higher quality data with fewer diffusion steps. Technically, we train FHDM by maximum likelihood estimation on diffusion trajectories augmented from observed data with conditional first hitting processes (i.e., bridge) derived based on Doob’s h-transform, deviating from the commonly used time-reversal mechanism. We apply FHDM to generate data in various domains such as point cloud (general continuous distribution), climate and geographical events on earth (continuous distribution on the sphere), unweighted graphs (distribution of binary matrices), and segmentation maps of 2D images (high-dimensional categorical distribution). We observe considerable improvement compared with the state-of-the-art approaches in both quality and speed.
more » « less
Full Text Available
BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

Bo Liu; Mao Ye; Stephen Wright; Peter Stone; Qiang Liu (January 2022, Advances in neural information processing systems)

Bilevel optimization (BO) is useful for solving a variety of important machine learning problems including but not limited to hyperparameter optimization, meta- learning, continual learning, and reinforcement learning. Conventional BO methods need to differentiate through the low-level optimization process with implicit dif- ferentiation, which requires expensive calculations related to the Hessian matrix. There has been a recent quest for first-order methods for BO, but the methods pro- posed to date tend to be complicated and impractical for large-scale deep learning applications. In this work, we propose a simple first-order BO algorithm that de- pends only on first-order gradient information, requires no implicit differentiation, and is practical and efficient for large-scale non-convex functions in deep learning. We provide a non-asymptotic convergence analysis of the proposed method to stationary points for non-convex objectives and present empirical results that show its superior practical performance.
more » « less
Full Text Available
Knowing" When" and" Where": Temporal-ASTNN for Student Learning Progression in Novice Programming Tasks.

Mao, Ye; Shi, Y.; Marwan, S.; Price, T. W.; Barnes, T.; Chi, M. (January 2021, International Educational Data Mining Society.)

Full Text Available
Knowing" When" and" Where": Temporal-ASTNN for Student Learning Progression in Novice Programming Tasks.

Mao, Ye; Shi, Y.; Marwan, S.; Price, T. W.; Barnes, T.; Chi, M. (January 2021, International Educational Data Mining Society.)

Full Text Available

Search for: All records