NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

Bukharin, Alexander; Li, Yan; Yu, Yue; Zhang, Qingru; Chen, Zhehui; Zuo, Simiao; Zhang, Chao; Zhang, Songan; Zhao, Tuo. (December 2023, Conference on Neural Information Processing Systems)

Full Text Available
A hierarchical expected improvement method for Bayesian optimization

https://doi.org/10.1080/01621459.2023.2210803

Chen, Zhehui; Mak, Simon; Wu, C. F. (June 2023, Journal of the American Statistical Association)

Full Text Available
A Diffusion Approximation Theory of Momentum Stochastic Gradient Descent in Nonconvex Optimization

https://doi.org/10.1287/stsy.2021.0083

Liu, Tianyi; Chen, Zhehui; Zhou, Enlu; Zhao, Tuo (December 2021, Stochastic Systems)

Momentum stochastic gradient descent (MSGD) algorithm has been widely applied to many nonconvex optimization problems in machine learning (e.g., training deep neural networks, variational Bayesian inference, etc.). Despite its empirical success, there is still a lack of theoretical understanding of convergence properties of MSGD. To fill this gap, we propose to analyze the algorithmic behavior of MSGD by diffusion approximations for nonconvex optimization problems with strict saddle points and isolated local optima. Our study shows that the momentum helps escape from saddle points but hurts the convergence within the neighborhood of optima (if without the step size annealing or momentum annealing). Our theoretical discovery partially corroborates the empirical success of MSGD in training deep neural networks.
more » « less
Full Text Available
Learning to Defend by Learning to Attack

Jiang, Haoming; Chen, Zhehui; Shi, Yuyang; Dai, Bo; Zhao, Tuo. (April 2021, International Conference on Artificial Intelligence and Statistics)

Full Text Available
On Constrained Nonconvex Stochastic Optimization: A Case Study for Generalized Eigenvalue Decomposition

Chen, Zhehui; Li, Xingguo; Yang, Lin; Haupt, Jarvis; Zhao, Tuo (April 2019, International Conference on Artificial Intelligence and Statistics)

We study constrained nonconvex optimization problems in machine learning and signal processing. It is well-known that these problems can be rewritten to a min-max problem in a Lagrangian form. However, due to the lack of convexity, their landscape is not well understood and how to find the stable equilibria of the Lagrangian function is still unknown. To bridge the gap, we study the landscape of the Lagrangian function. Further, we define a special class of Lagrangian functions. They enjoy the following two properties: 1. Equilibria are either stable or unstable (Formal definition in Section 2); 2.Stable equilibria correspond to the global optima of the original problem. We show that a generalized eigenvalue (GEV) problem, including canonical correlation analysis and other problems as special examples, belongs to the class. Specifically, we characterize its stable and unstable equilibria by leveraging an invariant group and symmetric property (more details in Section 3). Motivated by these neat geometric structures, we propose a simple, efficient, and stochastic primal-dual algorithm solving the online GEV problem. Theoretically, under sufficient conditions, we establish an asymptotic rate of convergence and obtain the first sample complexity result for the online GEV problem by diffusion approximations, which are widely used in applied probability. Numerical results are also provided to support our theory.
more » « less
Full Text Available
On Computation and Generalization of Generative Adversarial Networks under Spectrum Control

Jiang, Haoming; Chen, Zhehui; Chen, Minshuo; Liu, Feng; Wang, Dingding; Zhao, Tuo (May 2019, International Conference on Learning Representations)

Generative Adversarial Networks (GANs), though powerful, is hard to train. Sev- eral recent works (Brock et al., 2016; Miyato et al., 2018) suggest that controlling the spectra of weight matrices in the discriminator can significantly improve the training of GANs. Motivated by their discovery, we propose a new framework for training GANs, which allows more flexible spectrum control (e.g., making the weight matrices of the discriminator have slow singular value decays). Specifically, we propose a new reparameterization approach for the weight matrices of the discriminator in GANs, which allows us to directly manipulate the spectra of the weight matrices through various regularizers and constraints, without intensively computing singular value decompositions. Theoretically, we further show that the spectrum control improves the generalization ability of GANs. Our experiments on CIFAR-10, STL-10, and ImgaeNet datasets confirm that compared to other methods, our proposed method is capable of generating images with competitive quality by utilizing spectral normalization and encouraging the slow singular value decay.
more » « less
Full Text Available

Search for: All records