NSF PAR Search | NSF Public Access Repository

Discovering Mixtures of Structural Causal Models from Time Series Data

Varambally, Sumanth; Ma, Yi-An; Yu, Rose (July 2024, Proceedings of Machine Learning Research)

Full Text Available
Demystifying SGD with Doubly Stochastic Gradients

Kim, Kyurae; Ko, Joohwan; Ma, Yi-An; Gardner, Jacob R (July 2024, International Conference on Machine Learning (ICML 2024))

Full Text Available
Linear Convergence of Black-Box Variational Inference: Should We Stick the Landing?

Kim, Kyurae; Ma, Yi-An; Gardner, Jacob R (May 2024, Conference on Artificial Intelligence and Statistics (AISTATS 2024))

We prove that black-box variational inference (BBVI) with control variates, particularly the sticking-the-landing (STL) estimator, converges at a geometric (traditionally called “linear”) rate under perfect variational family specification. In particular, we prove a quadratic bound on the gradient variance of the STL estimator, one which encompasses misspecified variational families. Combined with previous works on the quadratic variance condition, this directly implies convergence of BBVI with the use of projected stochastic gradient descent. For the projection operator, we consider a domain with triangular scale matrices, onto which the projection is computable in O(𝑑) time, where 𝑑 is the dimensionality of the target posterior. We also improve existing analysis on the regular closed-form entropy gradient estimators, which enables comparison against the STL estimator, providing explicit non-asymptotic complexity guarantees for both.
Full Text Available
Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo

Huang, Xunpeng; Zou, Difan; Dong, Hanze; Ma, Yi-An; Zhang, Tong (June 2024, Proceedings of Thirty Seventh Conference on Learning Theory)

Full Text Available
Estimate exponential memory decay in hidden Markov model and its applications to inference

https://doi.org/10.1016/j.physd.2024.134053

Ye, Felix X-F; Ma, Yi-An; Qian, Hong (April 2024, Physica D: Nonlinear Phenomena)

Full Text Available
Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Lin, Yingyu; Ma, Yi-An; Wang, Yu-Xiang; Redberg, Rachel; Bu, Zhiqi (May 2024, ICLR 2024)

Posterior sampling, i.e., exponential mechanism to sample from the posterior distribution, provides ε-pure differential privacy (DP) guarantees and does not suffer from potentially unbounded privacy breach introduced by (ε,δ)-approximate DP. In practice, however, one needs to apply approximate sampling methods such as Markov chain Monte Carlo (MCMC), thus re-introducing the unappealing δ-approximation error into the privacy guarantees. To bridge this gap, we propose the Approximate SAample Perturbation (abbr. ASAP) algorithm which perturbs an MCMC sample with noise proportional to its Wasserstein-infinity (W∞) distance from a reference distribution that satisfies pure DP or pure Gaussian DP (i.e., δ=0). We then leverage a Metropolis-Hastings algorithm to generate the sample and prove that the algorithm converges in W∞ distance. We show that by combining our new techniques with a localization step, we obtain the first nearly linear-time algorithm that achieves the optimal rates in the DP-ERM problem with strongly convex and smooth losses.
Full Text Available
On the Convergence of Black-Box Variational Inference

Kim, Kyurae; Oh, Jisu; Wu, Kaiwen; Ma, Yi-An; Gardner, Jacob R (December 2023, Neural Information Processing Systems (NeurIPS 2023))

Full Text Available
Posterior sampling with delayed feedback for reinforcement learning with linear function approximation

Kuang, Nikki Lijing; Yin, Ming; Wang, Mengdi; Wang, Yu-Xiang; Ma, Yi-An (December 2023, Proceedings of Machine Learning Research)

Recent studies in reinforcement learning (RL) have made significant progress by leveraging function approximation to alleviate the sample complexity hurdle for better performance. Despite the success, existing provably efficient algorithms typically rely on the accessibility of immediate feedback upon taking actions. The failure to account for the impact of delay in observations can significantly degrade the performance of real-world systems due to the regret blow-up. In this work, we tackle the challenge of delayed feedback in RL with linear function approximation by employing posterior sampling, which has been shown to empirically outperform the popular UCB algorithms in a wide range of regimes. We first introduce Delayed-PSVI, an optimistic value-based algorithm that effectively explores the value function space via noise perturbation with posterior sampling. We provide the first analysis for posterior sampling algorithms with delayed feedback in RL and show our algorithm achieves $\widetilde{O}(\sqrt{d^3H^3 T} + d^2H^2 E[\tau])$ worst-case regret in the presence of unknown stochastic delays. Here $E[\tau]$ is the expected delay. To further improve its computational efficiency and to expand its applicability in high-dimensional RL problems, we incorporate a gradient-based approximate sampling scheme via Langevin dynamics for Delayed-LPSVI, which maintains the same order-optimal regret guarantee with $\widetilde{O}(dHK)$ computational cost. Empirical evaluations are performed to demonstrate the statistical and computational efficacy of our algorithms.
Full Text Available
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation

Kuang, Nikki; Yin, Ming; Wang, Mengdi; Wang, Yu-Xiang; Ma, Yi-An (November 2023, 37th Conference on Neural Information Processing Systems (NeurIPS 2023))

Full Text Available
