NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Convergence of Policy Iteration for Entropy-Regularized Stochastic Control Problems

https://doi.org/10.1137/24M1638744

Huang, Yu-Jui; Wang, Zhenhua; Zhou, Zhou (April 2025, SIAM Journal on Control and Optimization)

Free, publicly-accessible full text available April 30, 2026
Relaxed Equilibria for Time-Inconsistent Markov Decision Processes

https://doi.org/10.1287/moor.2023.0209

Bayraktar, Erhan; Huang, Yu-Jui; Wang, Zhenhua; Zhou, Zhou (October 2024, Mathematics of Operations Research)

This paper considers an infinite-horizon Markov decision process (MDP) that allows for general nonexponential discount functions in both discrete and continuous time. Because of the inherent time inconsistency, we look for a randomized equilibrium policy (i.e., relaxed equilibrium) in an intrapersonal game between an agent’s current and future selves. When we modify the MDP by entropy regularization, a relaxed equilibrium is shown to exist by a nontrivial entropy estimate. As the degree of regularization diminishes, the entropy-regularized MDPs approximate the original MDP, which gives the general existence of a relaxed equilibrium in the limit by weak convergence arguments. As opposed to prior studies that consider only deterministic policies, our existence of an equilibrium does not require any convexity (or concavity) of the controlled transition probabilities and reward function. Interestingly, this benefit of considering randomized policies is unique to the time-inconsistent case.
more » « less
Full Text Available
On characterizing optimal Wasserstein GAN solutions for non-Gaussian data

https://doi.org/10.1109/ISIT54713.2023.10206785

Huang, Yu-Jui; Lin, Shih-Chun; Huang, Yu-Chih; Lyu, Kuan-Hui; Shen, Hsin-Hua; Lin, Wan-Yi (June 2023, 2023 IEEE International Symposium on Information Theory (ISIT))

Full Text Available
GANs as Gradient Flows that Converge

Huang, Yu-Jui; Zhang, Yuchong (June 2023, Journal of machine learning research)

Full Text Available
Minimizing the Repayment Cost of Federal Student Loans

https://doi.org/10.1137/22M1505840

Guasoni, Paolo; Huang, Yu-Jui (August 2022, SIAM Review)

Full Text Available

Search for: All records