

Search for: All records

Award ID contains: 2206038


  1. Abstract This paper is concerned with the problem of capital provision in a large particle system modeled by stochastic differential equations involving hitting times, which arises from considerations of systemic risk in a financial network. Motivated by Tang and Tsai, we focus on the number or proportion of surviving entities that never default as a measure of systemic robustness. First, we show that the mean-field particle system and its limiting McKean–Vlasov equation are both well-posed by virtue of the notion of minimal solutions. We then establish a connection between the proportion of surviving entities in the large particle system and the probability of default in the McKean–Vlasov equation as the size of the interacting particle system tends to infinity. Finally, we study the asymptotic efficiency of capital provision for different drifts, which are linked to the economic regime: depending on the drift, the expected number of surviving entities either has a uniform upper bound or grows at one of two distinct orders in the system size; in the last regime, the effect of capital provision is negligible.
    Free, publicly-accessible full text available February 5, 2026
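     The model itself is not reproduced in this listing; as a hypothetical generic form of the kind of system the abstract describes, a mean-field particle system with absorption at default can be written as

     \[
     dX_t^i = b\big(X_t^i, \mu_t^N\big)\,dt + \sigma\,dB_t^i, \qquad \tau^i = \inf\{t \ge 0 : X_t^i \le 0\}, \qquad \mu_t^N = \frac{1}{N}\sum_{j=1}^N \delta_{X_t^j}\,\mathbf{1}_{\{\tau^j > t\}},
     \]

     where the drift b couples each entity to the empirical measure of the survivors. In the McKean–Vlasov limit, \mu_t^N is replaced by the law of a single representative particle, and the proportion of survivors \frac{1}{N}\,\#\{i : \tau^i > T\} converges to the survival probability \mathbb{P}(\tau > T) as N \to \infty, which is the connection the abstract refers to.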
  2. Abstract We develop a continuous-time control approach to optimal trading in a Proof-of-Stake (PoS) blockchain, formulated as a consumption-investment problem that aims to strike the optimal balance between a participant's (or agent's) utility from holding/trading stakes and utility from consumption. We present solutions via dynamic programming and the Hamilton–Jacobi–Bellman (HJB) equations. When the utility functions are linear or convex, we derive closed-form solutions and show that the bang-bang strategy is optimal (i.e., always buy or sell at full capacity). Furthermore, we bring out the explicit connection between the rate of return in trading/holding stakes and the participant's risk-adjusted valuation of the stakes. In particular, we show that when a participant is risk-neutral or risk-seeking, corresponding to the risk-adjusted valuation being a martingale or a sub-martingale, the optimal strategy must be to either buy all the time, sell all the time, or first buy then sell, with both buying and selling executed at full capacity. We also propose a risk-control version of the consumption-investment problem; and for a special case, the "stake-parity" problem, we show that a mean-reverting strategy is optimal.
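     The exact dynamics and utilities are not given in this listing; as a hypothetical sketch of a consumption-investment problem of this type, with consumption rate c, a bounded trading rate \pi \in [-\bar\pi, \bar\pi], discount rate \beta, and state x (the stake holding), the value function satisfies an HJB equation of the form

     \[
     \beta V(x) = \sup_{c,\,\pi} \Big\{ U_1(c) + U_2(x) + \mathcal{L}^{c,\pi} V(x) \Big\},
     \]

     where \mathcal{L}^{c,\pi} is the generator of the controlled state process. When the utilities are linear or convex, the expression inside the supremum is convex in \pi, so the supremum over the interval [-\bar\pi, \bar\pi] is attained at an endpoint; this is the bang-bang structure (always buy or sell at full capacity) noted in the abstract.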
  3. Free, publicly-accessible full text available June 30, 2026
  4. Free, publicly-accessible full text available February 28, 2026
  5. Free, publicly-accessible full text available January 22, 2026
  6. Free, publicly-accessible full text available January 22, 2026
  7. Preference tuning is a crucial process for aligning deep generative models with human preferences. This survey offers a thorough overview of recent advances in preference tuning and the integration of human feedback. The paper is organized into three main sections: 1) introduction and preliminaries: reinforcement learning frameworks, preference-tuning tasks, models, and datasets across the language, speech, and vision modalities, as well as different policy approaches; 2) in-depth exploration of each preference-tuning approach: a detailed analysis of the methods used in preference tuning; and 3) applications, discussion, and future directions: applications of preference tuning in downstream tasks, evaluation methods for different modalities, and an outlook on future research directions. Our objective is to present the latest methodologies in preference tuning and model alignment, enhancing the understanding of this field for researchers and practitioners, and to encourage further engagement and innovation in this area. A companion repository is available at https://github.com/hanyang1999/Preference-Tuning-with-Human-Feedback.
    Free, publicly-accessible full text available January 6, 2026
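     The survey covers many preference-tuning methods; as one concrete and widely used instance (chosen here for illustration, not drawn verbatim from the survey), a minimal sketch of the direct preference optimization (DPO) loss, which tunes a policy on pairs of chosen/rejected responses without an explicit reward model:

     import torch
     import torch.nn.functional as F

     def dpo_loss(policy_chosen_logps, policy_rejected_logps,
                  ref_chosen_logps, ref_rejected_logps, beta=0.1):
         """DPO loss on a batch of preference pairs.

         Each argument is a tensor of summed log-probabilities of a full
         response under the trainable policy or the frozen reference model.
         """
         # Implicit reward of each response: how much the policy favors it
         # relative to the reference model.
         chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
         rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
         # Push up the margin between chosen and rejected rewards.
         return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

     # Toy usage with made-up log-probabilities for a batch of two pairs.
     loss = dpo_loss(torch.tensor([-12.3, -8.1]), torch.tensor([-14.0, -9.5]),
                     torch.tensor([-12.0, -8.4]), torch.tensor([-13.5, -9.2]))
     print(loss.item())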