The objective of this paper is to study the optimal consumption and portfolio choice problem of risk-controlled investors who strive to maximize total expected discounted utility of both consumption and terminal wealth. Risk is measured by the variance of terminal wealth, which introduces a nonlinear function of the expected value into the control problem. The control problem presented is no longer a standard stochastic control problem but rather, a mean field-type control problem. The optimal portfolio and consumption rules are obtained explicitly. Numerical results shed light on the importance of controlling variance risk. The optimal investment policy is nonmyopic, and consumption is not sacrificed.
more »
« less
This content will become publicly available on September 1, 2026
On Merton's optimal portfolio problem with sporadic bankruptcy for isoelastic utility
In this paper, we consider a stock that follows a geometric Brownian motion (GBM) and a riskless asset continuously compounded at a constant rate. We assume that the stock can go bankrupt, i.e. lose all of its value, at some exogenous random time (independent of the stock price) modeled as the first arrival time of a homogeneous Poisson process. For this setup, we study Merton’s optimal portfolio problem consisting in maximizing the expected isoelastic utility of the total wealth at a given finite maturity time. We obtain an analytical solution using coupled Hamilton–Jacobi–Bellman (HJB) equations. The optimal strategy bans borrowing and never allocates more wealth into the stock than the classical Merton ratio recommends. For nonlogarithmic isoelastic utilities, the optimal weights are nonmyopic. This is an example where a realistic problem, being merely a slight modification of the usual GBM model, leads to nonmyopic weights. For logarithmic utility, we additionally present an alternative derivation using a stochastic integral and verify that the weights obtained are identical to our first approach. We also present an example for our strategy applied to a stock with nonzero bankruptcy probability.
more »
« less
- Award ID(s):
- 2402544
- PAR ID:
- 10657305
- Publisher / Repository:
- World Scientific Publishing
- Date Published:
- Journal Name:
- International Journal of Theoretical and Applied Finance
- Volume:
- 28
- Issue:
- 05n06
- ISSN:
- 0219-0249
- Subject(s) / Keyword(s):
- Markov-switching processes absorbing processes Hamilton–Jacobi–Bellman equation nonmyopic strategies isoelastic utility
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Finite-horizon sequential experimental design (SED) arises naturally in many contexts, including hyperparameter tuning in machine learning among more traditional settings. Computing the optimal policy for such problems requires solving Bellman equations, which are generally intractable. Most existing work resorts to severely myopic approximations by limiting the decision horizon to only a single time-step, which can underweight exploration in favor of exploitation. We present BINOCULARS: Batch-Informed NOnmyopic Choices, Using Long-horizons for Adaptive, Rapid SED, a general framework for deriving efficient, nonmyopic approximations to the optimal experimental policy. Our key idea is simple and surprisingly effective: we first compute a one-step optimal batch of experiments, then select a single point from this batch to evaluate. We realize BINOCULARS for Bayesian optimization and Bayesian quadrature -- two notable example problems with radically different objectives -- and demonstrate that BINOCULARS significantly outperforms significantly outperforms myopic alternatives in real-world scenarios.more » « less
-
In this paper, we develop an optimal weight adap- tation strategy of model predictive control (MPC) for connected and automated vehicles (CAVs) in mixed traffic. We model the interaction between a CAV and a human-driven vehicle (HDV) as a simultaneous game and formulate a game-theoretic MPC problem to find a Nash equilibrium of the game. In the MPC problem, the weights in the HDV’s objective function can be learned online using moving horizon inverse reinforcement learning. Using Bayesian optimization, we propose a strategy to optimally adapt the weights in the CAV’s objective function so that the expected true cost when using MPC in simulations can be minimized. We validate the effectiveness of the optimal strategy by numerical simulations of a vehicle crossing example at an unsignalized intersection.more » « less
-
We study settings where a set of identical, reusable resources must be allocated in an online fashion to arriving agents. Each arriving agent is patient and willing to wait for some period of time to be matched. When matched, each agent occupies a resource for a certain amount of time, and then releases it, gaining some utility from having done so. The goal of the system designer is to maximize overall utility given some prior knowledge of the distribution of arriving agents. We are particularly interested in settings where demand for the resources far outstrips supply, as is typical in the provision of social services, for example homelessness resources. We formulate this problem as online bipartite matching with reusable resources and patient agents. We develop new, efficient nonmyopic algorithms for this class of problems, and compare their performance with that of greedy algorithms in a variety of simulated settings, as well as in a setting calibrated to real-world data on household demand for homelessness services. We find substantial overall welfare benefits to using our nonmyopic algorithms, particularly in more extreme settings – those where agents are unwilling or unable to wait for resources, and where the ratio of resource demand to supply is particularly high.more » « less
-
Abstract We develop a continuous‐time control approach to optimal trading in a Proof‐of‐Stake (PoS) blockchain, formulated as a consumption‐investment problem that aims to strike the optimal balance between a participant's (or agent's) utility from holding/trading stakes and utility from consumption. We present solutions via dynamic programming and the Hamilton–Jacobi–Bellman (HJB) equations. When the utility functions are linear or convex, we derive close‐form solutions and show that the bang‐bang strategy is optimal (i.e., always buy or sell at full capacity). Furthermore, we bring out the explicit connection between the rate of return in trading/holding stakes and the participant's risk‐adjusted valuation of the stakes. In particular, we show when a participant is risk‐neutral or risk‐seeking, corresponding to the risk‐adjusted valuation being a martingale or a sub‐martingale, the optimal strategy must be to either buy all the time, sell all the time, or first buy then sell, and with both buying and selling executed at full capacity. We also propose a risk‐control version of the consumption‐investment problem; and for a special case, the “stake‐parity” problem, we show a mean‐reverting strategy is optimal.more » « less
An official website of the United States government
