skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Simulation‐based sequential design
Abstract We review some simulation‐based methods to implement optimal decisions in sequential design problems as they naturally arise in clinical trial design. As a motivating example we use a stylized version of a dose‐ranging design in the ASTIN trial. The approach can be characterized as constrained backward induction. The nature of the constraint is a restriction of the decisions to a set of actions that are functions of the current history only implicitly through a low‐dimensional summary statistic. In addition, the action set is restricted to time‐invariant policies. Time‐dependence is only introduced indirectly through the change of the chosen summary statistic over time. This restriction allows computationally efficient solutions to the sequential decision problem. A further simplification is achieved by restricting optimal actions to be described by decision boundaries on the space of such summary statistics.  more » « less
Award ID(s):
1952679
PAR ID:
10368910
Author(s) / Creator(s):
 ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Pharmaceutical Statistics
Volume:
21
Issue:
4
ISSN:
1539-1604
Page Range / eLocation ID:
p. 729-739
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract We consider a decision maker who faces a binary treatment choice when their welfare is only partially identified from data. We contribute to the literature by anchoring our finite-sample analysis on mean square regret, a decision criterion advocated by Kitagawa et al. in (2022) Treatment Choice with Nonlinear Regret . We find that optimal rules are always fractional, irrespective of the width of the identified set and precision of its estimate. The optimal treatment fraction is a simple logistic transformation of the commonly used t-statistic multiplied by a factor calculated by a simple constrained optimization. This treatment fraction gets closer to 0.5 as the width of the identified set becomes wider, implying the decision maker becomes more cautious against the adversarial Nature. 
    more » « less
  2. Summary Clinicians and patients must make treatment decisions at a series of key decision points throughout disease progression. A dynamic treatment regime is a set of sequential decision rules that return treatment decisions based on accumulating patient information, like that commonly found in electronic medical record (EMR) data. When applied to a patient population, an optimal treatment regime leads to the most favorable outcome on average. Identifying optimal treatment regimes that maximize residual life is especially desirable for patients with life-threatening diseases such as sepsis, a complex medical condition that involves severe infections with organ dysfunction. We introduce the residual life value estimator (ReLiVE), an estimator for the expected value of cumulative restricted residual life under a fixed treatment regime. Building on ReLiVE, we present a method for estimating an optimal treatment regime that maximizes expected cumulative restricted residual life. Our proposed method, ReLiVE-Q, conducts estimation via the backward induction algorithm Q-learning. We illustrate the utility of ReLiVE-Q in simulation studies, and we apply ReLiVE-Q to estimate an optimal treatment regime for septic patients in the intensive care unit using EMR data from the Multiparameter Intelligent Monitoring Intensive Care database. Ultimately, we demonstrate that ReLiVE-Q leverages accumulating patient information to estimate personalized treatment regimes that optimize a clinically meaningful function of residual life. 
    more » « less
  3. Abstract Sequential decision‐making involves making informed decisions based on continuous interactions with a complex environment. This process is ubiquitous in various applications, including recommendation systems and clinical treatment design. My research has concentrated on addressing two pivotal challenges in sequential decision‐making: (1) How can we design algorithms that efficiently learn the optimal decision strategy with minimal interactions and limited sample data? (2) How can we ensure robustness in decision‐making algorithms when faced with distributional shifts due to environmental changes and the sim‐to‐real gap? This paper summarizes and expands upon the talk I presented at the AAAI 2024 New Faculty Highlights program, detailing how my research aims to tackle these challenges. 
    more » « less
  4. In hierarchical planning for Markov decision processes (MDPs), temporal abstraction allows planning with macro-actions that take place at different time scale in the form of sequential composition. In this paper, we propose a novel approach to compositional reasoning and hierarchical planning for MDPs under co-safe temporal logic constraints. In addition to sequential composition, we introduce a composition of policies based on generalized logic composition: Given sub-policies for sub-tasks and a new task expressed as logic compositions of subtasks, a semi-optimal policy, which is optimal in planning with only sub-policies, can be obtained by simply composing sub-polices. Thus, a synthesis algorithm is developed to compute optimal policies efficiently by planning with primitive actions, policies for sub-tasks, and the compositions of sub-policies, for maximizing the probability of satisfying constraints specified in the fragment of co-safe temporal logic. We demonstrate the correctness and efficiency of the proposed method in stochastic planning examples with a single agent and multiple task specifications. 
    more » « less
  5. Abstract Design thinking is essential to the success of a design process as it helps achieve the design goal by guiding design decision-making. Therefore, fundamentally understanding design thinking is vital for improving design methods, tools and theories. However, interpreting design thinking is challenging because it is a cognitive process that is hidden and intangible. In this paper, we represent design thinking as an intermediate layer between human designers’ thought processes and their design behaviors. To do so, this paper first identifies five design behaviors based on the current design theories. These behaviors include design action preference, one-step sequential behavior, contextual behavior, long-term sequential behavior, and reflective thinking behavior. Next, we develop computational methods to characterize each of the design behaviors. Particularly, we use design action distribution, first-order Markov chain, Doc2Vec, bi-directional LSTM autoencoder, and time gap distribution to characterize the five design behaviors. The characterization of the design behaviors through embedding techniques is essentially a latent representation of the design thinking, and we refer to it as design embeddings. After obtaining the embedding, an X-mean clustering algorithm is adopted to each of the embeddings to cluster designers. The approach is applied to data collected from a high school solar system design challenge. The clustering results show that designers follow several design patterns according to the corresponding behavior, which corroborates the effectiveness of using design embedding for design behavior clustering. The extraction of design embedding based on the proposed approach can be useful in other design research, such as inferring design decisions, predicting design performance, and identifying design actions identification. 
    more » « less