skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Sample size re‐estimation in Phase 2 dose‐finding: Conditional power versus Bayesian predictive power
Abstract Unblinded sample size re‐estimation (SSR) is often planned in a clinical trial when there is large uncertainty about the true treatment effect. For Proof‐of Concept (PoC) in a Phase II dose finding study, contrast test can be adopted to leverage information from all treatment groups. In this article, we propose two‐stage SSR designs using frequentist conditional power (CP) and Bayesian predictive power (PP) for both single and multiple contrast tests. The Bayesian SSR can be implemented under a wide range of prior settings to incorporate different prior knowledge. Taking the adaptivity into account, all type I errors of final analysis in this paper are rigorously protected. Simulation studies are carried out to demonstrate the advantages of unblinded SSR in multi‐arm trials.  more » « less
Award ID(s):
2210371
PAR ID:
10401115
Author(s) / Creator(s):
 ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Pharmaceutical Statistics
Volume:
22
Issue:
2
ISSN:
1539-1604
Page Range / eLocation ID:
p. 349-364
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Trans-arterial chemoembolization (TACE) is an important yet variably effective treatment for management of hepatic malignancies. Lack of response can be in part due to inability to assess treatment adequacy in real-time. Gold-standard contrast enhanced computed tomography and magnetic resonance imaging, although effective, suffer from treatment-induced artifacts that prevent early treatment evaluation. Non-contrast ultrasound is a potential solution but has historically been ineffective at detecting treatment response. Here, we propose non-contrast ultrasound with recent perfusion-focused advancements as a tool for immediate evaluation of TACE. We demonstrate initial feasibility in an 11-subject pilot study. Treatment-induced changes in tumor perfusion are detected best when combining adaptive demodulation (AD) and singular value decomposition (SVD) techniques. Using a 0.5 s (300-sample) ensemble size, AD + SVD resulted in a 7.42 dB median decrease in tumor power after TACE compared to only a 0.06 dB median decrease with conventional methods. 
    more » « less
  2. Summary Power analyses are an important aspect of experimental design, because they help determine how experiments are implemented in practice. It is common to specify a desired level of power and compute the sample size necessary to obtain that power. Such calculations are well known for completely randomized experiments, but there can be many benefits to using other experimental designs. For example, it has recently been established that rerandomization, where subjects are randomized until covariate balance is obtained, increases the precision of causal effect estimators. This work establishes the power of rerandomized treatment-control experiments, thereby allowing for sample size calculators. We find the surprising result that, while power is often greater under rerandomization than complete randomization, the opposite can occur for very small treatment effects. The reason is that inference under rerandomization can be relatively more conservative, in the sense that it can have a lower Type-I error at the same nominal significance level, and this additional conservativeness adversely affects power. This surprising result is due to treatment effect heterogeneity, a quantity often ignored in power analyses. We find that heterogeneity increases power for large effect sizes, but decreases power for small effect sizes. 
    more » « less
  3. A modern power system is characterized by an increasing penetration of wind power, which results in large uncertainties in its states. These uncertainties must be quantified properly; otherwise, the system security may be threatened. Facing this challenge, we propose a cost-effective, data-driven approach for the probabilistic load-margin assessment problem. Using actual wind data, a kernel-density-estimator is applied to infer the nonparametric wind speed distributions, which are further merged into the framework of a vine copula. The latter enables us to simulate complex multivariate and highly dependent model inputs with a variety of bivariate copulae that precisely represent the tail dependence in the correlated samples. Furthermore, to reduce the prohibitive computational time of the traditional Monte-Carlo simulations processing a large amount of samples, we propose to use a nonparametric, Gaussian-process-emulator-based reduced-order model to replace the original complicated continuation power-flow model through a Bayesian learning framework. To accelerate the convergence rate of this Bayesian algorithm, a truncated polynomial chaos surrogate is developed, which serves as a highly efficient, parametric Bayesian prior. This emulator allows us to execute the time-consuming continuation power-flow solver at the sampled values with a negligible computational cost. Simulation results reveal the impressive performances of the proposed method. 
    more » « less
  4. ABSTRACT We present a comprehensive simulation-based study of the bayeseor code for 21 cm power spectrum recovery when analytically marginalizing over foreground parameters. To account for covariance between the 21 cm signal and contaminating foreground emission, bayeseor jointly constructs models for both signals within a Bayesian framework. Due to computational constraints, the forward model is constructed using a restricted field of view (FoV) in the image domain. When the only Epoch of Reionization contaminants are noise and foregrounds, we demonstrate that bayeseor can accurately recover the 21 cm power spectrum when the component of sky emission outside this forward-modelled region is downweighted by the beam at the level of the dynamic range between the foreground and 21 cm signals. However, when all-sky foreground emission is included along with a realistic instrument primary beam with sidelobes above this threshold extending to the horizon, the recovered power spectrum is contaminated by unmodelled sky emission outside the restricted FoV model. Expanding the combined cosmological and foreground model to cover the whole sky is computationally prohibitive. To address this, we present a modified version of bayeseor that allows for an all-sky foreground model, while the modelled 21 cm signal remains only within the primary FoV of the telescope. With this modification, it will be feasible to run an all-sky bayeseor analysis on a sizeable compute cluster. We also discuss several future directions for further reducing the need to model all-sky foregrounds, including wide-field foreground subtraction, an image-domain likelihood utilizing a tapering function, and instrument primary beam design. 
    more » « less
  5. Strictly proper scoring rules (SPSR) are incentive compatible for eliciting information about random variables from strategic agents when the principal can reward agents after the realization of the random variables. They also quantify the quality of elicited information, with more accurate predictions receiving higher scores in expectation. In this paper, we extend such scoring rules to settings where a principal elicits private probabilistic beliefs but only has access to agents’ reports. We name our solution Surrogate Scoring Rules (SSR). SSR is built on a bias correction step and an error rate estimation procedure for a reference answer defined using agents’ reports. We show that, with a little information about the prior distribution of the random variables, SSR in a multi-task setting recover SPSR in expectation, as if having access to the ground truth. Therefore, a salient feature of SSR is that they quantify the quality of information despite the lack of ground truth, just as SPSR do for the setting with ground truth. As a by-product, SSR induce dominant uniform strategy truthfulness in reporting. Our method is verified both theoretically and empirically using data collected from real human forecasters. 
    more » « less