skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: On the existence of powerful p-values and e-values for composite hypotheses
Award ID(s):
2310718
PAR ID:
10529417
Author(s) / Creator(s):
; ;
Publisher / Repository:
Institute of Mathematical Statistics
Date Published:
Journal Name:
The Annals of Statistics
Volume:
52
Issue:
5
ISSN:
0090-5364
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. DanceSport is a competitive form of ballroom dancing. At a DanceSport event, couples perform multiple dances in front of judges. This paper shows how a goal for a couple and the judges' evaluations of the couple's dance performances can be used to formulate a weighted simple game. We explain why couples and their coaches may consider a variety of goals. We also show how prominent power values can be used to measure the contributions of dance performances to achieving certain goals. As part of our analysis, we develop novel visual representations of the Banzhaf and Shapley-Shubik index profiles for different thresholds. In addition, we show that the "quota paradox" is relevant for DanceSport events. 
    more » « less
  2. Joel Sobel (Ed.)
  3. In the context of supervised parametric models, we introduce the concept of e-values. An e-value is a scalar quantity that represents the proximity of the sampling distribution of parameter estimates in a model trained on a subset of features to that of the model trained on all features (i.e. the full model). Under general conditions, a rank ordering of e-values separates models that contain all essential features from those that do not. The e-values are applicable to a wide range of parametric models. We use data depths and a fast resampling-based algorithm to implement a feature selection procedure using e-values, providing consistency results. For a p-dimensional feature space, this procedure requires fitting only the full model and evaluating p + 1 models, as opposed to the traditional requirement of fitting and evaluating 2^p models. Through experiments across several model settings and synthetic and real datasets, we establish that the e-values method as a promising general alternative to existing model-specific methods of feature selection 
    more » « less