NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Post-Selection Inference

https://doi.org/10.1146/annurev-statistics-100421-044639

Kuchibhotla, Arun K.; Kolassa, John E.; Kuffner, Todd A. (March 2022, Annual Review of Statistics and Its Application)

We discuss inference after data exploration, with a particular focus on inference after model or variable selection. We review three popular approaches to this problem: sample splitting, simultaneous inference, and conditional selective inference. We explain how each approach works and highlight its advantages and disadvantages. We also provide an illustration of these post-selection inference approaches.
more » « less
Full Text Available
Bayesian inference on volatility in the presence of infinite jump activity and microstructure noise

https://doi.org/10.1214/20-EJS1794

Wang, Qi; Figueroa-López, José E.; Kuffner, Todd A. (January 2021, Electronic Journal of Statistics)
null (Ed.)
Full Text Available
Block bootstrap optimality and empirical block selection for sample quantiles with dependent data

https://doi.org/10.1093/biomet/asaa075

Kuffner, T A; Lee, S M; Young, G A (September 2020, Biometrika)

Summary We establish a general theory of optimality for block bootstrap distribution estimation for sample quantiles under mild strong mixing conditions. In contrast to existing results, we study the block bootstrap for varying numbers of blocks. This corresponds to a hybrid between the sub- sampling bootstrap and the moving block bootstrap, in which the number of blocks is between 1 and the ratio of sample size to block length. The hybrid block bootstrap is shown to give theoretical benefits, and startling improvements in accuracy in distribution estimation in important practical settings. The conclusion that bootstrap samples should be of smaller size than the original sample has significant implications for computational efficiency and scalability of bootstrap methodologies with dependent data. Our main theorem determines the optimal number of blocks and block length to achieve the best possible convergence rate for the block bootstrap distribution estimator for sample quantiles. We propose an intuitive method for empirical selection of the optimal number and length of blocks, and demonstrate its value in a nontrivial example.
more » « less
Full Text Available
On the validity of the formal Edgeworth expansion for posterior densities

https://doi.org/10.1214/19-AOS1871

Kolassa, John E.; Kuffner, Todd A. (August 2020, The Annals of Statistics)
null (Ed.)
Full Text Available
Discussion: Models as Approximations

https://doi.org/10.1214/19-STS756

Ghanem, Dalia; Kuffner, Todd A. (November 2019, Statistical Science)

Full Text Available
On prediction of future insurance claims when the model is uncertain

Hong, Liang; Kuffner, Todd; Martin, Ryan (January 2019, Variance)

Predictive modeling is arguably one of the most important tasks actuaries face in their day-to-day work. In practice, actuaries may have a number of reasonable models to consider, all of which will provide different predictions. The most common strategy is first to use some kind of model selection tool to select a ``best model" and then to use that model to make predictions. However, there is reason to be concerned about the use of the classical distribution theory to develop predictions because this theory ignores the selection effect. Since accuracy of predictions is crucial to the insurer’s pricing and solvency, care is needed to develop valid prediction methods. This paper investigates the effects of model selection on the validity of classical prediction tools and makes some recommendations for practitioners.
more » « less
Full Text Available
Principled Statistical Inference in Data Science

https://doi.org/10.1142/9781786345400_0002

Kuffner, Todd A.; Young, G.A. (June 2018, Statistical Data Science)

We discuss the challenges of principled statistical inference in modern data science. Conditionality principles are argued as key to achieving valid statistical inference, in particular when this is performed after selecting a model from sample data itself.
more » « less
Full Text Available
On overfitting and post-selection uncertainty assessments

https://doi.org/10.1093/biomet/asx083

Hong, L; Kuffner, T A; Martin, R (January 2018, Biometrika)

Full Text Available

Search for: All records