Estimating location parameters in sample-heterogeneous distributions

Pensia, Ankit; Jog, Varun; Loh, Po-Ling

doi:10.1093/imaiai/iaab013

Citation Details

Estimating location parameters in sample-heterogeneous distributions

Abstract Estimating the mean of a probability distribution using i.i.d. samples is a classical problem in statistics, wherein finite-sample optimal estimators are sought under various distributional assumptions. In this paper, we consider the problem of mean estimation when independent samples are drawn from $$d$$-dimensional non-identical distributions possessing a common mean. When the distributions are radially symmetric and unimodal, we propose a novel estimator, which is a hybrid of the modal interval, shorth and median estimators and whose performance adapts to the level of heterogeneity in the data. We show that our estimator is near optimal when data are i.i.d. and when the fraction of ‘low-noise’ distributions is as small as $$\varOmega \left (\frac{d \log n}{n}\right )$$, where $$n$$ is the number of samples. We also derive minimax lower bounds on the expected error of any estimator that is agnostic to the scales of individual data points. Finally, we extend our theory to linear regression. In both the mean estimation and regression settings, we present computationally feasible versions of our estimators that run in time polynomial in the number of data points. more »

Award ID(s):: 1749857 1907786

PAR ID:: 10289004

Author(s) / Creator(s):: Pensia, Ankit; Jog, Varun; Loh, Po-Ling

Date Published:: 2021-06-03

Journal Name:: Information and Inference: A Journal of the IMA

ISSN:: 2049-8772

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1093/imaiai/iaab013

More Like this