skip to main content


Title: One-dimensional empirical measures, order statistics, and Kantorovich transport distances
This work is devoted to the study of rates of convergence of the empirical measures μn over a sample (Xk) of independent identically distributed real-valued random variables towards the common distribution μ in Kantorovich transport distances Wp. The focus is on finite range bounds on the expected Kantorovich distances E(Wp(μn, μ)) in terms of moments and analytic conditions on the measure μ and its distribution function. The study describes a variety of rates, from the standard one to slower rates, and both lower and upper-bounds on E(Wp(μn,μ)) for fixed n in various instances. Order statistics, reduction to uniform samples and analysis of beta distributions, inverse distribution functions, log-concavity are main tools in the investigation. Two detailed appendices collect classical and some new facts on inverse distribution functions and beta distributions and their densities necessary to the investigation.  more » « less
Award ID(s):
1855575
NSF-PAR ID:
10147991
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Memoirs of the American Mathematical Society
Volume:
261
Issue:
1259
ISSN:
1947-6221
Page Range / eLocation ID:
v+126
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Structured population models are among the most widely used tools in ecology and evolution. Integral projection models (IPMs) use continuous representations of how survival, reproduction and growth change as functions of state variables such as size, requiring fewer parameters to be estimated than projection matrix models (PPMs). Yet, almost all published IPMs make an important assumption that size‐dependent growth transitions are or can be transformed to be normally distributed. In fact, many organisms exhibit highly skewed size transitions. Small individuals can grow more than they can shrink, and large individuals may often shrink more dramatically than they can grow. Yet, the implications of such skew for inference from IPMs has not been explored, nor have general methods been developed to incorporate skewed size transitions into IPMs, or deal with other aspects of real growth rates, including bounds on possible growth or shrinkage.

    Here, we develop a flexible approach to modelling skewed growth data using a modified beta regression model. We propose that sizes first be converted to a (0,1) interval by estimating size‐dependent minimum and maximum sizes through quantile regression. Transformed data can then be modelled using beta regression with widely available statistical tools. We demonstrate the utility of this approach using demographic data for a long‐lived plant, gorgonians and an epiphytic lichen. Specifically, we compare inferences of population parameters from discrete PPMs to those from IPMs that either assume normality or incorporate skew using beta regression or, alternatively, a skewed normal model.

    The beta and skewed normal distributions accurately capture the mean, variance and skew of real growth distributions. Incorporating skewed growth into IPMs decreases population growth and estimated life span relative to IPMs that assume normally distributed growth, and more closely approximate the parameters of PPMs that do not assume a particular growth distribution. A bounded distribution, such as the beta, also avoids the eviction problem caused by predicting some growth outside the modelled size range.

    Incorporating biologically relevant skew in growth data has important consequences for inference from IPMs. The approaches we outline here are flexible and easy to implement with existing statistical tools.

     
    more » « less
  2. Abstract

    The search for neutrino events in correlation with gravitational wave (GW) events for three observing runs (O1, O2 and O3) from 09/2015 to 03/2020 has been performed using the Borexino data-set of the same period. We have searched for signals of neutrino-electron scattering and inverse beta-decay (IBD) within a time window of$$\pm \, 1000$$±1000 s centered at the detection moment of a particular GW event. The search was done with three visible energy thresholds of 0.25, 0.8 and 3.0 MeV. Two types of incoming neutrino spectra were considered: the mono-energetic line and the supernova-like spectrum. GW candidates originated by merging binaries of black holes (BHBH), neutron stars (NSNS) and neutron star and black hole (NSBH) were analyzed separately. Additionally, the subset of most intensive BHBH mergers at closer distances and with larger radiative mass than the rest was considered. In total, follow-ups of 74 out of 93 gravitational waves reported in the GWTC-3 catalog were analyzed and no statistically significant excess over the background was observed. As a result, the strongest upper limits on GW-associated neutrino and antineutrino fluences for all flavors ($$\nu _e, \nu _\mu , \nu _\tau $$νe,νμ,ντ) at the level$$10^9{-}10^{15}~\textrm{cm}^{-2}\,\textrm{GW}^{-1}$$109-1015cm-2GW-1have been obtained in the 0.5–5 MeV neutrino energy range.

     
    more » « less
  3. null (Ed.)
    A bstract We consider a class of models in which the neutrinos acquire Majorana masses through mixing with singlet neutrinos that emerge as composite states of a strongly coupled hidden sector. In this framework, the light neutrinos are partially composite particles that obtain their masses through the inverse seesaw mechanism. We focus on the scenario in which the strong dynamics is approximately conformal in the ultraviolet, and the compositeness scale lies at or below the weak scale. The small parameters in the Lagrangian necessary to realize the observed neutrino masses can naturally arise as a consequence of the scaling dimensions of operators in the conformal field theory. We show that this class of models has interesting implications for a wide variety of experiments, including colliders and beam dumps, searches for lepton flavor violation and neutrinoless double beta decay, and cosmological observations. At colliders and beam dumps, this scenario can give rise to striking signals involving multiple displaced vertices. The exchange of hidden sector states can lead to observable rates for flavor violating processes such as μ → eγ and μ → e conversion. If the compositeness scale lies at or below a hundred MeV, the rate for neutrinoless double beta decay is suppressed by form factors and may be reduced by an order of magnitude or more. The late decays of relic singlet neutrinos can give rise to spectral distortions in the cosmic microwave background that are large enough to be observed in future experiments. 
    more » « less
  4. Causal effects are often characterized with averages, which can give an incomplete picture of the underlying counterfactual distributions. Here we consider estimating the entire counterfactual density and generic functionals thereof. We focus on two kinds of target parameters. The first is a density approximation, defined by a projection onto a finite-dimensional model using a generalized distance metric, which includes f-divergences as well as Lp norms. The second is the distance between counterfactual densities, which can be used as a more nuanced effect measure than the mean difference, and as a tool for model selection. We study nonparametric efficiency bounds for these targets, giving results for smooth but otherwise generic models and distances. Importantly, we show how these bounds connect to means of particular non-trivial functions of counterfactuals, linking the problems of density and mean estimation. We go on to propose doubly robust-style estimators for the density approximations and distances, and study their rates of convergence, showing they can be optimally efficient in large nonparametric models. We also give analogous methods for model selection and aggregation, when many models may be available and of interest. Our results all hold for generic models and distances, but throughout we highlight what happens for particular choices, such as L2 projections on linear models, and KL projections on exponential families. Finally we illustrate by estimating the density of CD4 count among patients with HIV, had all been treated with combination therapy versus zidovudine alone, as well as a density effect. Our results suggest combination therapy may have increased CD4 count most for high-risk patients. Our methods are implemented in the freely available R package npcausal on GitHub. 
    more » « less
  5. null (Ed.)
    Sensitivity properties describe how changes to the input of a program affect the output, typically by upper bounding the distance between the outputs of two runs by a monotone function of the distance between the corresponding inputs. When programs are probabilistic, the distance between outputs is a distance between distributions. The Kantorovich lifting provides a general way of defining a distance between distributions by lifting the distance of the underlying sample space; by choosing an appropriate distance on the base space, one can recover other usual probabilistic distances, such as the Total Variation distance. We develop a relational pre-expectation calculus to upper bound the Kantorovich distance between two executions of a probabilistic program. We illustrate our methods by proving algorithmic stability of a machine learning algorithm, convergence of a reinforcement learning algorithm, and fast mixing for card shuffling algorithms. We also consider some extensions: using our calculus to show convergence of Markov chains to the uniform distribution over states and an asynchronous extension to reason about pairs of program executions with different control flow. 
    more » « less