skip to main content

Title: Scattering Statistics of Generalized Spatial Poisson Point Processes
We present a machine learning model for the analysis of randomly generated discrete signals, modeled as the points of an inhomogeneous, compound Poisson point process. Like the wavelet scattering transform introduced by Mallat, our construction is naturally invariant to translations and reflections, but it decouples the roles of scale and frequency, replacing wavelets with Gabor-type measurements. We show that, with suitable nonlinearities, our measurements distinguish Poisson point processes from common self-similar processes, and separate different types of Poisson point processes.  more » « less
Award ID(s):
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Page Range / eLocation ID:
5528 - 5532
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Point processes provide a powerful framework for modeling the distribution and interactions of events in time or space. Their flexibility has given rise to a variety of sophisticated models in statistics and machine learning, yet model diagnostic and criticism techniques re- main underdeveloped. In this work, we pro- pose a general Stein operator for point pro- cesses based on the Papangelou conditional intensity function. We then establish a kernel goodness-of-fit test by defining a Stein dis- crepancy measure for general point processes. Notably, our test also applies to non-Poisson point processes whose intensity functions con- tain intractable normalization constants due to the presence of complex interactions among points. We apply our proposed test to sev- eral point process models, and show that it outperforms a two-sample test based on the maximum mean discrepancy. 
    more » « less

    Viral deep-sequencing data play a crucial role toward understanding disease transmission network flows, providing higher resolution compared to standard Sanger sequencing. To more fully utilize these rich data and account for the uncertainties in outcomes from phylogenetic analyses, we propose a spatial Poisson process model to uncover human immunodeficiency virus (HIV) transmission flow patterns at the population level. We represent pairings of individuals with viral sequence data as typed points, with coordinates representing covariates such as gender and age and point types representing the unobserved transmission statuses (linkage and direction). Points are associated with observed scores on the strength of evidence for each transmission status that are obtained through standard deep-sequence phylogenetic analysis. Our method is able to jointly infer the latent transmission statuses for all pairings and the transmission flow surface on the source-recipient covariate space. In contrast to existing methods, our framework does not require preclassification of the transmission statuses of data points, and instead learns them probabilistically through a fully Bayesian inference scheme. By directly modeling continuous spatial processes with smooth densities, our method enjoys significant computational advantages compared to previous methods that rely on discretization of the covariate space. We demonstrate that our framework can capture age structures in HIV transmission at high resolution, bringing valuable insights in a case study on viral deep-sequencing data from Southern Uganda.

    more » « less
  3. Abstract. We present a framework for estimating concentrations of episodicallyelevated high-temperature marine ice nucleating particles (INPs) in the seasurface microlayer and their subsequent emission into the atmosphericboundary layer. These episodic INPs have been observed in multipleship-based and coastal field campaigns, but the processes controlling theirocean concentrations and transfer to the atmosphere are not yet fullyunderstood. We use a combination of empirical constraints and simulationoutputs from an Earth system model to explore different hypotheses forexplaining the variability of INP concentrations, and the occurrence ofepisodic INPs, in the marine atmosphere. In our calculations, we examine the following two proposed oceanic sources of high-temperature INPs: heterotrophic bacteria and marine biopolymer aggregates (MBPAs). Furthermore, we assume that the emission of these INPs is determined by the production of supermicron sea spray aerosol formed from jet drops, with an entrainment probability that is described by Poisson statistics. The concentration of jet drops is derived from the number concentration of supermicron sea spray aerosol calculated from model runs. We then derive the resulting number concentrations of marine high-temperature INPs (at 253 K) in the atmospheric boundary layer and compare their variability to atmospheric observations of INP variability. Specifically, we compare against concentrations of episodically occurring high-temperature INPs observed during field campaigns in the Southern Ocean, the Equatorial Pacific, and the North Atlantic. In this case study, we evaluate our framework at 253 K because reliable observational data at this temperature are available across three different ocean regions, but suitable data are sparse at higher temperatures. We find that heterotrophic bacteria and MBPAs acting as INPs provide only apartial explanation for the observed high INP concentrations. We note,however, that there are still substantial knowledge gaps, particularlyconcerning the identity of the oceanic INPs contributing most frequently toepisodic high-temperature INPs, their specific ice nucleation activity, andthe enrichment of their concentrations during the sea–air transfer process. Therefore, targeted measurements investigating the composition of these marine INPs and drivers for their emissions are needed, ideally incombination with modeling studies focused on the potential cloud impacts ofthese high-temperature INPs. 
    more » « less
  4. We say that one point process on the line R mimics another at a bandwidth B if for each n ≥ 1 the two point processes have n-level correlation functions that agree when integrated against all band-limited test functions on bandwidth [−B, B]. This paper asks the question of for what values a and B can a given point process on the real line be mimicked at bandwidth B by a point process supported on the lattice aZ. For Poisson point processes we give a complete answer for allowed parameter ranges (a,B), and for the sine process we give existence and nonexistence regions for parameter ranges. The results for the sine process have an application to the Alternative Hypothesis regarding the scaled spacing of zeros of the Riemann zeta function, given in a companion paper. 
    more » « less
  5. Abstract

    We develop a prior probability model for temporal Poisson process intensities through structured mixtures of Erlang densities with common scale parameter, mixing on the integer shape parameters. The mixture weights are constructed through increments of a cumulative intensity function which is modeled nonparametrically with a gamma process prior. Such model specification provides a novel extension of Erlang mixtures for density estimation to the intensity estimation setting. The prior model structure supports general shapes for the point process intensity function, and it also enables effective handling of the Poisson process likelihood normalizing term resulting in efficient posterior simulation. The Erlang mixture modeling approach is further elaborated to develop an inference method for spatial Poisson processes. The methodology is examined relative to existing Bayesian nonparametric modeling approaches, including empirical comparison with Gaussian process prior based models, and is illustrated with synthetic and real data examples.

    more » « less