ABSTRACT This paper provides a comprehensive overview of how fitting of baryon acoustic oscillations (BAO) is carried out within the upcoming Dark Energy Spectroscopic Instrument’s (DESI) 2024 results using its DR1 data set, and the associated systematic error budget from theory and modelling of the BAO. We derive new results showing how non-linearities in the clustering of galaxies can cause potential biases in measurements of the isotropic ($$\alpha _{\mathrm{iso}}$$) and anisotropic ($$\alpha _{\mathrm{ap}}$$) BAO distance scales, and how these can be effectively removed with an appropriate choice of reconstruction algorithm. We then demonstrate how theory leads to a clear choice for how to model the BAO and develop, implement, and validate a new model for the remaining smooth-broad-band (i.e. without BAO) component of the galaxy clustering. Finally, we explore the impact of all remaining modelling choices on the BAO constraints from DESI using a suite of high-precision simulations, arriving at a set of best practices for DESI BAO fits, and an associated theory and modelling systematic error. Overall, our results demonstrate the remarkable robustness of the BAO to all our modelling choices and motivate a combined theory and modelling systematic error contribution to the post-reconstruction DESI BAO measurements of no more than 0.1 per cent (0.2 per cent) for its isotropic (anisotropic) distance measurements. We expect the theory and best practices laid out to here to be applicable to other BAO experiments in the era of DESI and beyond.
more »
« less
Estimation and Clustering in Popularity Adjusted Block Model
Abstract The paper considers the Popularity Adjusted Block model (PABM) introduced by Sengupta and Chen (Journal of the Royal Statistical Society Series B, 2018, 80, 365–386). We argue that the main appeal of the PABM is the flexibility of the spectral properties of the graph which makes the PABM an attractive choice for modelling networks that appear in biological sciences. We expand the theory of PABM to the case of an arbitrary number of communities which possibly grows with a number of nodes in the network and is not assumed to be known. We produce estimators of the probability matrix and of the community structure and, in addition, provide non-asymptotic upper bounds for the estimation and the clustering errors. We use the Sparse Subspace Clustering (SSC) approach for partitioning the network into communities, the approach that, to the best of our knowledge, has not been used for the clustering network data. The theory is supplemented by a simulation study. In addition, we show advantages of the PABM for modelling a butterfly similarity network and a human brain functional network.
more »
« less
- PAR ID:
- 10398643
- Publisher / Repository:
- Oxford University Press
- Date Published:
- Journal Name:
- Journal of the Royal Statistical Society Series B: Statistical Methodology
- Volume:
- 83
- Issue:
- 2
- ISSN:
- 1369-7412
- Format(s):
- Medium: X Size: p. 293-317
- Size(s):
- p. 293-317
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Monitoring wildlife abundance across space and time is an essential task to study their population dynamics and inform effective management. Acoustic recording units are a promising technology for efficiently monitoring bird populations and communities. While current acoustic data models provide information on the presence/absence of individual species, new approaches are needed to monitor population abundance, ideally across large spatio‐temporal regions.We present an integrated modelling framework that combines high‐quality but temporally sparse bird point count survey data with acoustic recordings. Our models account for imperfect detection in both data types and false positive errors in the acoustic data. Using simulations, we compare the accuracy and precision of abundance estimates using differing amounts of acoustic vocalizations obtained from a clustering algorithm, point count data, and a subset of manually validated acoustic vocalizations. We also use our modelling framework in a case study to estimate abundance of the Eastern Wood‐Pewee (Contopus virens) in Vermont, USA.The simulation study reveals that combining acoustic and point count data via an integrated model improves accuracy and precision of abundance estimates compared with models informed by either acoustic or point count data alone. Improved estimates are obtained across a wide range of scenarios, with the largest gains occurring when detection probability for the point count data is low. Combining acoustic data with only a small number of point count surveys yields estimates of abundance without the need for validating any of the identified vocalizations from the acoustic data. Within our case study, the integrated models provided moderate support for a decline of the Eastern Wood‐Pewee in this region.Our integrated modelling approach combines dense acoustic data with few point count surveys to deliver reliable estimates of species abundance without the need for manual identification of acoustic vocalizations or a prohibitively expensive large number of repeated point count surveys. Our proposed approach offers an efficient monitoring alternative for large spatio‐temporal regions when point count data are difficult to obtain or when monitoring is focused on rare species with low detection probability.more » « less
-
null (Ed.)ABSTRACT Halo models provide a simple and computationally inexpensive way to investigate the connection between galaxies and their dark matter haloes. However, these models rely on the assumption that the role of baryons can easily be parametrized in the modelling procedure. We aim to examine the ability of halo occupation distribution (HOD) modelling to reproduce the galaxy clustering found in two different hydrodynamic simulations, Illustris and EAGLE. For each simulation, we measure several galaxy clustering statistics on two different luminosity threshold samples. We then apply a simple five parameter HOD, which was fit to each simulation separately, to the corresponding dark matter-only simulations, and measure the same clustering statistics. We find that the halo mass function is shifted to lower masses in the hydrodynamic simulations, resulting in a galaxy number density that is too high when an HOD is applied to the dark matter-only simulation. However, the exact way in which baryons alter the mass function is remarkably different in the two simulations. After applying a correction to the halo mass function in each simulation, the HOD is able to accurately reproduce all clustering statistics for the high luminosity sample of galaxies. For the low luminosity sample, we find evidence that in addition to correcting the halo mass function, including spatial, velocity, and assembly bias parameters in the HOD is necessary to accurately reproduce clustering statistics.more » « less
-
Summary Recent studies have focused on linking marine microbial communities with environmental factors, yet, relatively little is known about the drivers of microbial community patterns across the complex gradients from the nearshore to open ocean. Here, we examine microbial dynamics in 15 five‐station transects beginning at the estuarine Piver's Island Coastal Observatory (PICO) time‐series site and continuing 87 km across the continental shelf to the oligotrophic waters of the Sargasso Sea. 16S rRNA gene libraries reveal strong clustering by sampling site with distinct nearshore, continental shelf and offshore oceanic communities. Water temperature and distance from shore (which serves as a proxy for gradients in factors such as productivity, terrestrial input and nutrients) both most influence community composition. However, at the phylotype level, modelling shows the distribution of some taxa is linked to temperature, others to distance from shore and some by both factors, highlighting that taxa with distinct environmental preferences underlie apparent clustering by station. Thus, continental margins contain microbial communities that are distinct from those of either the nearshore or the offshore environments and contain mixtures of phylotypes with nearshore or offshore preferences rather than those unique to the shelf environment.more » « less
-
ABSTRACT AimEcological theory suggests that dispersal limitation and selection by climatic factors influence bacterial community assembly at a continental scale, yet the conditions governing the relative importance of each process remains unclear. The carnivorous pitcher plantSarracenia purpureaprovides a model aquatic microecosystem to assess bacterial communities across the host plant's north–south range in North America. This study determined the relative influences of dispersal limitation and environmental selection on the assembly of bacterial communities inhabitingS. purpureapitchers at the continental scale. LocationEastern United States and Canada. Time Period2016. Major Taxa StudiedBacteria inhabitingS. purpureapitchers. MethodsPitcher morphology, fluid, inquilines and prey were measured, and pitcher fluid underwent DNA sequencing for bacterial community analysis. Null modelling of β‐diversity provided estimates for the contributions of selection and dispersal limitation to community assembly, complemented by an examination of spatial clustering of individuals. Phylogenetic and ecological associations of co‐occurrence network module bacteria was determined by assessing the phylogenetic diversity and habitat preferences of member taxa. ResultsDispersal limitation was evident from between‐site variation and spatial aggregation of individual bacterial taxa in theS. purpureapitcher system. Selection pressure was weak across the geographic range, yet network module analysis indicated environmental selection within subgroups. A group of aquatic bacteria held traits under selection in warmer, wetter climates, and midge abundance was associated with selection for traits held by a group of saprotrophs. Processes that increased pitcher fluid volume weakened selection in one module, possibly by supporting greater bacterial dispersal. ConclusionDispersal limitation governed bacterial community assembly inS. purpureapitchers at a continental scale (74% of between‐site comparisons) and was significantly greater than selection across the range. Network modules showed evidence for selection, demonstrating that multiple processes acted concurrently in bacterial community assembly at the continental scale.more » « less
An official website of the United States government
