skip to main content


Title: Ultralarge-scale approximations and galaxy clustering: Debiasing constraints on cosmological parameters
ABSTRACT

Upcoming galaxy surveys will allow us to probe the growth of the cosmic large-scale structure with improved sensitivity compared to current missions, and will also map larger areas of the sky. This means that in addition to the increased precision in observations, future surveys will also access the ultralarge-scale regime, where commonly neglected effects such as lensing, redshift-space distortions, and relativistic corrections become important for calculating correlation functions of galaxy positions. At the same time, several approximations usually made in these calculations such as the Limber approximation break down at those scales. The need to abandon these approximations and simplifying assumptions at large scales creates severe issues for parameter estimation methods. On the one hand, exact calculations of theoretical angular power spectra become computationally expensive, and the need to perform them thousands of times to reconstruct posterior probability distributions for cosmological parameters makes the approach unfeasible. On the other hand, neglecting relativistic effects and relying on approximations may significantly bias the estimates of cosmological parameters. In this work, we quantify this bias and investigate how an incomplete modelling of various effects on ultralarge scales could lead to false detections of new physics beyond the standard ΛCDM model. Furthermore, we propose a simple debiasing method that allows us to recover true cosmologies without running the full parameter estimation pipeline with exact theoretical calculations. This method can therefore provide a fast way of obtaining accurate values of cosmological parameters and estimates of exact posterior probability distributions from ultralarge-scale observations.

 
more » « less
NSF-PAR ID:
10361185
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Monthly Notices of the Royal Astronomical Society
Volume:
510
Issue:
2
ISSN:
0035-8711
Page Range / eLocation ID:
p. 1964-1977
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Modifications of the matter power spectrum due to baryonic physics are one of the major theoretical uncertainties in cosmological weak lensing measurements. Developing robust mitigation schemes for this source of systematic uncertainty increases the robustness of cosmological constraints, and may increase their precision if they enable the use of information from smaller scales. Here we explore the performance of two mitigation schemes for baryonic effects in weak lensing cosmic shear: the principal component analysis (PCA) method and the halo-model approach in hmcode. We construct mock tomographic shear power spectra from four hydrodynamical simulations, and run simulated likelihood analyses with cosmolike assuming LSST-like survey statistics. With an angular scale cut of ℓmax < 2000, both methods successfully remove the biases in cosmological parameters due to the various baryonic physics scenarios, with the PCA method causing less degradation in the parameter constraints than hmcode. For a more aggressive ℓmax = 5000, the PCA method performs well for all but one baryonic physics scenario, requiring additional training simulations to account for the extreme baryonic physics scenario of Illustris; hmcode exhibits tensions in the 2D posterior distributions of cosmological parameters due to lack of freedom in describing the power spectrum for $k \gt 10\ h^{-1}\, \mathrm{Mpc}$. We investigate variants of the PCA method and improve the bias mitigation through PCA by accounting for the noise properties in the data via Cholesky decomposition of the covariance matrix. Our improved PCA method allows us to retain more statistical constraining power while effectively mitigating baryonic uncertainties even for a broad range of baryonic physics scenarios. 
    more » « less
  2. Abstract

    There is untapped cosmological information in galaxy redshift surveys in the nonlinear regime. In this work, we use theAemulussuite of cosmologicalN-body simulations to construct Gaussian process emulators of galaxy clustering statistics at small scales (0.1–50h−1Mpc) in order to constrain cosmological and galaxy bias parameters. In addition to standard statistics—the projected correlation functionwp(rp), the redshift-space monopole of the correlation functionξ0(s), and the quadrupoleξ2(s)—we emulate statistics that include information about the local environment, namely the underdensity probability functionPU(s) and the density-marked correlation functionM(s). This extends the model ofAemulusIII for redshift-space distortions by including new statistics sensitive to galaxy assembly bias. In recovery tests, we find that the beyond-standard statistics significantly increase the constraining power on cosmological parameters of interest: includingPU(s) andM(s) improves the precision of our constraints on Ωmby 27%,σ8by 19%, and the growth of structure parameter,fσ8, by 12% compared to standard statistics. We additionally find that scales below ∼6h−1Mpc contain as much information as larger scales. The density-sensitive statistics also contribute to constraining halo occupation distribution parameters and a flexible environment-dependent assembly bias model, which is important for extracting the small-scale cosmological information as well as understanding the galaxy–halo connection. This analysis demonstrates the potential of emulating beyond-standard clustering statistics at small scales to constrain the growth of structure as a test of cosmic acceleration.

     
    more » « less
  3. ABSTRACT

    Luminous red galaxies (LRGs) and blue star-forming emission-line galaxies (ELGs) are key tracers of large-scale structure used by cosmological surveys. Theoretical predictions for such data are often done via simplistic models for the galaxy–halo connection. In this work, we use the large, high-fidelity hydrodynamical simulation of the MillenniumTNG project (MTNG) to inform a new phenomenological approach for obtaining an accurate and flexible galaxy-halo model on small scales. Our aim is to study LRGs and ELGs at two distinct epochs, z = 1 and z = 0, and recover their clustering down to very small scales, $r \sim 0.1 \ h^{-1}\, {\rm Mpc}$, i.e. the one-halo regime, while a companion paper extends this to a two-halo model for larger distances. The occupation statistics of ELGs in MTNG inform us that (1) the satellite occupations exhibit a slightly super-Poisson distribution, contrary to commonly made assumptions, and (2) that haloes containing at least one ELG satellite are twice as likely to host a central ELG. We propose simple recipes for modelling these effects, each of which calls for the addition of a single free parameter to simpler halo occupation models. To construct a reliable satellite population model, we explore the LRG and ELG satellite radial and velocity distributions and compare them with those of subhaloes and particles in the simulation. We find that ELGs are anisotropically distributed within haloes, which together with our occupation results provides strong evidence for cooperative galaxy formation (manifesting itself as one-halo galaxy conformity); i.e. galaxies with similar properties form in close proximity to each other. Our refined galaxy-halo model represents a useful improvement of commonly used analysis tools and thus can be of help to increase the constraining power of large-scale structure surveys.

     
    more » « less
  4. Abstract

    Integrated population models (IPMs) have become increasingly popular for the modelling of populations, as investigators seek to combine survey and demographic data to understand processes governing population dynamics. These models are particularly useful for identifying and exploring knowledge gaps within life histories, because they allow investigators to estimate biologically meaningful parameters, such as immigration or reproduction, that were previously unidentifiable without additional data. AsIPMs have been developed relatively recently, there is much to learn about model behaviour. Behaviour of parameters, such as estimates near boundaries, and the consequences of varying degrees of dependency among datasets, has been explored. However, the reliability of parameter estimates remains underexamined, particularly when models include parameters that are not identifiable from one data source, but are indirectly identifiable from multiple datasets and a presumed model structure, such as the estimation of immigration using capture‐recapture, fecundity and count data, combined with a life‐history model.

    To examine the behaviour of model parameter estimates, we simulated stable populations closed to immigration and emigration. We simulated two scenarios that might induce error into survival estimates: marker induced bias in the capture–mark–recapture data and heterogeneity in the mortality process. We subsequently fit capture–mark–recapture, state‐space and fecundity models, as well asIPMs that estimated additional parameters.

    Simulation results suggested that when model assumptions are violated, estimation of additional, previously unidentifiable, parameters usingIPMs may be extremely sensitive to these violations of model assumption. For example, when annual marker loss was simulated, estimates of survival rates were low and estimates of immigration rate from anIPMwere high. When heterogeneity in the mortality process was induced, there were substantial relative differences between the medians of posterior distributions and truth for juvenile survival and fecundity.

    Our results have important implications for biological inference when usingIPMs, as well as future model development and implementation. Specifically, using multiple datasets to identify additional parameters resulted in the posterior distributions of additional parameters directly reflecting the effects of the violations of model assumptions in integrated modelling frameworks. We suggest that investigators interpret posterior distributions of these parameters as a combination of biological process and systematic error.

     
    more » « less
  5. ABSTRACT

    We introduce a novel technique for constraining cosmological parameters and galaxy assembly bias using non-linear redshift-space clustering of galaxies. We scale cosmological N-body simulations and insert galaxies with the SubHalo Abundance Matching extended (SHAMe) empirical model to generate over 175 000 clustering measurements spanning all relevant cosmological and SHAMe parameter values. We then build an emulator capable of reproducing the projected galaxy correlation function at the monopole, quadrupole, and hexadecapole level for separations between $0.1\, h^{-1}\, {\rm Mpc}$ and $25\, h^{-1}\, {\rm Mpc}$. We test this approach by using the emulator and Monte Carlo Markov Chain (MCMC) inference to jointly estimate cosmology and assembly bias parameters both for the MTNG740 hydrodynamic simulation and for a semi-analytical model (SAM) galaxy formation built on the MTNG740-DM dark matter-only simulation, obtaining unbiased results for all cosmological parameters. For instance, for MTNG740 and a galaxy number density of $n\sim 0.01 h^{3}\, {\rm Mpc}^{-3}$, we obtain $\sigma _{8}=0.799^{+0.039}_{-0.044}$ and $\Omega _\mathrm{M}h^2= 0.138^{+ 0.025}_{- 0.018}$ (which are within 0.4 and 0.2σ of the MTNG cosmology). For fixed Hubble parameter (h), the constraint becomes $\Omega _\mathrm{M}h^2= 0.137^{+ 0.011}_{- 0.012}$. Our method performs similarly well for the SAM and for other tested sample densities. We almost always recover the true amount of galaxy assembly bias within 1σ. The best constraints are obtained when scales smaller than $2\, h^{-1}\, {\rm Mpc}$ are included, as well as when at least the projected correlation function and the monopole are incorporated. These methods offer a powerful way to constrain cosmological parameters using galaxy surveys.

     
    more » « less