skip to main content


This content will become publicly available on July 1, 2024

Title: Robust Field-level Likelihood-free Inference with Galaxies
Abstract We train graph neural networks to perform field-level likelihood-free inference using galaxy catalogs from state-of-the-art hydrodynamic simulations of the CAMELS project. Our models are rotational, translational, and permutation invariant and do not impose any cut on scale. From galaxy catalogs that only contain 3D positions and radial velocities of ∼1000 galaxies in tiny ( 25 h − 1 Mpc ) 3 volumes our models can infer the value of Ω m with approximately 12% precision. More importantly, by testing the models on galaxy catalogs from thousands of hydrodynamic simulations, each having a different efficiency of supernova and active galactic nucleus feedback, run with five different codes and subgrid models—IllustrisTNG, SIMBA, Astrid, Magneticum, SWIFT-EAGLE—we find that our models are robust to changes in astrophysics, subgrid physics, and subhalo/galaxy finder. Furthermore, we test our models on 1024 simulations that cover a vast region in parameter space—variations in five cosmological and 23 astrophysical parameters—finding that the model extrapolates really well. Our results indicate that the key to building a robust model is the use of both galaxy positions and velocities, suggesting that the network has likely learned an underlying physical relation that does not depend on galaxy formation and is valid on scales larger than ∼10 h −1 kpc.  more » « less
Award ID(s):
2108078 2108944
NSF-PAR ID:
10442412
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
The Astrophysical Journal
Volume:
952
Issue:
1
ISSN:
0004-637X
Page Range / eLocation ID:
69
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract We train graph neural networks on halo catalogs from Gadget N -body simulations to perform field-level likelihood-free inference of cosmological parameters. The catalogs contain ≲5000 halos with masses ≳10 10 h −1 M ⊙ in a periodic volume of ( 25 h − 1 Mpc ) 3 ; every halo in the catalog is characterized by several properties such as position, mass, velocity, concentration, and maximum circular velocity. Our models, built to be permutationally, translationally, and rotationally invariant, do not impose a minimum scale on which to extract information and are able to infer the values of Ω m and σ 8 with a mean relative error of ∼6%, when using positions plus velocities and positions plus masses, respectively. More importantly, we find that our models are very robust: they can infer the value of Ω m and σ 8 when tested using halo catalogs from thousands of N -body simulations run with five different N -body codes: Abacus, CUBEP 3 M, Enzo, PKDGrav3, and Ramses. Surprisingly, the model trained to infer Ω m also works when tested on thousands of state-of-the-art CAMELS hydrodynamic simulations run with four different codes and subgrid physics implementations. Using halo properties such as concentration and maximum circular velocity allow our models to extract more information, at the expense of breaking the robustness of the models. This may happen because the different N -body codes are not converged on the relevant scales corresponding to these parameters. 
    more » « less
  2. Abstract Understanding the halo–galaxy connection is fundamental in order to improve our knowledge on the nature and properties of dark matter. In this work, we build a model that infers the mass of a halo given the positions, velocities, stellar masses, and radii of the galaxies it hosts. In order to capture information from correlations among galaxy properties and their phase space, we use Graph Neural Networks (GNNs), which are designed to work with irregular and sparse data. We train our models on galaxies from more than 2000 state-of-the-art simulations from the Cosmology and Astrophysics with MachinE Learning Simulations project. Our model, which accounts for cosmological and astrophysical uncertainties, is able to constrain the masses of the halos with a ∼0.2 dex accuracy. Furthermore, a GNN trained on a suite of simulations is able to preserve part of its accuracy when tested on simulations run with a different code that utilizes a distinct subgrid physics model, showing the robustness of our method. The PyTorch Geometric implementation of the GNN is publicly available on GitHub ( https://github.com/PabloVD/HaloGraphNet ). 
    more » « less
  3. Abstract Galaxies can be characterized by many internal properties such as stellar mass, gas metallicity, and star formation rate. We quantify the amount of cosmological and astrophysical information that the internal properties of individual galaxies and their host dark matter halos contain. We train neural networks using hundreds of thousands of galaxies from 2000 state-of-the-art hydrodynamic simulations with different cosmologies and astrophysical models of the CAMELS project to perform likelihood-free inference on the value of the cosmological and astrophysical parameters. We find that knowing the internal properties of a single galaxy allows our models to infer the value of Ω m , at fixed Ω b , with a ∼10% precision, while no constraint can be placed on σ 8 . Our results hold for any type of galaxy, central or satellite, massive or dwarf, at all considered redshifts, z ≤ 3, and they incorporate uncertainties in astrophysics as modeled in CAMELS. However, our models are not robust to changes in subgrid physics due to the large intrinsic differences the two considered models imprint on galaxy properties. We find that the stellar mass, stellar metallicity, and maximum circular velocity are among the most important galaxy properties to determine the value of Ω m . We believe that our results can be explained by considering that changes in the value of Ω m , or potentially Ω b /Ω m , affect the dark matter content of galaxies, which leaves a signature in galaxy properties distinct from the one induced by galactic processes. Our results suggest that the low-dimensional manifold hosting galaxy properties provides a tight direct link between cosmology and astrophysics. 
    more » « less
  4. ABSTRACT

    Cosmological inference with large galaxy surveys requires theoretical models that combine precise predictions for large-scale structure with robust and flexible galaxy formation modelling throughout a sufficiently large cosmic volume. Here, we introduce the millenniumTNG (MTNG) project which combines the hydrodynamical galaxy formation model of illustrisTNG with the large volume of the millennium simulation. Our largest hydrodynamic simulation, covering $(500 \, h^{-1}{\rm Mpc})^3 \simeq (740\, {\rm Mpc})^3$, is complemented by a suite of dark-matter-only simulations with up to 43203 dark matter particles (a mass resolution of $1.32\times 10^8 \, h^{-1}{\rm M}_\odot$) using the fixed-and-paired technique to reduce large-scale cosmic variance. The hydro simulation adds 43203 gas cells, achieving a baryonic mass resolution of $2\times 10^7 \, h^{-1}{\rm M}_\odot$. High time-resolution merger trees and direct light-cone outputs facilitate the construction of a new generation of semi-analytic galaxy formation models that can be calibrated against both the hydro simulation and observation, and then applied to even larger volumes – MTNG includes a flagship simulation with 1.1 trillion dark matter particles and massive neutrinos in a volume of $(3000\, {\rm Mpc})^3$. In this introductory analysis we carry out convergence tests on basic measures of non-linear clustering such as the matter power spectrum, the halo mass function and halo clustering, and we compare simulation predictions to those from current cosmological emulators. We also use our simulations to study matter and halo statistics, such as halo bias and clustering at the baryonic acoustic oscillation scale. Finally we measure the impact of baryonic physics on the matter and halo distributions.

     
    more » « less
  5. ABSTRACT

    Although galactic winds play a critical role in regulating galaxy formation, hydrodynamic cosmological simulations do not resolve the scales that govern the interaction between winds and the ambient circumgalactic medium (CGM). We implement the Physically Evolved Wind (PhEW) model of Huang et al. in the gizmo hydrodynamics code and perform test cosmological simulations with different choices of model parameters and numerical resolution. PhEW adopts an explicit subgrid model that treats each wind particle as a collection of clouds that exchange mass and metals with their surroundings and evaporate by conduction and hydrodynamic instabilities as calibrated on much higher resolution cloud scale simulations. In contrast to a conventional wind algorithm, we find that PhEW results are robust to numerical resolution and implementation details because the small scale interactions are defined by the model itself. Compared to our previous wind simulations with the same resolution, our PhEW simulations are in better agreement with low-redshift galactic stellar mass functions at M* < 1011M⊙ because PhEW particles shed mass to the CGM before escaping low mass haloes. PhEW radically alters the CGM metal distribution because PhEW particles disperse metals to the ambient medium as their clouds dissipate, producing a CGM metallicity distribution that is skewed but unimodal and is similar between cold and hot gas. While the temperature distributions and radial profiles of gaseous haloes are similar in simulations with PhEW and conventional winds, these changes in metal distribution will affect their predicted UV/X-ray properties in absorption and emission.

     
    more » « less