skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A Universal Equation to Predict Ω m from Halo and Galaxy Catalogs
Abstract We discover analytic equations that can infer the value of Ωmfrom the positions and velocity moduli of halo and galaxy catalogs. The equations are derived by combining a tailored graph neural network (GNN) architecture with symbolic regression. We first train the GNN on dark matter halos from GadgetN-body simulations to perform field-level likelihood-free inference, and show that our model can infer Ωmwith ∼6% accuracy from halo catalogs of thousands ofN-body simulations run with six different codes: Abacus, CUBEP3M, Gadget, Enzo, PKDGrav3, and Ramses. By applying symbolic regression to the different parts comprising the GNN, we derive equations that can predict Ωmfrom halo catalogs of simulations run with all of the above codes with accuracies similar to those of the GNN. We show that, by tuning a single free parameter, our equations can also infer the value of Ωmfrom galaxy catalogs of thousands of state-of-the-art hydrodynamic simulations of the CAMELS project, each with a different astrophysics model, run with five distinct codes that employ different subgrid physics: IllustrisTNG, SIMBA, Astrid, Magneticum, SWIFT-EAGLE. Furthermore, the equations also perform well when tested on galaxy catalogs from simulations covering a vast region in parameter space that samples variations in 5 cosmological and 23 astrophysical parameters. We speculate that the equations may reflect the existence of a fundamental physics relation between the phase-space distribution of generic tracers and Ωm, one that is not affected by galaxy formation physics down to scales as small as 10h−1kpc.  more » « less
Award ID(s):
2009309
PAR ID:
10627206
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
American Astronomical Society
Date Published:
Journal Name:
The Astrophysical Journal
Volume:
956
Issue:
2
ISSN:
0004-637X
Page Range / eLocation ID:
149
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract We train graph neural networks on halo catalogs from Gadget N -body simulations to perform field-level likelihood-free inference of cosmological parameters. The catalogs contain ≲5000 halos with masses ≳10 10 h −1 M ⊙ in a periodic volume of ( 25 h − 1 Mpc ) 3 ; every halo in the catalog is characterized by several properties such as position, mass, velocity, concentration, and maximum circular velocity. Our models, built to be permutationally, translationally, and rotationally invariant, do not impose a minimum scale on which to extract information and are able to infer the values of Ω m and σ 8 with a mean relative error of ∼6%, when using positions plus velocities and positions plus masses, respectively. More importantly, we find that our models are very robust: they can infer the value of Ω m and σ 8 when tested using halo catalogs from thousands of N -body simulations run with five different N -body codes: Abacus, CUBEP 3 M, Enzo, PKDGrav3, and Ramses. Surprisingly, the model trained to infer Ω m also works when tested on thousands of state-of-the-art CAMELS hydrodynamic simulations run with four different codes and subgrid physics implementations. Using halo properties such as concentration and maximum circular velocity allow our models to extract more information, at the expense of breaking the robustness of the models. This may happen because the different N -body codes are not converged on the relevant scales corresponding to these parameters. 
    more » « less
  2. Abstract We train graph neural networks to perform field-level likelihood-free inference using galaxy catalogs from state-of-the-art hydrodynamic simulations of the CAMELS project. Our models are rotational, translational, and permutation invariant and do not impose any cut on scale. From galaxy catalogs that only contain 3D positions and radial velocities of ∼1000 galaxies in tiny ( 25 h − 1 Mpc ) 3 volumes our models can infer the value of Ω m with approximately 12% precision. More importantly, by testing the models on galaxy catalogs from thousands of hydrodynamic simulations, each having a different efficiency of supernova and active galactic nucleus feedback, run with five different codes and subgrid models—IllustrisTNG, SIMBA, Astrid, Magneticum, SWIFT-EAGLE—we find that our models are robust to changes in astrophysics, subgrid physics, and subhalo/galaxy finder. Furthermore, we test our models on 1024 simulations that cover a vast region in parameter space—variations in five cosmological and 23 astrophysical parameters—finding that the model extrapolates really well. Our results indicate that the key to building a robust model is the use of both galaxy positions and velocities, suggesting that the network has likely learned an underlying physical relation that does not depend on galaxy formation and is valid on scales larger than ∼10 h −1 kpc. 
    more » « less
  3. Abstract Cosmological simulations like CAMELS and IllustrisTNG characterize hundreds of thousands of galaxies using various internal properties. Previous studies have demonstrated that machine learning can be used to infer the cosmological parameter Ωmfrom the internal properties of even a single randomly selected simulated galaxy. This ability was hypothesized to originate from galaxies occupying a low-dimensional manifold within a higher-dimensional galaxy property space, which shifts with variations in Ωm. In this work, we investigate how galaxies occupy the high-dimensional galaxy property space, particularly the effect of Ωmand other cosmological and astrophysical parameters on the putative manifold. We achieve this by using an autoencoder with an information-ordered bottleneck, a neural layer with adaptive compression, to perform dimensionality reduction on individual galaxy properties from CAMELS simulations, which are run with various combinations of cosmological and astrophysical parameters. We find that for an autoencoder trained on the fiducial set of parameters, the reconstruction error increases significantly when the test set deviates from fiducial values of ΩmandASN1, indicating that these parameters shift galaxies off the fiducial manifold. In contrast, variations in other parameters such asσ8cause negligible error changes, suggesting galaxies shift along the manifold. These findings provide direct evidence that the ability to infer Ωmfrom individual galaxies is tied to the way Ωmshifts the manifold. Physically, this implies that parameters likeσ8produce galaxy property changes resembling natural scatter, while parameters like ΩmandASN1create unsampled properties, extending beyond the natural scatter in the fiducial model. 
    more » « less
  4. Abstract We present a four-step group-finding algorithm for the Gas in Galaxy Groups (G3) initiative, a spin-off of thez∼ 0 REsolved Spectroscopy Of a Local VolumE (RESOLVE) and Environmental COntext (ECO) surveys. In preparation for future comparisons to intermediate redshift (e.g., the LADUMA survey), we design the group finder to adapt to incomplete, shallow, or nonuniform data. We use mock catalogs to optimize the group finder’s performance. Compared to friends-of-friends (with false-pair splitting), the G3 algorithm offers improved completeness and halo-mass recovery with minimal loss of purity. Combining it with the volume-limited Hicensus data for RESOLVE and ECO, we examine the Hicontent of galaxy groups as a function of group halo mass. Group-integrated HimassMH I,grprises monotonically over halo massesMhalo∼ 1011–1014.5M, pivoting in slope atMhalo∼ 1011.4M, the gas-richness threshold scale. We present the first measurement of the scatter in this relation, which has a median of ∼0.3 dex and is asymmetric toward lowerMH I,grp. We discuss interesting tensions with theoretical predictions and prior measurements of theMH I,grp–Mhalorelation. In an appendix, we release RESOLVE DR4 and ECO DR3, including updates to survey redshifts, photometry, and group catalogs, as well as a major expansion of the ECO Hiinventory with value-added data products. 
    more » « less
  5. Abstract The connection between galaxies and dark matter halos is often quantified using the stellar mass–halo mass (SMHM) relation. Optical and near-infrared imaging surveys have led to a broadly consistent picture of the evolving SMHM relation based on measurements of galaxy abundances and angular correlation functions. Spectroscopic surveys atz≳ 2 can also constrain the SMHM relation via the galaxy autocorrelation function and through the cross-correlation between galaxies and Lyαabsorption measured in transverse sight lines; however, such studies are very few and have produced some unexpected or inconclusive results. We use ∼3000 spectra ofz∼ 2.5 galaxies from the LyαTomography IMACS Survey (LATIS) to measure the galaxy–galaxy and galaxy–Lyαcorrelation functions in four bins of stellar mass spanning 109.2≲M*/M≲ 1010.5. Parallel analyses of the MultiDarkN-body and ASTRID hydrodynamic cosmological simulations allow us to model the correlation functions, estimate covariance matrices, and infer halo masses. We find that results of the two methods are mutually consistent and broadly accord with standard SMHM relations. This consistency demonstrates that we are able to measure and model Lyαtransmission fluctuationsδFin LATIS accurately. We also show that the galaxy–Lyαcross-correlation, a free by-product of optical spectroscopic galaxy surveys at these redshifts, can constrain halo masses with similar precision to galaxy–galaxy clustering. 
    more » « less