Title: An observationally driven multifield approach for probing the circum-galactic medium with convolutional neural networks
ABSTRACT The circum-galactic medium (CGM) can feasibly be mapped by multiwavelength surveys covering broad swaths of the sky. With multiple large data sets becoming available in the near future, we develop a likelihood-free Deep Learning technique using convolutional neural networks (CNNs) to infer broad-scale physical properties of a galaxy’s CGM and its halo mass for the first time. Using CAMELS (Cosmology and Astrophysics with MachinE Learning Simulations) data, including IllustrisTNG, SIMBA, and Astrid models, we train CNNs on Soft X-ray and 21-cm (H I) radio two-dimensional maps to trace hot and cool gas, respectively, around galaxies, groups, and clusters. Our CNNs offer the unique ability to train and test on ‘multifield’ data sets comprised of both H I and X-ray maps, providing complementary information about physical CGM properties and improved inferences. Applying eRASS:4 survey limits shows that X-ray is not powerful enough to infer individual haloes with masses log(Mhalo/M⊙) < 12.5. The multifield improves the inference for all halo masses. Generally, the CNN trained and tested on Astrid (SIMBA) can most (least) accurately infer CGM properties. Cross-simulation analysis – training on one galaxy formation model and testing on another – highlights the challenges of developing CNNs trained on a single model to marginalize over astrophysical uncertainties and perform robust inferences on real data. The next crucial step in improving the resulting inferences on the physical properties of CGM depends on our ability to interpret these deep-learning models.
Award ID(s):
2206055 2108944
PAR ID:
10483084
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Monthly Notices of the Royal Astronomical Society
Volume:
527
Issue:
4
ISSN:
0035-8711
Format(s):
Medium: X
Size(s):
p. 10038-10058
Sponsoring Org:
National Science Foundation
More Like this
  1. The circum-galactic medium (CGM) can feasibly be mapped by multiwavelength surveys covering broad swaths of the sky. With multiple large data sets becoming available in the near future, we develop a likelihood-free Deep Learning technique using convolutional neural networks (CNNs) to infer broad-scale physical properties of a galaxy’s CGM and its halo mass for the first time. Using CAMELS (Cosmology and Astrophysics with MachinE Learning Simulations) data, including IllustrisTNG, SIMBA, and Astrid models, we train CNNs on Soft X-ray and 21-cm (H I) radio two-dimensional maps to trace hot and cool gas, respectively, around galaxies, groups, and clusters. Our CNNs offer the unique ability to train and test on ‘multifield’ data sets comprised of both H I and X-ray maps, providing complementary information about physical CGM properties and improved inferences. Applying eRASS:4 survey limits shows that X-ray is not powerful enough to infer individual haloes with masses log(Mhalo/M⊙) < 12.5. The multifield improves the inference for all halo masses. Generally, the CNN trained and tested on Astrid (SIMBA) can most (least) accurately infer CGM properties. Cross-simulation analysis – training on one galaxy formation model and testing on another – highlights the challenges of developing CNNs trained on a single model to marginalize over astrophysical uncertainties and perform robust inferences on real data. The next crucial step in improving the resulting inferences on the physical properties of CGM depends on our ability to interpret these deep-learning models.
  2. The circumgalactic medium (CGM) around massive galaxies plays a crucial role in regulating star formation and feedback. Using the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) suite, we develop emulators for the X-ray surface brightness profile and the X-ray luminosity–stellar mass scaling relation, to investigate how stellar and active galactic nucleus (AGN) feedback shape the X-ray properties of the hot CGM. Our analysis shows that at CGM scales (10^12 ≲ Mhalo/M⊙ ≲ 10^13, 10 ≲ r/kpc ≲ 400), stellar feedback more significantly impacts the X-ray properties than AGN feedback within the parameters studied. Comparing the emulators to recent eROSITA All Sky Survey (eRASS) observations, it is found that stronger feedback than is currently implemented in the IllustrisTNG, SIMBA, and Astrid simulations is required to match the observed CGM properties. However, adopting these enhanced feedback parameters causes deviations in the stellar mass–halo mass relations from observational constraints below the group-mass scale. This tension suggests possible unaccounted-for systematics in X-ray CGM observations or inadequacies in the feedback models of cosmological simulations.
  3. The circumgalactic medium (CGM) around massive galaxies plays a crucial role in regulating star formation and feedback. Using the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) suite, we develop emulators for the X-ray surface brightness profile and the X-ray luminosity–stellar mass scaling relation, to investigate how stellar and active galactic nucleus (AGN) feedback shape the X-ray properties of the hot CGM. Our analysis shows that at CGM scales (10^12 ≲ Mhalo/M⊙ ≲ 10^13, 10 ≲ r/kpc ≲ 400), stellar feedback more significantly impacts the X-ray properties than AGN feedback within the parameters studied. Comparing the emulators to recent eROSITA All Sky Survey (eRASS) observations, it is found that stronger feedback than is currently implemented in the IllustrisTNG, SIMBA, and Astrid simulations is required to match the observed CGM properties. However, adopting these enhanced feedback parameters causes deviations in the stellar mass–halo mass relations from observational constraints below the group-mass scale. This tension suggests possible unaccounted-for systematics in X-ray CGM observations or inadequacies in the feedback models of cosmological simulations.
  4. We present CAMELS-ASTRID, the third suite of hydrodynamical simulations in the Cosmology and Astrophysics with MachinE Learning (CAMELS) project, along with new simulation sets that extend the model parameter space based on the previous frameworks of CAMELS-TNG and CAMELS-SIMBA, to provide broader training sets and testing grounds for machine-learning algorithms designed for cosmological studies. CAMELS-ASTRID employs the galaxy formation model following the ASTRID simulation and contains 2124 hydrodynamic simulation runs that vary three cosmological parameters (Ωm, σ8, Ωb) and four parameters controlling stellar and active galactic nucleus (AGN) feedback. Compared to the existing TNG and SIMBA simulation suites in CAMELS, the fiducial model of ASTRID features the mildest AGN feedback and predicts the least baryonic effect on the matter power spectrum. The training set of ASTRID covers a broader variation in the galaxy populations and the baryonic impact on the matter power spectrum compared to its TNG and SIMBA counterparts, which can make machine-learning models trained on the ASTRID suite exhibit better extrapolation performance when tested on other hydrodynamic simulation sets. We also introduce extension simulation sets in CAMELS that widely explore 28 parameters in the TNG and SIMBA models, demonstrating the enormity of the overall galaxy formation model parameter space and the complex nonlinear interplay between cosmology and astrophysical processes. With the new simulation suites, we show that building robust machine-learning models favors training and testing on the largest possible diversity of galaxy formation models. We also demonstrate that it is possible to train accurate neural networks to infer cosmological parameters using the high-dimensional TNG-SB28 simulation set.
  5. The baryonic physics shaping galaxy formation and evolution are complex, spanning a vast range of scales and making them challenging to model. Cosmological simulations rely on subgrid models that produce significantly different predictions. Understanding how models of stellar and active galactic nucleus (AGN) feedback affect baryon behavior across different halo masses and redshifts is essential. Using the SIMBA and IllustrisTNG suites from the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project, we explore the effect of parameters governing the subgrid implementation of stellar and AGN feedback. We find that while IllustrisTNG shows higher cumulative feedback energy across all halos, SIMBA demonstrates a greater spread of baryons, quantified by the closure radius and circumgalactic medium (CGM) gas fraction. This suggests that feedback in SIMBA couples more effectively to baryons and drives them more efficiently within the host halo. There is evidence that the different feedback modes are highly interrelated in these subgrid models. The parameters controlling the stellar feedback efficiency significantly impact AGN feedback, as seen in the suppression of black hole mass growth and delayed activation of AGN feedback to higher-mass halos with increasing stellar feedback efficiency in both simulations. Additionally, the AGN feedback efficiency parameters affect the CGM gas fraction at low halo masses in SIMBA, hinting at complex, nonlinear interactions between the AGN and supernova feedback modes. Overall, we demonstrate that stellar and AGN feedback are intimately interwoven, especially at low redshift, due to subgrid implementation, resulting in halo property effects that might initially seem counterintuitive. 