Title: Revealing the Galaxy–Halo Connection through Machine Learning
Abstract

Understanding the connections between galaxy stellar mass, star formation rate, and dark matter halo mass represents a key goal of the theory of galaxy formation. Cosmological simulations that include hydrodynamics, physical treatments of star formation, feedback from supernovae, and the radiative transfer of ionizing photons can capture the processes relevant for establishing these connections. The complexity of these physics can prove difficult to disentangle and obfuscate how mass-dependent trends in the galaxy population originate. Here, we train a machine-learning method called Explainable Boosting Machines (EBMs) to infer how the stellar mass and star formation rate of nearly 6 million galaxies simulated by the Cosmic Reionization on Computers project depend on the physical properties of halo mass, the peak circular velocity of the galaxy during its formation history $v_{\rm peak}$, cosmic environment, and redshift. The resulting EBM models reveal the relative importance of these properties in setting galaxy stellar mass and star formation rate, with $v_{\rm peak}$ providing the most dominant contribution. Environmental properties provide substantial improvements for modeling the stellar mass and star formation rate in only ≲10% of the simulated galaxies. We also provide alternative formulations of EBM models that enable low-resolution simulations, which cannot track the interior structure of dark matter halos, to predict the stellar mass and star formation rate of galaxies computed by high-resolution simulations with detailed baryonic physics.

 
PAR ID:
10401826
Publisher / Repository:
DOI PREFIX: 10.3847
Journal Name:
The Astrophysical Journal
Volume:
945
Issue:
2
ISSN:
0004-637X
Format(s):
Medium: X
Size(s):
Article No. 122
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT We introduce a suite of cosmological volume simulations to study the evolution of galaxies as part of the Feedback in Realistic Environments project. FIREbox, the principal simulation of the present suite, provides a representative sample of galaxies (∼1000 galaxies with $M_{\rm star} > 10^8\, M_\odot$ at z = 0) at a resolution ($\Delta x \sim 20\, {\rm pc}$, $m_{\rm b} \sim 6 \times 10^4\, M_\odot$) comparable to state-of-the-art galaxy zoom-in simulations. FIREbox captures the multiphase nature of the interstellar medium in a fully cosmological setting (L = 22.1 Mpc) thanks to its exceptionally high dynamic range ($\gtrsim 10^6$) and the inclusion of multichannel stellar feedback. Here, we focus on validating the simulation predictions by comparing to observational data. We find that star formation rates, gas masses, and metallicities of simulated galaxies with $M_{\rm star} < 10^{10.5-11}\, M_\odot$ broadly agree with observations. These galaxy scaling relations extend to low masses ($M_{\rm star} \sim 10^7\, M_\odot$) and follow a (broken) power-law relationship. Also reproduced are the evolution of the cosmic H I density and the H I column density distribution at z ∼ 0–5. At low z, FIREbox predicts a peak in the stellar-mass–halo-mass relation but also a higher abundance of massive galaxies and a higher cosmic star formation rate density than observed, showing that stellar feedback alone is insufficient to reproduce the properties of massive galaxies at late times. Given its high resolution and sample size, FIREbox offers a baseline prediction of galaxy formation theory in a ΛCDM Universe while also highlighting modelling challenges to be addressed in next-generation galaxy simulations.
  2. ABSTRACT

    Understanding what shapes the cold gas component of galaxies, which both provides the fuel for star formation and is strongly affected by the subsequent stellar feedback, is a crucial step towards a better understanding of galaxy evolution. Here, we analyse the H I properties of a sample of 46 Milky Way halo-mass galaxies, drawn from cosmological simulations (EMP-Pathfinder and Firebox). This set of simulations comprises galaxies evolved self-consistently across cosmic time with different baryonic sub-grid physics: three different star formation models [constant star formation efficiency (SFE) with different star formation eligibility criteria, and an environmentally dependent, turbulence-based SFE] and two different feedback prescriptions, where only one sub-sample includes early stellar feedback. We use these simulations to assess the impact of different baryonic physics on the H I content of galaxies. We find that the galaxy-wide H I properties agree with each other and with observations. However, differences appear for small-scale properties. The thin H I discs observed in the local universe are only reproduced with a turbulence-dependent SFE and/or early stellar feedback. Furthermore, we find that the morphology of H I discs is particularly sensitive to the different physics models: galaxies simulated with a turbulence-based SFE have discs that are smoother and more rotationally symmetric, compared to those simulated with a constant SFE; galaxies simulated with early stellar feedback have more regular discs than supernova-feedback-only galaxies. We find that the rotational asymmetry of the H I discs depends most strongly on the underlying physics model, making this a promising observable for understanding the physics responsible for shaping the interstellar medium of galaxies.

     
  3. Abstract We predict the stellar mass–halo mass (SMHM) relationship for dwarf galaxies, using simulated galaxies with peak halo masses of $M_{\rm peak} = 10^{11}\, M_\odot$ down into the ultra-faint dwarf range to $M_{\rm peak} = 10^7\, M_\odot$. Our simulated dwarfs have stellar masses of $M_{\rm star} = 790\, M_\odot$ to $8.2 \times 10^8\, M_\odot$, with corresponding V-band magnitudes from −2 to −18.5. For $M_{\rm peak} > 10^{10}\, M_\odot$, the simulated SMHM relationship agrees with literature determinations, including exhibiting a small scatter of 0.3 dex. However, the scatter in the SMHM relation increases for lower-mass halos. We first present results for well-resolved halos that contain a simulated stellar population, but recognize that whether a halo hosts a galaxy is inherently mass resolution dependent. We thus adopt a probabilistic model to populate “dark” halos below our resolution limit to predict an “intrinsic” slope and scatter for the SMHM relation. We fit linearly growing log-normal scatter in stellar mass, which grows to more than 1 dex at $M_{\rm peak} = 10^8\, M_\odot$. At the faintest end of the SMHM relation probed by our simulations, a galaxy cannot be assigned a unique halo mass based solely on its luminosity. Instead, we provide a formula to stochastically populate low-mass halos following our results. Finally, we show that our growing log-normal scatter steepens the faint-end slope of the predicted stellar mass function.
  4. Abstract We describe a public data release of the FIRE-2 cosmological zoom-in simulations of galaxy formation (available at http://flathub.flatironinstitute.org/fire ) from the Feedback In Realistic Environments (FIRE) project. FIRE-2 simulations achieve parsec-scale resolution to explicitly model the multiphase interstellar medium while implementing direct models for stellar evolution and feedback, including stellar winds, core-collapse and Type Ia supernovae, radiation pressure, photoionization, and photoelectric heating. We release complete snapshots from three suites of simulations. The first comprises 20 simulations that zoom in on 14 Milky Way (MW)–mass galaxies, five SMC/LMC-mass galaxies, and four lower-mass galaxies including one ultrafaint; we release 39 snapshots across z = 0–10. The second comprises four massive galaxies, with 19 snapshots across z = 1–10. Finally, a high-redshift suite comprises 22 simulations, with 11 snapshots across z = 5–10. Each simulation also includes dozens of resolved lower-mass (satellite) galaxies in its zoom-in region. Snapshots include all stored properties for all dark matter, gas, and star particles, including 11 elemental abundances for stars and gas, and formation times (ages) of star particles. We also release accompanying (sub)halo catalogs, which include galaxy properties and member star particles. For the simulations to z = 0, including all MW-mass galaxies, we release the formation coordinates and an “ex situ” flag for all star particles, pointers to track particles across snapshots, catalogs of stellar streams, and multipole basis expansions for the halo mass distributions. We describe publicly available python packages for reading and analyzing these simulations. 
  5. Abstract

    Observations of gravitational waves from binary black hole (BBH) mergers have measured the redshift evolution of the BBH merger rate. The number density of galaxies in the Universe evolves differently with redshift based on their physical properties, such as their stellar masses and star formation rates. In this work we show that the measured population-level redshift distribution of BBHs sheds light on the properties of their probable host galaxies. We first assume that the hosts of BBHs can be described by a mixture model of galaxies weighted by stellar mass or star formation rate, and find that we can place upper limits on the fraction of mergers coming from a stellar-mass-weighted sample of galaxies. We then constrain the parameters of a physically motivated power-law delay-time distribution using GWTC-3 data, and self-consistently track galaxies in the UniverseMachine simulations with this delay-time model to infer the probable host galaxies of BBHs over a range of redshifts. We find that the inferred host galaxy distribution at redshift z = 0.21 has a median star formation rate of $\sim 0.9\, M_\odot\, {\rm yr}^{-1}$ and a median stellar mass of $\sim 1.9 \times 10^{10}\, M_\odot$. We also provide distributions for the mean stellar age, halo mass, halo radius, peculiar velocity, and large-scale bias associated with the host galaxies, as well as their absolute magnitudes in the $B$ and $K_s$ bands. Our results can be used to design optimal electromagnetic follow-up strategies for BBHs, and also to aid the measurement of cosmological parameters using the statistical dark-siren method.

     