The mass assembly history (MAH) of dark matter haloes plays a crucial role in shaping the formation and evolution of galaxies. MAHs are used extensively in semi-analytic and empirical models of galaxy formation, yet current analytic methods to generate them are inaccurate and unable to capture their relationship with the halo internal structure and large-scale environment. This paper introduces florah (FLOw-based Recurrent model for Assembly Histories), a machine-learning framework for generating assembly histories of ensembles of dark matter haloes. We train florah on the assembly histories from the Gadget at Ultra-high Redshift with Extra Fine Time-steps and vsmdplN-body simulations and demonstrate its ability to recover key properties such as the time evolution of mass and concentration. We obtain similar results for the galaxy stellar mass versus halo mass relation and its residuals when we run the Santa Cruz semi-analytic model on florah-generated assembly histories and halo formation histories extracted from an N-body simulation. We further show that florah also reproduces the dependence of clustering on properties other than mass (assembly bias), which is not captured by other analytic methods. By combining multiple networks trained on a suite of simulations with different redshift ranges and mass resolutions, we are able to construct accurate main progenitor branches with a wide dynamic mass range from $z=0$ up to an ultra-high redshift $z \approx 20$, currently far beyond that of a single N-body simulation. florah is the first step towards a machine learning-based framework for planting full merger trees; this will enable the exploration of different galaxy formation scenarios with great computational efficiency at unprecedented accuracy.
We build a deep learning framework that connects the local formation process of dark matter haloes to the halo bias. We train a convolutional neural network (CNN) to predict the final mass and concentration of dark matter haloes from the initial conditions. The CNN is then used as a surrogate model to derive the response of the haloes’ mass and concentration to long-wavelength perturbations in the initial conditions, and consequently the halo bias parameters following the ‘response bias’ definition. The CNN correctly predicts how the local properties of dark matter haloes respond to changes in the large-scale environment, despite no explicit knowledge of halo bias being provided during training. We show that the CNN recovers the known trends for the linear and second-order density bias parameters b1 and b2, as well as for the local primordial non-Gaussianity linear bias parameter bϕ. The expected secondary assembly bias dependence on halo concentration is also recovered by the CNN: at fixed mass, halo concentration has only a mild impact on b1, but a strong impact on bϕ. Our framework opens a new window for discovering which physical aspects of the halo’s Lagrangian patch determine assembly bias, which in turn can inform physical models of halo formation and bias.
more » « less- PAR ID:
- 10431337
- Publisher / Repository:
- Oxford University Press
- Date Published:
- Journal Name:
- Monthly Notices of the Royal Astronomical Society
- Volume:
- 524
- Issue:
- 2
- ISSN:
- 0035-8711
- Page Range / eLocation ID:
- p. 1746-1756
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
ABSTRACT -
ABSTRACT Approximate methods to populate dark-matter haloes with galaxies are of great utility to galaxy surveys. However, the limitations of simple halo occupation models (HODs) preclude a full use of small-scale galaxy clustering data and call for more sophisticated models. We study two galaxy populations, luminous red galaxies (LRGs) and star-forming emission-line galaxies (ELGs), at two epochs, z = 1 and z = 0, in the large-volume, high-resolution hydrodynamical simulation of the MillenniumTNG project. In a partner study we concentrated on the small-scale, one-halo regime down to r ∼ 0.1 h−1 Mpc, while here we focus on modelling galaxy assembly bias in the two-halo regime, r ≳ 1 h−1 Mpc. Interestingly, the ELG signal exhibits scale dependence out to relatively large scales (r ∼ 20 h−1 Mpc), implying that the linear bias approximation for this tracer is invalid on these scales, contrary to common assumptions. The 10–15 per cent discrepancy is only reconciled when we augment our halo occupation model with a dependence on extrinsic halo properties (‘shear’ being the best-performing one) rather than intrinsic ones (e.g. concentration, peak mass). We argue that this fact constitutes evidence for two-halo galaxy conformity. Including tertiary assembly bias (i.e. a property beyond mass and ‘shear’) is not an essential requirement for reconciling the galaxy assembly bias signal of LRGs, but the combination of external and internal properties is beneficial for recovering ELG the clustering. We find that centrals in low-mass haloes dominate the assembly bias signal of both populations. Finally, we explore the predictions of our model for higher order statistics such as nearest neighbour counts. The latter supplies additional information about galaxy assembly bias and can be used to break degeneracies between halo model parameters.
-
ABSTRACT Secondary halo properties beyond mass, such as the mass accretion rate (MAR), concentration, and the half mass scale, are essential in understanding the formation of large-scale structure and dark matter haloes. In this paper, we study the impact of secondary halo properties on the galaxy-galaxy lensing observable, ΔΣ. We build an emulator trained on N-body simulations to model ΔΣ and quantify the impact of different secondary parameters on the ΔΣ profile. We focus on the impact of MAR on ΔΣ. We show that a 3σ detection of variations in MAR at fixed halo mass could be achieved with the Hyper Suprime Cam survey assuming no baryonic effects and a proxy for MAR with scatter <1.5. We show that the full radial profile of ΔΣ depends on secondary properties at fixed halo mass. Consequently, an emulator that can perform full shape fitting yields better than two times improvement upon the constraints on MAR than only using the outer part of the halo. Finally, we highlight that miscentring and MAR impact the radial profile of ΔΣ in a similar fashion, implying that miscentring and MAR need to be modelled jointly for unbiased estimates of both effects. We show that present-day lensing data sets have the statistical capability to place constraints on halo MAR within our assumptions. Our analysis opens up new possibilities for observationally measuring the assembly history of the dark matter haloes that host galaxies and clusters.
-
null (Ed.)ABSTRACT Halo models provide a simple and computationally inexpensive way to investigate the connection between galaxies and their dark matter haloes. However, these models rely on the assumption that the role of baryons can easily be parametrized in the modelling procedure. We aim to examine the ability of halo occupation distribution (HOD) modelling to reproduce the galaxy clustering found in two different hydrodynamic simulations, Illustris and EAGLE. For each simulation, we measure several galaxy clustering statistics on two different luminosity threshold samples. We then apply a simple five parameter HOD, which was fit to each simulation separately, to the corresponding dark matter-only simulations, and measure the same clustering statistics. We find that the halo mass function is shifted to lower masses in the hydrodynamic simulations, resulting in a galaxy number density that is too high when an HOD is applied to the dark matter-only simulation. However, the exact way in which baryons alter the mass function is remarkably different in the two simulations. After applying a correction to the halo mass function in each simulation, the HOD is able to accurately reproduce all clustering statistics for the high luminosity sample of galaxies. For the low luminosity sample, we find evidence that in addition to correcting the halo mass function, including spatial, velocity, and assembly bias parameters in the HOD is necessary to accurately reproduce clustering statistics.more » « less
-
ABSTRACT We introduce a new model of the evolution of the concentration of dark matter haloes, c(t). For individual haloes, our model approximates c(t) as a power law with a time-dependent index, such that at early times, concentration has a nearly constant value of c ≈ 3–4, and as cosmic time progresses, c(t) smoothly increases. Using large samples of halo merger trees taken from the Bolshoi–Planck and MultiDark Planck 2 cosmological simulations, we demonstrate that our three-parameter model can approximate the evolution of the concentration of individual haloes with a typical accuracy of 0.1 dex for $t\gtrsim 2\, {\rm Gyr}$ for all Bolshoi–Planck and MultiDark Planck 2 haloes of present-day peak mass $M_{0}\gtrsim 10^{11.5}\, {\rm M}_{\odot }$. We additionally present a new model of the evolution of the concentration of halo populations, which we show faithfully reproduces both average concentration growth and the diversity of smooth trajectories of c(t), including capturing correlations with halo mass and halo assembly history. Our publicly available source code, diffprof, can be used to generate Monte Carlo realizations of the concentration histories of cosmologically representative halo populations. diffprof is differentiable due to its implementation in the jax autodiff library, which facilitates the incorporation of our model into existing analytical halo model frameworks.