

Title: The GIGANTES Data Set: Precision Cosmology from Voids in the Machine-learning Era
Abstract

We present GIGANTES, the most extensive and realistic void catalog suite ever released, containing over 1 billion cosmic voids covering a volume larger than the observable universe, more than 20 TB of data, and created by running the void finder VIDE on QUIJOTE’s halo simulations. The GIGANTES suite, spanning thousands of cosmological models, opens up the study of voids, answering compelling questions: Do voids carry unique cosmological information? How is this information correlated with galaxy information? Leveraging the large number of voids in the GIGANTES suite, our Fisher constraints demonstrate voids contain additional information, critically tightening constraints on cosmological parameters. We use traditional void summary statistics (void size function, void density profile) and the void autocorrelation function, which independently yields an error of 0.13 eV on $\sum m_\nu$ for a $1\,h^{-3}\,{\rm Gpc}^3$ simulation, without cosmic microwave background priors. Combining halos and voids we forecast an error of 0.09 eV from the same volume, representing a gain of 60% compared to halos alone. Extrapolating to next generation multi-${\rm Gpc}^3$ surveys such as the Dark Energy Spectroscopic Instrument, Euclid, the Spectro-Photometer for the History of the Universe and Ices Explorer, and the Roman Space Telescope, we expect voids should yield an independent determination of neutrino mass. Crucially, GIGANTES is the first void catalog suite expressly built for intensive machine-learning exploration. We illustrate this by training a neural network to perform likelihood-free inference on the void size function, giving a ∼20% constraint on $\Omega_m$. Cosmology problems provide an impetus to develop novel deep-learning techniques. With GIGANTES, machine learning gains an impressive data set, offering unique problems that will stimulate new techniques.
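As a rough illustration of the kind of Fisher forecast the abstract refers to, the sketch below builds Fisher matrices for two toy probes and compares marginalized errors before and after combining them. All derivative vectors and covariances are made-up placeholder numbers, not GIGANTES measurements. Treating the two probes as independent (block-diagonal covariance) is a simplifying assumption; a real halo-plus-void analysis would need the full cross-covariance. The `rescale_errors` helper assumes the common $1/\sqrt{V}$ scaling of errors with survey volume, which is only an approximation.

```python
import numpy as np


def fisher_matrix(derivs, cov):
    """Gaussian Fisher matrix F_ij = d_i^T C^{-1} d_j.

    derivs: (n_params, n_bins) derivatives of the summary statistic
            with respect to each parameter.
    cov:    (n_bins, n_bins) covariance of the statistic, assumed
            independent of the parameters.
    """
    derivs = np.atleast_2d(np.asarray(derivs, dtype=float))
    return derivs @ np.linalg.solve(cov, derivs.T)


def marginalized_errors(fisher):
    """1-sigma errors on each parameter, marginalized over the others."""
    return np.sqrt(np.diag(np.linalg.inv(fisher)))


def rescale_errors(errors, volume_ratio):
    """Assumed 1/sqrt(V) scaling of errors with survey volume."""
    return np.asarray(errors, dtype=float) / np.sqrt(volume_ratio)


# Toy inputs (hypothetical): two parameters, probe A with 3 bins,
# probe B with 2 bins, and different degeneracy directions.
d_a = np.array([[1.0, 0.5, 0.2],
                [0.9, 0.6, 0.1]])
c_a = np.diag([0.04, 0.04, 0.04])
d_b = np.array([[0.3, 0.8],
                [-0.4, 0.7]])
c_b = np.diag([0.09, 0.09])

f_a = fisher_matrix(d_a, c_a)
f_b = fisher_matrix(d_b, c_b)

# Joint data vector with a block-diagonal covariance (independence assumed).
d_joint = np.hstack([d_a, d_b])
c_joint = np.zeros((5, 5))
c_joint[:3, :3] = c_a
c_joint[3:, 3:] = c_b
f_joint = fisher_matrix(d_joint, c_joint)

err_a = marginalized_errors(f_a)
err_b = marginalized_errors(f_b)
err_joint = marginalized_errors(f_joint)

print("probe A only :", err_a)
print("probe B only :", err_b)
print("A + B joint  :", err_joint)
print("joint, 4x volume (assumed scaling):", rescale_errors(err_joint, 4.0))
```

Under the independence assumption the joint Fisher matrix is simply the sum of the two single-probe matrices, so the joint marginalized errors can never be larger than those of either probe alone. The gain is largest when the probes have different parameter degeneracies.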

 
NSF-PAR ID:
10369721
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
DOI PREFIX: 10.3847
Date Published:
Journal Name:
The Astrophysical Journal
Volume:
935
Issue:
2
ISSN:
0004-637X
Format(s):
Medium: X Size: Article No. 100
Size(s):
Article No. 100
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    We investigate the properties of voids and void galaxies in the TNG300 simulation. Using a luminous galaxy catalog and a spherical void-finding algorithm, we identify 5078 voids at redshift $z = 0$. The voids cover 83% of the simulation volume and have a median radius of $4.4\,h^{-1}\,{\rm Mpc}$. We identify two populations of field galaxies based on whether the galaxies reside within a void (“void galaxies”; 75,220 objects) or outside a void (“nonvoid galaxies”; 527,454 objects). Within the voids, mass does not directly trace light. Instead, the mean radial underdensity profile as defined by the locations of void galaxies is systematically lower than the mean radial underdensity profile as defined by the dark matter (i.e., the voids are more “devoid” of galaxies than they are of mass). Within the voids, the integrated underdensity profiles of the dark matter and the galaxies are independent of the local background density (i.e., voids-in-voids versus voids-in-clouds). Beyond the void radii, however, the integrated underdensity profiles of both the dark matter and the galaxies exhibit strong dependencies on the local background density. Compared to nonvoid galaxies, void galaxies are on average younger, less massive, bluer in color, less metal enriched, and have smaller radii. In addition, the specific star formation rates of void galaxies are ∼20% higher than nonvoid galaxies and, in the case of galaxies with central supermassive black holes with $M_{\rm BH} \gtrsim 3 \times 10^{6}\,h^{-1}\,M_\odot$, the fraction of active void galaxies is ∼25% higher than active nonvoid galaxies.

     
  2. Abstract

    As the next generation of large galaxy surveys come online, it is becoming increasingly important to develop and understand the machine-learning tools that analyze big astronomical data. Neural networks are powerful and capable of probing deep patterns in data, but they must be trained carefully on large and representative data sets. We present a new “hump” of the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project: CAMELS-SAM, encompassing one thousand dark-matter-only simulations of $(100\,h^{-1}\,{\rm cMpc})^3$ with different cosmological parameters ($\Omega_m$ and $\sigma_8$) and run through the Santa Cruz semi-analytic model for galaxy formation over a broad range of astrophysical parameters. As a proof of concept for the power of this vast suite of simulated galaxies in a large volume and broad parameter space, we probe the power of simple clustering summary statistics to marginalize over astrophysics and constrain cosmology using neural networks. We use the two-point correlation, count-in-cells, and void probability functions, and we probe nonlinear and linear scales across $0.68 < R < 27\,h^{-1}\,{\rm cMpc}$. We find our neural networks can both marginalize over the uncertainties in astrophysics to constrain cosmology to 3%–8% error across various types of galaxy selections, while simultaneously learning about the SC-SAM astrophysical parameters. This work encompasses vital first steps toward creating algorithms able to marginalize over the uncertainties in our galaxy formation models and measure the underlying cosmology of our Universe. CAMELS-SAM has been publicly released alongside the rest of CAMELS, and it offers great potential to many applications of machine learning in astrophysics: https://camels-sam.readthedocs.io.

     
  3. Abstract

    Quantifying the connection between galaxies and their host dark matter halos has been key for testing cosmological models on various scales. Below $M \sim 10^{9}\,M_\odot$, such studies have primarily relied on the satellite galaxy population orbiting the Milky Way (MW). Here we present new constraints on the connection between satellite galaxies and their host dark matter subhalos using the largest sample of satellite galaxies in the Local Volume ($D \lesssim 12$ Mpc) to date. We use 250 confirmed and 71 candidate dwarf satellites around 27 MW-like hosts from the Exploration of Local VolumE Satellites (ELVES) Survey and use the semianalytical SatGen model for predicting the population of dark matter subhalos expected in the same volume. Through a Bayesian model comparison of the observed and the forward-modeled satellite stellar mass functions (SSMFs), we infer the satellite stellar-to-halo mass relation. We find that the observed SSMF is best reproduced when subhalos at the low-mass end are populated by a relation of the form $M_\ast \propto M_{\rm peak}^{\alpha}$, with a moderate slope of $\alpha_{\rm const} = 2.10 \pm 0.01$ and a low scatter, constant as a function of the peak halo mass, of $\sigma_{\rm const} = 0.06^{+0.07}_{-0.05}$. A model with a steeper slope ($\alpha_{\rm grow} = 2.39 \pm 0.06$) and a scatter that grows with decreasing $M_{\rm peak}$ is also consistent with the observed SSMF but is not required. Our new model for the satellite–subhalo connection, based on hundreds of Local Volume satellite galaxies, is in line with what was previously derived using only MW satellites.

     
  4. Abstract

    The formation of globular clusters and their relation to the distribution of dark matter have long puzzled astronomers. One of the most recently proposed globular cluster formation channels ties ancient star clusters to the large-scale streaming velocity of baryons relative to dark matter in the early universe. These streaming velocities affect the global infall of baryons into dark matter halos, the high-redshift halo mass function, and the earliest generations of stars. In some cases, streaming velocities may result in dense regions of dark matter-free gas that becomes Jeans unstable, potentially leading to the formation of compact star clusters. We investigate this hypothesis using cosmological hydrodynamical simulations that include a full chemical network and the formation and destruction of H$_2$, a process crucial for the formation of the first stars. We find that high-density gas in regions with significant streaming velocities is indeed somewhat offset from the centers of dark matter halos, but this offset is typically significantly smaller than the virial radius. Gas outside of dark matter halos never reaches Jeans-unstable densities in our simulations. We postulate that low-level ($Z \approx 10^{-3}\,Z_\odot$) metal enrichment by Population III supernovae may enable cooling in the extra-virial regions, allowing gas outside of dark matter halos to cool to the cosmic microwave background temperature and become Jeans unstable. Follow-up simulations that include both streaming velocities and metal enrichment by Population III supernovae are needed to understand if streaming velocities provide one path for the formation of globular clusters in the early universe.

     
  5. Abstract

    A number of independent observations suggest that the intergalactic medium was significantly neutral at $z = 7$ and that reionization was, perhaps, still in progress at $z = 5.7$. The narrowband survey, SILVERRUSH, has mapped over 2000 Lyman-α emitters (LAEs) at these redshifts. Previous analyses have assumed that reionization was over by $z = 5.7$, but this data may actually sample the final stages of reionization when the last neutral islands were relegated to the cosmic voids. Motivated by these developments, we re-examine LAE void and peak statistics and their ability to constrain reionization. We construct models of the LAE distribution in $(1\,{\rm Gpc}\,h^{-1})^3$ volumes, spanning a range of neutral fractions at $z = 5.7$ and 6.6. Models with a higher neutral fraction show an enhanced probability of finding holes in the LAE distribution. When comparing models at fixed mean surface density, however, LAEs obscured by neutral gas in the voids must be compensated by visible LAEs elsewhere. Hence, in these models, the likelihood of finding an overdense peak is also enhanced in the latter half of reionization. Compared to the widely used angular two-point correlation function (2PCF), we find that the void probability function (VPF) provides a more sensitive test of models during the latter half of reionization. By comparison, at neutral fractions ∼50 per cent, the VPF and a simple peak thresholding statistic are both similar to the 2PCF in constraining power. Lastly, we find that the cosmic variance and large-scale asymmetries observed in the SILVERRUSH fields are consistent with large-scale structure in a ΛCDM universe.
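Items 2 and 5 above both rely on the void probability function (VPF), the probability that a randomly placed sphere of radius $R$ contains no objects. The following is a minimal Monte Carlo sketch of that estimator in a periodic box, using only NumPy. It is not the pipeline of either paper: the function names, the uniform random test catalog, and the choice of 2000 test spheres are all illustrative assumptions. For an unclustered (Poisson) catalog the result can be checked against the analytic expectation $P_0(R) = \exp(-\bar{n}\,\tfrac{4}{3}\pi R^3)$.

```python
import numpy as np


def nearest_distance_periodic(centers, points, box_size):
    """Distance from each center to its nearest point in a periodic box."""
    delta = centers[:, None, :] - points[None, :, :]
    # Minimum-image convention: wrap separations into [-L/2, L/2].
    delta -= box_size * np.round(delta / box_size)
    return np.sqrt((delta ** 2).sum(axis=-1).min(axis=1))


def void_probability(points, box_size, radii, n_spheres=2000, seed=0):
    """Monte Carlo VPF: fraction of random spheres that contain no points.

    A sphere of radius R is empty exactly when the nearest point to its
    center lies farther away than R.
    """
    rng = np.random.default_rng(seed)
    centers = rng.uniform(0.0, box_size, size=(n_spheres, points.shape[1]))
    d_near = nearest_distance_periodic(centers, points, box_size)
    return np.array([(d_near > r).mean() for r in radii])


# Hypothetical test catalog: 500 unclustered points in a unit box.
rng = np.random.default_rng(42)
box_size = 1.0
points = rng.uniform(0.0, box_size, size=(500, 3))
radii = np.array([0.0, 0.05, 0.10])

vpf = void_probability(points, box_size, radii)

# Analytic expectation for a Poisson process with the same mean density.
mean_density = len(points) / box_size ** 3
poisson_vpf = np.exp(-mean_density * (4.0 / 3.0) * np.pi * radii ** 3)

for r, measured, expected in zip(radii, vpf, poisson_vpf):
    print(f"R = {r:.2f}: measured {measured:.3f}, Poisson {expected:.3f}")
```

The brute-force distance computation uses memory proportional to the number of spheres times the number of points, so a spatial tree or chunked evaluation would be the natural replacement for catalogs of realistic size. Clustered catalogs should show a VPF above the Poisson curve at fixed mean density, which is the signal these statistics exploit.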