skip to main content


Title: Deep forest: Neural network reconstruction of the Lyman-α forest
ABSTRACT We explore the use of Deep Learning to infer physical quantities from the observable transmitted flux in the Ly α forest. We train a Neural Network using redshift z = 3 outputs from cosmological hydrodynamic simulations and mock data sets constructed from them. We evaluate how well the trained network is able to reconstruct the optical depth for Ly α forest absorption from noisy and often saturated transmitted flux data. The Neural Network outperforms an alternative reconstruction method involving log inversion and spline interpolation by approximately a factor of 2 in the optical depth root mean square error. We find no significant dependence in the improvement on input data signal to noise, although the gain is greatest in high optical depth regions. The Ly α forest optical depth studied here serves as a simple, one dimensional, example but the use of Deep Learning and simulations to approach the inverse problem in cosmology could be extended to other physical quantities and higher dimensional data.  more » « less
Award ID(s):
1909193 2020295
NSF-PAR ID:
10290272
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Monthly Notices of the Royal Astronomical Society
Volume:
506
Issue:
4
ISSN:
0035-8711
Page Range / eLocation ID:
5212 to 5222
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT

    We explore the use of deep learning to infer the temperature of the intergalactic medium from the transmitted flux in the high-redshift Ly α forest. We train neural networks on sets of simulated spectra from redshift z = 2–3 outputs of cosmological hydrodynamic simulations, including high-temperature regions added in post-processing to approximate bubbles heated by He ii reionization. We evaluate how well the trained networks are able to reconstruct the temperature from the effect of Doppler broadening in the simulated input Ly α forest absorption spectra. We find that for spectra with high resolution (10 $\, {\rm km}\, {\rm s}^{-1}$ pixel) and moderate signal-to-noise ratio (20–50), the neural network is able to reconstruct the intergalactic medium temperature smoothed on scales of $\sim 6 \, h^{-1}\, {\rm Mpc}$ quite well. Concentrating on discontinuities, we find that high-temperature regions of width $25 \, h^{-1}\, {\rm Mpc}$ and temperature $20\, 000$ K can be fairly easily detected and characterized. We show an example where multiple sightlines are combined to yield tomographic images of hot bubbles. Deep learning techniques may be useful in this way to help us understand the complex temperature structure of the intergalactic medium around the time of helium reionization.

     
    more » « less
  2. Abstract The density and temperature properties of the intergalactic medium (IGM) reflect the heating and ionization history during cosmological structure formation, and are primarily probed by the Ly α forest of neutral hydrogen absorption features in the observed spectra of background sources. We present the methodology and initial results from the Cholla IGM Photoheating Simulation (CHIPS) suite performed with the graphics process unit–accelerated Cholla code to study the IGM at high, uniform spatial resolution maintained over large volumes. In this first paper, we examine the IGM structure in CHIPS cosmological simulations that include IGM uniform photoheating and photoionization models where hydrogen reionization is completed early or by redshift z ∼ 6. Comparing with observations of the large- and small-scale Ly α transmitted flux power spectra P ( k ) at redshifts 2 ≲ z ≲ 5.5, the relative agreement of the models depends on scale, with the self-consistent Puchwein et al. IGM photoheating and photoionization model in good agreement with the flux P ( k ) at k ≳ 0.01 s km −1 at redshifts 2 ≲ z ≲ 3.5. On larger scales, the P ( k ) measurements increase in amplitude from z ∼ 4.6 to z ∼ 2.2, faster than the models, and lie in between the model predictions at 2.2 ≲ z ≲ 4.6 for k ≈ 0.002–0.01 s km −1 . We argue that the models could improve by changing the He ii photoheating rate associated with active galactic nuclei to reduce the IGM temperature at z ∼ 3. At higher redshifts, z ≳ 4.5, the observed flux P ( k ) amplitude increases at a rate intermediate between the models, and we argue that for models where hydrogen reionization is completed late ( z ∼ 5.5–6), resolving this disagreement will require inhomogeneous or “patchy” reionization. We then use an additional set of simulations to demonstrate that our results have numerically converged and are not strongly affected by varying cosmological parameters. 
    more » « less
  3. ABSTRACT We compare a sample of five high-resolution, high S/N  Ly α forest spectra of bright 6 < z < ∼6.5 QSOs aimed at spectrally resolving the last remaining transmission spikes at z > 5 with those obtained from mock absorption spectra from the Sherwoodand Sherwood–Relics simulation suites of hydrodynamical simulations of the intergalactic medium (IGM). We use a profile-fitting procedure for the inverted transmitted flux, 1 − F, similar to the widely used Voigt profile fitting of the transmitted flux F at lower redshifts, to characterize the transmission spikes that probe predominately underdense regions of the IGM. We are able to reproduce the width and height distributions of the transmission spikes, both with optically thin simulations of the post-reionization Universe using a homogeneous UV background and full radiative transfer simulations of a late reionization model. We find that the width of the fitted components of the simulated transmission spikes is very sensitive to the instantaneous temperature of the reionized IGM. The internal structures of the spikes are more prominent in low temperature models of the IGM. The width distribution of the observed transmission spikes, which require high spectral resolution (≤ 8  km s−1) to be resolved, is reproduced for optically thin simulations with a temperature at mean density of T0 = (11 000 ± 1600, 10 500 ± 2100, 12 000 ± 2200) K at z = (5.4, 5.6, 5.8). This is weakly dependent on the slope of the temperature-density relation, which is favoured to be moderately steeper than isothermal. In the inhomogeneous, late reionization, full radiative transfer simulations where islands of neutral hydrogen persist to z ∼ 5.3, the width distribution of the observed transmission spikes is consistent with the range of T0 caused by spatial fluctuations in the temperature–density relation. 
    more » « less
  4. Abstract

    We present a new investigation of the intergalactic medium near reionization using dark gaps in the Lyβforest. With its lower optical depth, Lyβoffers a potentially more sensitive probe to any remaining neutral gas compared to the commonly used Lyαline. We identify dark gaps in the Lyβforest using spectra of 42 QSOs atzem> 5.5, including new data from the XQR-30 VLT Large Programme. Approximately 40% of these QSO spectra exhibit dark gaps longer than 10h−1Mpc atz≃ 5.8. By comparing the results to predictions from simulations, we find that the data are broadly consistent both with models where fluctuations in the Lyαforest are caused solely by ionizing ultraviolet background fluctuations and with models that include large neutral hydrogen patches atz< 6 due to a late end to reionization. Of particular interest is a very long (L= 28h−1Mpc) and dark (τeff≳ 6) gap persisting down toz≃ 5.5 in the Lyβforest of thez= 5.85 QSO PSO J025−11. This gap may support late reionization models with a volume-weighted average neutral hydrogen fraction of 〈xH I〉 ≳ 5% byz= 5.6. Finally, we infer constraints on 〈xH I〉 over 5.5 ≲z≲ 6.0 based on the observed Lyβdark gap length distribution and a conservative relationship between gap length and neutral fraction derived from simulations. We find 〈xH I〉 ≤ 0.05, 0.17, and 0.29 atz≃ 5.55, 5.75, and 5.95, respectively. These constraints are consistent with models where reionization ends significantly later thanz= 6.

     
    more » « less
  5. ABSTRACT

    The full-shape correlations of the Lyman alpha (Ly α) forest contain a wealth of cosmological information through the Alcock–Paczyński effect. However, these measurements are challenging to model without robustly testing and verifying the theoretical framework used for analysing them. Here, we leverage the accuracy and volume of the N-body simulation suite AbacusSummit to generate high-resolution Ly α skewers and quasi-stellar object (QSO) catalogues. One of the main goals of our mocks is to aid in the full-shape Ly α analysis planned by the Dark Energy Spectroscopic Instrument (DESI) team. We provide optical depth skewers for six of the fiducial cosmology base-resolution simulations ($L_{\rm box} = 2\, h^{-1}\, {\rm Gpc}$, N = 69123) at z = 2.5. We adopt a simple recipe based on the Fluctuating Gunn–Peterson Approximation (FGPA) for constructing these skewers from the matter density in an N-body simulation and calibrate it against the 1D and 3D Ly α power spectra extracted from the hydrodynamical simulation IllustrisTNG (TNG; $L_{\rm box} = 205\, h^{-1}\, {\rm Mpc}$, N = 25003). As an important application, we study the non-linear broadening of the baryon acoustic oscillation (BAO) peak and show the cross-correlation between DESI-like QSOs and our Ly α forest skewers. We find differences on small scales between the Kaiser approximation prediction and our mock measurements of the Ly α × QSO cross-correlation, which would be important to account for in upcoming analyses. The AbacusSummit Ly α forest mocks open up the possibility for improved modelling of cross-correlations between Ly α and cosmic microwave background (CMB) lensing and Ly α and QSOs, and for forecasts of the 3-point Ly α correlation function. Our catalogues and skewers are publicly available on Globus via the National Energy Research Scientific Computing Center (NERSC) (full link under the section ‘Data Availability’).

     
    more » « less