Title: Deep generative models for galaxy image simulations
ABSTRACT Image simulations are essential tools for preparing and validating the analysis of current and future wide-field optical surveys. However, the galaxy models used as the basis for these simulations are typically limited to simple parametric light profiles, or use a fairly limited amount of available space-based data. In this work, we propose a methodology based on deep generative models to create complex models of galaxy morphologies that may meet the image simulation needs of upcoming surveys. We address the technical challenges associated with learning this morphology model from noisy and point spread function (PSF)-convolved images by building a hybrid Deep Learning/physical Bayesian hierarchical model for observed images, explicitly accounting for the PSF and noise properties. The generative model is further made conditional on physical galaxy parameters, to allow for sampling new light profiles from specific galaxy populations. We demonstrate our ability to train and sample from such a model on galaxy postage stamps from the HST/ACS COSMOS survey, and validate the quality of the model using a range of second- and higher order morphology statistics. Using this set of statistics, we demonstrate significantly more realistic morphologies using these deep generative models compared to conventional parametric models. To help make these generative models practical tools for the community, we introduce galsim-hub, a community-driven repository of generative models, and a framework for incorporating generative models within the galsim image simulation software.
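The abstract describes observed images as a noise-free light profile that has been convolved with the PSF and then degraded by noise. As a rough illustration of that forward model only, the sketch below blurs a toy image with a Gaussian stand-in PSF and adds Gaussian pixel noise. It is not the paper's model and does not use galsim or galsim-hub; the function names, the periodic boundary handling, and the Gaussian forms of the PSF and noise are all assumptions made for this example.

```python
import math
import random


def gaussian_kernel(size, sigma):
    """Return a size x size Gaussian kernel normalized to unit sum (stand-in PSF)."""
    half = size // 2
    kernel = [
        [math.exp(-((x - half) ** 2 + (y - half) ** 2) / (2.0 * sigma ** 2))
         for x in range(size)]
        for y in range(size)
    ]
    total = sum(sum(row) for row in kernel)
    return [[v / total for v in row] for row in kernel]


def convolve_periodic(image, kernel):
    """Circular 2D convolution; with a unit-sum kernel the total flux is unchanged."""
    ny, nx = len(image), len(image[0])
    half = len(kernel) // 2
    out = [[0.0] * nx for _ in range(ny)]
    for y in range(ny):
        for x in range(nx):
            acc = 0.0
            for ky, krow in enumerate(kernel):
                for kx, kval in enumerate(krow):
                    acc += kval * image[(y - ky + half) % ny][(x - kx + half) % nx]
            out[y][x] = acc
    return out


def observe(model_image, psf, noise_sigma, seed=0):
    """Toy forward model: PSF-convolve the noise-free image, then add Gaussian noise."""
    rng = random.Random(seed)
    blurred = convolve_periodic(model_image, psf)
    return [[v + rng.gauss(0.0, noise_sigma) for v in row] for row in blurred]


# Hypothetical 9x9 postage stamp containing a single bright pixel.
stamp = [[0.0] * 9 for _ in range(9)]
stamp[4][4] = 100.0
psf = gaussian_kernel(5, 1.0)
observed = observe(stamp, psf, noise_sigma=0.5)
```

Learning a morphology model from real data amounts to inverting this kind of degradation, which is why the abstract stresses that the PSF and noise properties are accounted for explicitly.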
Award ID(s):
2020295
NSF-PAR ID:
10293510
Journal Name:
Monthly Notices of the Royal Astronomical Society
Volume:
504
Issue:
4
ISSN:
0035-8711
Page Range / eLocation ID:
5543 to 5555
Sponsoring Org:
National Science Foundation
More Like this
  1.
    Weak lensing measurements suffer from well-known shear estimation biases, which can be partially corrected for with the use of image simulations. In this work we present an analysis of simulated images that mimic Hubble Space Telescope/Advanced Camera for Surveys observations of high-redshift galaxy clusters, including cluster-specific issues such as non-weak shear and increased blending. Our synthetic galaxies have been generated to have similar observed properties as the background-selected source samples studied in the real images. First, we used simulations with galaxies placed on a grid to determine a revised signal-to-noise-dependent (S/N_KSB) correction for multiplicative shear measurement bias, and to quantify the sensitivity of our KSB+ bias calibration to mismatches of galaxy or PSF properties between the real data and the simulations. Next, we studied the impact of increased blending and light contamination from cluster and foreground galaxies, finding it to be negligible for high-redshift (z > 0.7) clusters, whereas shear measurements can be affected at the ∼1% level for lower redshift clusters given their brighter member galaxies. Finally, we studied the impact of fainter neighbours and selection bias using a set of simulated images that mimic the positions and magnitudes of galaxies in Cosmic Assembly Near-IR Deep Extragalactic Legacy Survey (CANDELS) data, thereby including realistic clustering. While the initial SExtractor object detection causes a multiplicative shear selection bias of −0.028 ± 0.002, this is reduced to −0.016 ± 0.002 by further cuts applied in our pipeline. Given the limited depth of the CANDELS data, we compared our CANDELS-based estimate for the impact of faint neighbours on the multiplicative shear measurement bias to a grid-based analysis, to which we added clustered galaxies to even fainter magnitudes based on Hubble Ultra Deep Field data, yielding a refined estimate of ∼−0.013.
    Our sensitivity analysis suggests that our pipeline is calibrated to an accuracy of ∼0.015 once all corrections are applied, which is fully sufficient for current and near-future weak lensing studies of high-redshift clusters. As an application, we used it for a refined analysis of three highly relaxed clusters from the South Pole Telescope Sunyaev-Zeldovich survey, where we now included measurements down to the cluster core (r > 200 kpc) as enabled by our work. Compared to previously employed scales (r > 500 kpc), this tightens the cluster mass constraints by a factor of 1.38 on average.
  2. ABSTRACT

    Machine learning models can greatly improve the search for strong gravitational lenses in imaging surveys by reducing the amount of human inspection required. In this work, we test the performance of supervised, semi-supervised, and unsupervised learning algorithms trained with the ResNetV2 neural network architecture on their ability to efficiently find strong gravitational lenses in the Deep Lens Survey (DLS). We use galaxy images from the survey, combined with simulated lensed sources, as labeled data in our training data sets. We find that models using semi-supervised learning along with data augmentations (transformations applied to an image during training, e.g. rotation) and Generative Adversarial Network (GAN) generated images yield the best performance. They offer 5–10 times better precision across all recall values compared to supervised algorithms. Applying the best performing models to the full 20 deg² DLS survey, we find 3 Grade-A lens candidates within the top 17 image predictions from the model. This increases to 9 Grade-A and 13 Grade-B candidates when 1 per cent (∼2500 images) of the model predictions are visually inspected. This is ≳10× the sky density of lens candidates compared to current shallower wide-area surveys (such as the Dark Energy Survey), indicating a trove of lenses awaiting discovery in upcoming deeper all-sky surveys. These results suggest that pipelines tasked with finding strong lens systems can be highly efficient, minimizing human effort. We additionally report spectroscopic confirmation of the lensing nature of two Grade-A candidates identified by our model, further validating our methods.

     
  3. ABSTRACT

    Weak gravitational lensing is one of the most powerful tools for cosmology, while subject to challenges in quantifying subtle systematic biases. The point spread function (PSF) can cause biases in weak lensing shear inference when the PSF model does not match the true PSF that is convolved with the galaxy light profile. Although the effect of PSF size and shape errors – i.e. errors in second moments – is well studied, weak lensing systematics associated with errors in higher moments of the PSF model require further investigation. The goal of our study is to estimate their potential impact for LSST weak lensing analysis. We go beyond second moments of the PSF by using image simulations to relate multiplicative bias in shear to errors in the higher moments of the PSF model. We find that the current level of errors in higher moments of the PSF model in data from the Hyper Suprime-Cam survey can induce a ∼0.05 per cent shear bias, making this effect unimportant for ongoing surveys but relevant at the precision of upcoming surveys such as LSST.

     
  4. ABSTRACT

    Using the TNG50 cosmological simulation and observations from the Kilo-Degree Survey (KiDS), we investigate the connection between galaxy mergers and optical morphology in the local Universe over a wide range of galaxy stellar masses (8.5 ≤ log (M*/M⊙) ≤ 11). To this end, we have generated over 16 000 synthetic images of TNG50 galaxies designed to match KiDS observations, including the effects of dust attenuation and scattering, and used the statmorph code to measure various image-based morphological diagnostics in the r-band for both data sets. Such measurements include the Gini–M20 and concentration–asymmetry–smoothness statistics. Overall, we find good agreement between the optical morphologies of TNG50 and KiDS galaxies, although the former are slightly more concentrated and asymmetric than their observational counterparts. Afterwards, we trained a random forest classifier to identify merging galaxies in the simulation (including major and minor mergers) using the morphological diagnostics as the model features, along with merger statistics from the merger trees as the ground truth. We find that the asymmetry statistic exhibits the highest feature importance of all the morphological parameters considered. Thus, the performance of our algorithm is comparable to that of the more traditional method of selecting highly asymmetric galaxies. Finally, using our trained model, we estimate the galaxy merger fraction in both our synthetic and observational galaxy samples, finding in both cases that the galaxy merger fraction increases steadily as a function of stellar mass.

     
  5. Cosmological simulations of galaxy formation are limited by finite computational resources. We draw from the ongoing rapid advances in artificial intelligence (AI; specifically deep learning) to address this problem. Neural networks have been developed to learn from high-resolution (HR) image data and then make accurate superresolution (SR) versions of different low-resolution (LR) images. We apply such techniques to LR cosmological N-body simulations, generating SR versions. Specifically, we are able to enhance the simulation resolution by generating 512 times more particles and predicting their displacements from the initial positions. Therefore, our results can be viewed as simulation realizations themselves, rather than projections, e.g., to their density fields. Furthermore, the generation process is stochastic, enabling us to sample the small-scale modes conditioning on the large-scale environment. Our model learns from only 16 pairs of small-volume LR-HR simulations and is then able to generate SR simulations that successfully reproduce the HR matter power spectrum to percent level up to 16 h⁻¹ Mpc and the HR halo mass function to within 10% down to 10¹¹ M⊙. We successfully deploy the model in a box 1,000 times larger than the training simulation box, showing that high-resolution mock surveys can be generated rapidly. We conclude that AI assistance has the potential to revolutionize modeling of small-scale galaxy-formation physics in large cosmological volumes.

     
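Items 1 and 3 in the list above quote multiplicative shear biases. Under the usual linear convention, the measured shear is related to the true shear by g_obs = (1 + m) g_true + c, where m is the multiplicative bias and c the additive bias. The sketch below recovers m and c by ordinary least squares from simulated input and output shears. The function name and the numbers are illustrative assumptions: m = −0.016 merely echoes the magnitude quoted in item 1 and c = 0.001 is arbitrary. None of this is taken from the pipelines those papers describe.

```python
def fit_shear_bias(g_true, g_obs):
    """Least-squares fit of g_obs = (1 + m) * g_true + c; returns (m, c)."""
    n = len(g_true)
    mean_t = sum(g_true) / n
    mean_o = sum(g_obs) / n
    cov = sum((t - mean_t) * (o - mean_o) for t, o in zip(g_true, g_obs))
    var = sum((t - mean_t) ** 2 for t in g_true)
    slope = cov / var
    # The slope of the fitted line is (1 + m); the intercept is c.
    return slope - 1.0, mean_o - slope * mean_t


# Hypothetical input shears and noise-free "measurements" built with
# m = -0.016 and c = 0.001, so the fit should recover those values.
g_true = [-0.04, -0.02, 0.0, 0.02, 0.04]
g_obs = [(1.0 - 0.016) * g + 0.001 for g in g_true]
m, c = fit_shear_bias(g_true, g_obs)
```

In a real calibration the measured shears are noisy, so many simulated galaxies are needed per input shear before m can be constrained at the 10⁻³ level.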