skip to main content


Title: Morphological Parameters and Associated Uncertainties for 8 Million Galaxies in the Hyper Suprime-Cam Wide Survey
Abstract

We use the Galaxy Morphology Posterior Estimation Network (GaMPEN) to estimate morphological parameters and associated uncertainties for ∼8 million galaxies in the Hyper Suprime-Cam Wide survey withz≤ 0.75 andm≤ 23. GaMPEN is a machine-learning framework that estimates Bayesian posteriors for a galaxy’s bulge-to-total light ratio (LB/LT), effective radius (Re), and flux (F). By first training on simulations of galaxies and then applying transfer learning using real data, we trained GaMPEN with <1% of our data set. This two-step process will be critical for applying machine-learning algorithms to future large imaging surveys, such as the Rubin-Legacy Survey of Space and Time, the Nancy Grace Roman Space Telescope, and Euclid. By comparing our results to those obtained using light profile fitting, we demonstrate that GaMPEN’s predicted posterior distributions are well calibrated (≲5% deviation) and accurate. This represents a significant improvement over light profile fitting algorithms, which underestimate uncertainties by as much as ∼60%. For an overlapping subsample, we also compare the derived morphological parameters with values in two external catalogs and find that the results agree within the limits of uncertainties predicted by GaMPEN. This step also permits us to define an empirical relationship between the Sérsic index andLB/LTthat can be used to convert between these two parameters. The catalog presented here represents a significant improvement in size (∼10×), depth (∼4 mag), and uncertainty quantification over previous state-of-the-art bulge+disk decomposition catalogs. With this work, we also release GaMPEN’s source code and trained models, which can be adapted to other data sets.

 
more » « less
NSF-PAR ID:
10441093
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
DOI PREFIX: 10.3847
Date Published:
Journal Name:
The Astrophysical Journal
Volume:
953
Issue:
2
ISSN:
0004-637X
Format(s):
Medium: X Size: Article No. 134
Size(s):
["Article No. 134"]
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    We introduce a novel machine-learning framework for estimating the Bayesian posteriors of morphological parameters for arbitrarily large numbers of galaxies. The Galaxy Morphology Posterior Estimation Network (GaMPEN) estimates values and uncertainties for a galaxy’s bulge-to-total-light ratio (LB/LT), effective radius (Re), and flux (F). To estimate posteriors, GaMPEN uses the Monte Carlo Dropout technique and incorporates the full covariance matrix between the output parameters in its loss function. GaMPEN also uses a spatial transformer network (STN) to automatically crop input galaxy frames to an optimal size before determining their morphology. This will allow it to be applied to new data without prior knowledge of galaxy size. Training and testing GaMPEN on galaxies simulated to matchz< 0.25 galaxies in Hyper Suprime-Cam Wideg-band images, we demonstrate that GaMPEN achieves typical errors of 0.1 inLB/LT, 0.″17 (∼7%) inRe, and 6.3 × 104nJy (∼1%) inF. GaMPEN's predicted uncertainties are well calibrated and accurate (<5% deviation)—for regions of the parameter space with high residuals, GaMPEN correctly predicts correspondingly large uncertainties. We also demonstrate that we can apply categorical labels (i.e., classifications such ashighly bulge dominated) to predictions in regions with high residuals and verify that those labels are ≳97% accurate. To the best of our knowledge, GaMPEN is the first machine-learning framework for determining joint posterior distributions of multiple morphological parameters and is also the first application of an STN to optical imaging in astronomy.

     
    more » « less
  2. ABSTRACT We present the MaNGA PyMorph photometric Value Added Catalogue (MPP-VAC-DR17) and the MaNGA Deep Learning Morphological VAC (MDLM-VAC-DR17) for the final data release of the MaNGA survey, which is part of the SDSS Data Release 17 (DR17). The MPP-VAC-DR17 provides photometric parameters from Sérsic and Sérsic+Exponential fits to the two-dimensional surface brightness profiles of the MaNGA DR17 galaxy sample in the g, r, and i bands (e.g. total fluxes, half-light radii, bulge-disc fractions, ellipticities, position angles, etc.). The MDLM-VAC-DR17 provides deep-learning-based morphological classifications for the same galaxies. The MDLM-VAC-DR17 includes a number of morphological properties, for example, a T-Type, a finer separation between elliptical and S0, as well as the identification of edge-on and barred galaxies. While the MPP-VAC-DR17 simply extends the MaNGA PyMorph photometric VAC published in the SDSS Data Release 15 (MPP-VAC-DR15) to now include galaxies that were added to make the final DR17, the MDLM-VAC-DR17 implements some changes and improvements compared to the previous release (MDLM-VAC-DR15): Namely, the low end of the T-Types is better recovered in this new version. The catalogue also includes a separation between early or late type, which classifies the two populations in a complementary way to the T-Type, especially at the intermediate types (−1 < T-Type < 2), where the T-Type values show a large scatter. In addition, k-fold-based uncertainties on the classifications are also provided. To ensure robustness and reliability, we have also visually inspected all the images. We describe the content of the catalogues and show some interesting ways in which they can be combined. 
    more » « less
  3. Abstract

    We present a machine-learning framework to accurately characterize the morphologies of active galactic nucleus (AGN) host galaxies withinz< 1. We first use PSFGAN to decouple host galaxy light from the central point source, then we invoke the Galaxy Morphology Network (GaMorNet) to estimate whether the host galaxy is disk-dominated, bulge-dominated, or indeterminate. Using optical images from five bands of the HSC Wide Survey, we build models independently in three redshift bins: low (0 <z< 0.25), mid (0.25 <z< 0.5), and high (0.5 <z< 1.0). By first training on a large number of simulated galaxies, then fine-tuning using far fewer classified real galaxies, our framework predicts the actual morphology for ∼60%–70% of the host galaxies from test sets, with a classification precision of ∼80%–95%, depending on the redshift bin. Specifically, our models achieve a disk precision of 96%/82%/79% and bulge precision of 90%/90%/80% (for the three redshift bins) at thresholds corresponding to indeterminate fractions of 30%/43%/42%. The classification precision of our models has a noticeable dependency on host galaxy radius and magnitude. No strong dependency is observed on contrast ratio. Comparing classifications of real AGNs, our models agree well with traditional 2D fitting with GALFIT. The PSFGAN+GaMorNetframework does not depend on the choice of fitting functions or galaxy-related input parameters, runs orders of magnitude faster than GALFIT, and is easily generalizable via transfer learning, making it an ideal tool for studying AGN host galaxy morphology in forthcoming large imaging surveys.

     
    more » « less
  4. Abstract

    The observed lensed fraction of high-redshift quasars (∼0.2%) is significantly lower than previous theoretical predictions (≳4%). We revisit the lensed fraction of high-redshift quasars predicted by theoretical models, where we adopt recent measurements of galaxy velocity dispersion functions (VDFs) and explore a wide range of quasar luminosity function (QLF) parameters. We use both analytical methods and mock catalogs, which give consistent results. For ordinary QLF parameters and the depth of current high-redshift quasar surveys (mz≲ 22), our model suggests a multiply imaged fraction ofFmulti∼ 0.4%–0.8%. The predicted lensed fraction is ∼1%–6% for the brightestzs∼ 6 quasars (mz≲ 19), depending on the QLF. The systematic uncertainties of the predicted lensed fraction in previous models can be as large as 2–4 times and are dominated by the VDF. Applying VDFs from recent measurements decreases the predicted lensed fraction and relieves the tension between observations and theoretical models. Given the depth of current imaging surveys, there are ∼15 lensed quasars atzs> 5.5 detectable over the sky. Upcoming sky surveys like the Legacy Survey of Space and Time survey and the Euclid survey will find several tens of lensed quasars at this redshift range.

     
    more » « less
  5. Abstract

    As the next generation of large galaxy surveys come online, it is becoming increasingly important to develop and understand the machine-learning tools that analyze big astronomical data. Neural networks are powerful and capable of probing deep patterns in data, but they must be trained carefully on large and representative data sets. We present a new “hump” of the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project: CAMELS-SAM, encompassing one thousand dark-matter-only simulations of (100h−1cMpc)3with different cosmological parameters (Ωmandσ8) and run through the Santa Cruz semi-analytic model for galaxy formation over a broad range of astrophysical parameters. As a proof of concept for the power of this vast suite of simulated galaxies in a large volume and broad parameter space, we probe the power of simple clustering summary statistics to marginalize over astrophysics and constrain cosmology using neural networks. We use the two-point correlation, count-in-cells, and void probability functions, and we probe nonlinear and linear scales across 0.68 <R<27h−1cMpc. We find our neural networks can both marginalize over the uncertainties in astrophysics to constrain cosmology to 3%–8% error across various types of galaxy selections, while simultaneously learning about the SC-SAM astrophysical parameters. This work encompasses vital first steps toward creating algorithms able to marginalize over the uncertainties in our galaxy formation models and measure the underlying cosmology of our Universe. CAMELS-SAM has been publicly released alongside the rest of CAMELS, and it offers great potential to many applications of machine learning in astrophysics:https://camels-sam.readthedocs.io.

     
    more » « less