Title: Lessons learned from the two largest Galaxy morphological classification catalogues built by convolutional neural networks
ABSTRACT We compare the two largest galaxy morphology catalogues, which separate early- and late-type galaxies at intermediate redshift. The two catalogues were built by applying supervised deep learning (convolutional neural networks, CNNs) to the Dark Energy Survey data down to a magnitude limit of ∼21 mag. The methodologies used for the construction of the catalogues include differences such as the cutout sizes, the labels used for training, and the input to the CNN – monochromatic images versus gri-band normalized images. In addition, one catalogue is trained using bright galaxies observed with DES (i < 18), while the other is trained with bright galaxies (r < 17.5) and ‘emulated’ galaxies up to r-band magnitude 22.5. Despite the different approaches, the agreement between the two catalogues is excellent up to i < 19, demonstrating that CNN predictions are reliable for samples at least one magnitude fainter than the training sample limit. It also shows that morphological classifications based on monochromatic images are comparable to those based on gri-band images, at least in the bright regime. At fainter magnitudes, i > 19, the overall agreement is good (∼95 per cent), but is mostly driven by the large spiral fraction in the two catalogues. In contrast, the agreement within the elliptical population is not as good, especially at faint magnitudes. By studying the mismatched cases, we are able to identify lenticular galaxies (at least up to i < 19), which are difficult to distinguish using standard classification approaches. The synergy of both catalogues provides a unique opportunity to select a population of unusual galaxies.
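The abstract notes that a high overall agreement can be driven mostly by the dominant (spiral) class while agreement within the elliptical population is weaker. The following minimal Python sketch illustrates that effect on a made-up toy sample. The function name `agreement_report`, the labels `"LTG"`/`"ETG"`, and the counts are assumptions chosen for illustration; they are not taken from either catalogue.

```python
def agreement_report(labels_a, labels_b):
    """Return overall agreement and agreement within each class of catalogue A."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both catalogues must label the same objects")
    pairs = list(zip(labels_a, labels_b))
    overall = sum(a == b for a, b in pairs) / len(pairs)
    per_class = {}
    for cls in sorted(set(labels_a)):
        # Restrict to objects that catalogue A assigns to this class.
        in_cls = [(a, b) for a, b in pairs if a == cls]
        per_class[cls] = sum(a == b for a, b in in_cls) / len(in_cls)
    return overall, per_class


# Hypothetical toy sample: catalogue A has 90 late types and 10 early types.
cat_a = ["LTG"] * 90 + ["ETG"] * 10
# Catalogue B agrees on every late type but on only half of the early types.
cat_b = ["LTG"] * 90 + ["ETG"] * 5 + ["LTG"] * 5

overall, per_class = agreement_report(cat_a, cat_b)
print(overall, per_class)
```

In this toy case the overall agreement is 95 per cent even though the two label sets agree on only half of the early-type objects, which is why a per-class breakdown is more informative than a single agreement figure when one class dominates.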
Award ID(s):
2020295
PAR ID:
10382671
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Monthly Notices of the Royal Astronomical Society
Volume:
518
Issue:
2
ISSN:
0035-8711
Page Range / eLocation ID:
p. 2794-2809
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract We present morphological classifications of ∼27 million galaxies from the Dark Energy Survey (DES) Data Release 1 (DR1) using a supervised deep learning algorithm. The classification scheme separates: (a) early-type galaxies (ETGs) from late-types (LTGs), and (b) face-on galaxies from edge-on. Our Convolutional Neural Networks (CNNs) are trained on a small subset of DES objects with previously known classifications. These typically have mr ≲ 17.7 mag; we model fainter objects to mr < 21.5 mag by simulating what the brighter objects with well determined classifications would look like if they were at higher redshifts. The CNNs reach 97% accuracy to mr < 21.5 on their training sets, suggesting that they are able to recover features more accurately than the human eye. We then used the trained CNNs to classify the vast majority of the other DES images. The final catalog comprises five independent CNN predictions for each classification scheme, helping to determine if the CNN predictions are robust or not. We obtain secure classifications for ∼87% and 73% of the catalog for the ETG vs. LTG and edge-on vs. face-on models, respectively. Combining the two classifications (a) and (b) helps to increase the purity of the ETG sample and to identify edge-on lenticular galaxies (as ETGs with high ellipticity). Where a comparison is possible, our classifications correlate very well with Sérsic index (n), ellipticity (ε) and spectral type, even for the fainter galaxies. This is the largest multi-band catalog of automated galaxy morphologies to date.
  2. ABSTRACT We present a mock image catalogue of ∼100 000 MUV ≃ −22.5 to −19.6 mag galaxies at z = 7–12 from the BlueTides cosmological simulation. We create mock images of each galaxy with the James Webb Space Telescope (JWST), Hubble, Roman, and Euclid Space Telescopes, as well as Subaru and VISTA, with a range of near- and mid-infrared filters. We perform photometry on the mock images to estimate the success of these instruments for detecting high-z galaxies. We predict that JWST will have unprecedented power in detecting high-z galaxies, with a 95 per cent completeness limit at least 2.5 mag fainter than VISTA and Subaru, 1.1 mag fainter than Hubble, and 0.9 mag fainter than Roman, for the same wavelength and exposure time. Focusing on JWST, we consider a range of exposure times and filters, and find that the NIRCam F356W and F277W filters will detect the faintest galaxies, with 95 per cent completeness at m ≃ 27.4 mag in 10-ks exposures. We also predict the number of high-z galaxies that will be discovered by upcoming JWST imaging surveys. We predict that the COSMOS-Web survey will detect ∼1000 M1500 Å < −20.1 mag galaxies at 6.5 < z < 7.5, by virtue of its large survey area. JADES-Medium will detect almost 100 per cent of M1500 Å ≲ −20 mag galaxies at z < 8.5 due to its significant depth; however, with its smaller survey area, it will detect only ∼100 of these galaxies at 6.5 < z < 7.5. Cosmic variance results in a large range in the number of predicted galaxies each survey will detect, which is more evident in smaller surveys such as CEERS and the PEARLS NEP and GOODS-S fields.
  3. Using Zwicky Transient Facility (ZTF) observations, we identify a pair of "sibling" Type Ia supernovae (SNe Ia), i.e., hosted by the same galaxy at z = 0.0541. They exploded within 200 days of each other at a separation of 0.6″, corresponding to a projected distance of only 0.6 kpc. Performing SALT2 light curve fits to the gri ZTF photometry, we show that for these equally distant "standardizable candles", there is a difference of 2 magnitudes in their rest frame B-band peaks, and the fainter SN has a significantly red SALT2 colour c = 0.57 ± 0.04, while the stretch values x1 of the two SNe are similar, suggesting that the fainter SN is attenuated by dust in the interstellar medium of the host galaxy. We use these measurements to infer the SALT2 colour standardization parameter, β = 3.5 ± 0.3, independent of the underlying cosmology and Malmquist bias. Assuming the colour excess is entirely due to dust, the result differs by 2σ from the average Milky-Way total-to-selective extinction ratio, but is in good agreement with the colour-brightness corrections empirically derived from the most recent SN Ia Hubble-Lemaître diagram fits. Thus we suggest that SN "siblings", which will increasingly be discovered in the coming years, can be used to probe the validity of the colour and lightcurve shape corrections used in SN Ia cosmology while avoiding important systematic effects in their inference from global multi-parameter fits to inhomogeneous data-sets, and also help constrain the role of interstellar dust in SN Ia cosmology.
  4. ABSTRACT The James Webb Space Telescope (JWST) is expected to observe galaxies at z > 10 that are presently inaccessible. Here, we use a self-consistent empirical model, the UniverseMachine, to generate mock galaxy catalogues and light-cones over the redshift range z = 0−15. These data include realistic galaxy properties (stellar masses, star formation rates, and UV luminosities), galaxy–halo relationships, and galaxy–galaxy clustering. Mock observables are also provided for different model parameters spanning observational uncertainties at z < 10. We predict that Cycle 1 JWST surveys will very likely detect galaxies with M* > 10⁷ M⊙ and/or M1500 < −17 out to at least z ∼ 13.5. Number density uncertainties at z > 12 expand dramatically, so efforts to detect z > 12 galaxies will provide the most valuable constraints on galaxy formation models. The faint-end slopes of the stellar mass/luminosity functions at a given mass/luminosity threshold steepen as redshift increases. This is because observable galaxies are hosted by haloes in the exponentially falling regime of the halo mass function at high redshifts. Hence, these faint-end slopes are robustly predicted to become shallower below current observable limits (M* < 10⁷ M⊙ or M1500 > −17). For reionization models, extrapolating luminosity functions with a constant faint-end slope from M1500 = −17 down to M1500 = −12 gives the most reasonable upper limit for the total UV luminosity and cosmic star formation rate up to z ∼ 12. We compare to three other empirical models and one semi-analytic model, showing that the range of predicted observables from our approach encompasses predictions from other techniques. Public catalogues and light-cones for common fields are available online.
  5. ABSTRACT We measure the size–mass relation and its evolution between redshifts 1 < z < 3, using galaxies lensed by six foreground Hubble Frontier Fields clusters. The power afforded by strong gravitational lensing allows us to observe galaxies with higher angular resolution beyond current facilities. We select a stellar mass limited sample and divide them into star-forming or quiescent classes based on their rest-frame UVJ colours from the ASTRODEEP catalogues. Source reconstruction is carried out with the recently released lenstruction software, which is built on the multipurpose gravitational lensing software lenstronomy. We derive the empirical relation between size and mass for the late-type galaxies with $$M_{*} > 3\times 10^{9}\, \mathrm{M}_{\odot }$$ at 1 < z < 2.5 and $$M_{*} > 5\times 10^{9}\, \mathrm{M}_{\odot }$$ at 2.5 < z < 3, and at a fixed stellar mass, we find galaxy sizes evolve as $$R_{\rm eff} \propto (1+z)^{-1.05\pm 0.37}$$. The intrinsic scatter is <0.1 dex at z < 1.5 but increases to ∼0.3 dex at higher redshift. The results are in good agreement with those obtained in blank fields. We evaluate the uncertainties associated with the choice of lens model by comparing size measurements using five different and publicly available models, finding the choice of lens model leads to a 3.7 per cent uncertainty of the median value, and ∼25 per cent scatter for individual galaxies. Our work demonstrates the use of strong lensing magnification to boost resolution does not introduce significant uncertainties in this kind of work, and paves the way for wholesale applications of the sophisticated lens reconstruction technique to higher redshifts and larger samples.
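
As a rough worked example of the size evolution quoted in item 5, the scaling of effective radius with (1+z) to the power −1.05 ± 0.37 can be turned into a size ratio between two redshifts at fixed stellar mass. The Python sketch below is illustrative only: the function name `size_ratio` and the choice of z = 1 and z = 3 as endpoints are assumptions, and the slope uncertainty is propagated simply by evaluating the two slope extremes.

```python
def size_ratio(z1, z2, slope=-1.05):
    """Ratio R_eff(z2) / R_eff(z1) at fixed stellar mass, assuming R_eff scales as (1+z)**slope."""
    return ((1.0 + z2) / (1.0 + z1)) ** slope


# Central slope from the abstract: -1.05.
ratio = size_ratio(1.0, 3.0)

# Crude bracket from the quoted slope uncertainty of 0.37 (illustrative, not a full error analysis).
ratio_steep = size_ratio(1.0, 3.0, slope=-1.05 - 0.37)
ratio_shallow = size_ratio(1.0, 3.0, slope=-1.05 + 0.37)

print(f"R_eff(z=3) / R_eff(z=1) = {ratio:.2f} "
      f"(range {ratio_steep:.2f} to {ratio_shallow:.2f})")
```

Under these assumptions, a galaxy of a given stellar mass at z = 3 is about half the size of its z = 1 counterpart, with the slope uncertainty allowing roughly 0.37 to 0.62.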