skip to main content

Title: Galaxy Zoo: Clump Scout – Design and first application of a two-dimensional aggregation tool for citizen science

Galaxy Zoo: Clump Scout  is a web-based citizen science project designed to identify and spatially locate giant star forming clumps in galaxies that were imaged by the Sloan Digital Sky Survey Legacy Survey. We present a statistically driven software framework that is designed to aggregate two-dimensional annotations of clump locations provided by multiple independent Galaxy Zoo: Clump Scout volunteers and generate a consensus label that identifies the locations of probable clumps within each galaxy. The statistical model our framework is based on allows us to assign false-positive probabilities to each of the clumps we identify, to estimate the skill levels of each of the volunteers who contribute to Galaxy Zoo: Clump Scout and also to quantitatively assess the reliability of the consensus labels that are derived for each subject. We apply our framework to a data set containing 3561 454 two-dimensional points, which constitute 1739 259 annotations of 85 286 distinct subjects provided by 20 999 volunteers. Using this data set, we identify 128 100 potential clumps distributed among 44 126 galaxies. This data set can be used to study the prevalence and demographics of giant star forming clumps in low-redshift galaxies. The code for our aggregation software framework is publicly available at:

more » « less
Award ID(s):
2006894 2006400
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Monthly Notices of the Royal Astronomical Society
Page Range / eLocation ID:
p. 5882-5911
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Massive, star-forming clumps are a common feature of high-redshift star-forming galaxies. How they formed, and why they are so rare at low redshift, remains unclear. In this paper we identify the largest sample yet of clumpy galaxies (7050) at low redshift using data from the citizen science project Galaxy Zoo: Clump Scout, in which volunteers classified 58,550 Sloan Digital Sky Survey (SDSS) galaxies spanning redshift 0.02 <z< 0.15. We apply a robust completeness correction by comparing with simulated clumps identified by the same method. Requiring that the ratio of clump to galaxy flux in the SDSSuband be greater than 8% (similar to clump definitions used by other works), we estimate the fraction of local star-forming galaxies hosting at least one clump (fclumpy) to be3.220.34+0.38%. We also compute the same fraction with a less stringent relative flux cut of 3% (12.680.88+1.38%), as the higher number count and lower statistical noise of this fraction permit finer comparison with future low-redshift clumpy galaxy studies. Our results reveal a sharp decline infclumpyover 0 <z< 0.5. The minor merger rate remains roughly constant over the same span, so we suggest that minor mergers are unlikely to be the primary driver of clump formation. Instead, the rate of galaxy turbulence is a better tracer forfclumpyover 0 <z< 1.5 for galaxies of all masses, which supports the idea that clump formation is primarily driven by violent disk instability for all galaxy populations during this period.

    more » « less
  2. Abstract Giant, star-forming clumps are a common feature prevalent among high-redshift star-forming galaxies and play a critical role in shaping their chaotic morphologies and yet, their nature and role in galaxy evolution remains to be fully understood. A majority of the effort to study clumps has been focused at high redshifts, and local clump studies have often suffered from small sample sizes. In this work, we present an analysis of clump properties in the local universe, and for the first time, performed with a statistically significant sample. With the help of the citizen science-powered Galaxy Zoo: Hubble project, we select a sample of 92 z < 0.06 clumpy galaxies in Sloan Digital Sky Survey Stripe 82 galaxies. Within this sample, we identify 543 clumps using a contrast-based image analysis algorithm and perform photometry as well as estimate their stellar population properties. The overall properties of our z < 0.06 clump sample are comparable to the high-redshift clumps. However, contrary to the high-redshift studies, we find no evidence of a gradient in clump ages or masses as a function of their galactocentric distances. Our results challenge the inward migration scenario for clump evolution for the local universe, potentially suggesting a larger contribution of ex situ clumps and/or longer clump migration timescales. 
    more » « less
  3. ABSTRACT We present Galaxy Zoo DECaLS: detailed visual morphological classifications for Dark Energy Camera Legacy Survey images of galaxies within the SDSS DR8 footprint. Deeper DECaLS images (r = 23.6 versus r = 22.2 from SDSS) reveal spiral arms, weak bars, and tidal features not previously visible in SDSS imaging. To best exploit the greater depth of DECaLS images, volunteers select from a new set of answers designed to improve our sensitivity to mergers and bars. Galaxy Zoo volunteers provide 7.5 million individual classifications over 314 000 galaxies. 140 000 galaxies receive at least 30 classifications, sufficient to accurately measure detailed morphology like bars, and the remainder receive approximately 5. All classifications are used to train an ensemble of Bayesian convolutional neural networks (a state-of-the-art deep learning method) to predict posteriors for the detailed morphology of all 314 000 galaxies. We use active learning to focus our volunteer effort on the galaxies which, if labelled, would be most informative for training our ensemble. When measured against confident volunteer classifications, the trained networks are approximately 99 per cent accurate on every question. Morphology is a fundamental feature of every galaxy; our human and machine classifications are an accurate and detailed resource for understanding how galaxies evolve. 
    more » « less

    The formation mechanism of the most massive stars is far from completely understood. It is still unclear if the formation is core-fed or clump-fed, i.e. if the process is an extension of what happens in low-mass stars, or if the process is more dynamical such as a continuous, multiscale accretion from the gas at parsec (or even larger) scales. In this context, we introduce the SQUALO project, an ALMA 1.3 and 3 mm survey designed to investigate the properties of 13 massive clumps selected at various evolutionary stages, with the common feature that they all show evidence for accretion at the clump scale. In this work, we present the results obtained from the 1.3 mm continuum data. Our observations identify 55 objects with masses in the range 0.4 ≤ M ≤ 309 M⊙, with evidence that the youngest clumps already present some degree of fragmentation. The data show that physical properties such as mass and surface density of the fragments and their parent clumps are tightly correlated. The minimum distance between fragments decreases with evolution, suggesting a dynamical scenario in which massive clumps first fragment under the influence of non-thermal motions driven by the competition between turbulence and gravity. With time gravitational collapse takes over and the fragments organize themselves into more thermally supported objects while continuing to accrete from their parent clump. Finally, one source does not fragment, suggesting that the support of other mechanisms (such as magnetic fields) is crucial only in specific star-forming regions.

    more » « less
  5. ABSTRACT We examine the nature of kpc-scale clumps seen in high-redshift galaxies using a suite of cosmological simulations of galaxy formation. We identify rest-frame UV clumps in mock HST images smoothed to 500 pc resolution, and compare them with the intrinsic 3D clumps of young stars identified in the simulations with 100 pc resolution. According to this comparison for the progenitors of Milky Way-sized galaxies probed by our simulations, we expect that the stellar masses of the observed clumps are overestimated by as much as an order of magnitude, and that the sizes of these clumps are also overestimated by factor of several, due to a combination of spatial resolution and projection. The masses of young stars contributing most of the UV emission can also be overestimated by factor of a few. We find that most clumps of young stars present in a simulation at one time dissolve on a timescale shorter than ∼150 Myr. Some clumps with dense cores can last longer but eventually disperse. Most of the clumps are not bound structures, with virial parameter αvir > 1. We find similar results for clumps identified in mock maps of H α emission measure. We examine the predictions for effective clump sizes from the linear theory of gravitational perturbations and conclude that they are inconsistent with being formed by global disc instabilities. Instead, the observed clumps represent random projections of multiple compact star-forming regions. 
    more » « less