skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: FLEET: A Redshift-agnostic Machine Learning Pipeline to Rapidly Identify Hydrogen-poor Superluminous Supernovae
Over the past decade wide-field optical time-domain surveys have increased the discovery rate of transients to the point that ≲10% are being spectroscopically classified. Despite this, these surveys have enabled the discovery of new and rare types of transients, most notably the class of hydrogen-poor superluminous supernovae (SLSN-I), with about 150 events confirmed to date. Here we present a machine-learning classification algorithm targeted at rapid identification of a pure sample of SLSN-I to enable spectroscopic and multiwavelength follow-up. This algorithm is part of the Finding Luminous and Exotic Extragalactic Transients (FLEET) observational strategy. It utilizes both light-curve and contextual information, but without the need for a redshift, to assign each newly discovered transient a probability of being a SLSN-I. This classifier can achieve a maximum purity of about 85% (with 20% completeness) when observing a selection of SLSN-I candidates. Additionally, we present two alternative classifiers that use either redshifts or complete light curves and can achieve an even higher purity and completeness. At the current discovery rate, the FLEET algorithm can provide about 20 SLSN-I candidates per year for spectroscopic follow-up with 85% purity; with the Legacy Survey of Space and Time we anticipate this will rise to more than $$\sim {10}^{3}$$ events per year.  more » « less
Award ID(s):
2108676 2433718
PAR ID:
10468104
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
AAS
Date Published:
Journal Name:
The Astrophysical Journal
Volume:
904
Issue:
1
ISSN:
1538-4357
Page Range / eLocation ID:
74
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract In 2019 November, we began operating Finding Luminous and Exotic Extragalactic Transients (FLEET), a machine-learning algorithm designed to photometrically identify Type I superluminous supernovae (SLSNe) in transient alert streams. Through this observational campaign, we spectroscopically classified 21 of the 50 SLSNe identified worldwide between 2019 November and 2022 January. Based on our original algorithm, we anticipated that FLEET would achieve a purity of about 50% for transients with a probability of being an SLSN,P(SLSN-I) > 0.5; the true on-sky purity we obtained is closer to 80%. Similarly, we anticipated FLEET could reach a completeness of about 30%, and we indeed measure an upper limit on the completeness of ≲33%. Here we present FLEET 2.0, an updated version of FLEET trained on 4780 transients (almost three times more than FLEET 1.0). FLEET 2.0 has a similar predicted purity to FLEET 1.0 but outperforms FLEET 1.0 in terms of completeness, which is now closer to ≈40% for transients withP(SLSN-I) > 0.5. Additionally, we explore the possible systematics that might arise from the use of FLEET for target selection. We find that the population of SLSNe recovered by FLEET is mostly indistinguishable from the overall SLSN population in terms of physical and most observational parameters. We provide FLEET as an open source package on GitHub: https://github.com/gmzsebastian/FLEET. 
    more » « less
  2. Abstract We present an expansion of FLEET, a machine-learning algorithm optimized to select transients that are most likely tidal disruption events (TDEs). FLEET is based on a random forest algorithm trained on both the light curves and host galaxy information of 4779 spectroscopically classified transients. We find that for transients with a probability of being a TDE,P(TDE) > 0.5, we can successfully recover TDEs with ≈40% completeness and ≈30% purity when using their first 20 days of photometry or a similar completeness and ≈50% purity when including 40 days of photometry, an improvement of almost 2 orders of magnitude compared to random selection. Alternatively, we can recover TDEs with a maximum purity of ≈80% and a completeness of ≈30% when considering only transients withP(TDE) > 0.8. We explore the use of FLEET for future time-domain surveys such as the Legacy Survey of Space and Time on the Vera C. Rubin Observatory (Rubin) and the Nancy Grace Roman Space Telescope (Roman). We estimate that ∼104well-observed TDEs could be discovered every year by Rubin and ∼200 TDEs by Roman. Finally, we run FLEET on the TDEs from our Rubin survey simulation and find that we can recover ∼30% of them at redshiftz< 0.5 withP(TDE) > 0.5, or ∼3000 TDEs yr–1that FLEET could uncover from the Rubin stream. We have demonstrated that we will be able to run FLEET on Rubin photometry as soon as this survey begins. FLEET is provided as an open source package on GitHub: https://github.com/gmzsebastian/FLEET. 
    more » « less
  3. ABSTRACT We present two catalogues of active galactic nucleus (AGN) candidates selected from the latest data of two all-sky surveys – Data Release 2 of the Gaia mission and the unWISE catalogue of the Wide-field Infrared Survey Explorer (WISE). We train a random forest classifier to predict the probability of each source in the Gaia–unWISE joint sample being an AGN, PRF, based on Gaia astrometric and photometric measurements and unWISE photometry. The two catalogues, which we designate C75 and R85, are constructed by applying different PRF threshold cuts to achieve an overall completeness of 75 per cent (≈90 per cent at GaiaG ≤ 20 mag) and reliability of 85 per cent, respectively. The C75 (R85) catalogue contains 2734 464 (2182 193) AGN candidates across the effective 36 000 deg2 sky, of which ≈0.91 (0.52) million are new discoveries. Photometric redshifts of the AGN candidates are derived by a random forest regressor using Gaia and WISE magnitudes and colours. The estimated overall photometric redshift accuracy is 0.11. Cross-matching the AGN candidates with a sample of known bright cluster galaxies, we identify a high-probability strongly lensed AGN candidate system, SDSS J1326+4806, with a large image separation of 21$${^{\prime\prime}_{.}}$$06. All the AGN candidates in our catalogues will have ∼5-yr long light curves from Gaia by the end of the mission, and thus will be a great resource for AGN variability studies. Our AGN catalogues will also be helpful in AGN target selections for future spectroscopic surveys, especially those in the Southern hemisphere. The C75 catalogue can be downloaded at https://www.ast.cam.ac.uk/~ypshu/AGN_Catalogues.html. 
    more » « less
  4. Abstract We present the Young Supernova Experiment Data Release 1 (YSE DR1), comprised of processed multicolor PanSTARRS1grizand Zwicky Transient Facility (ZTF)grphotometry of 1975 transients with host–galaxy associations, redshifts, spectroscopic and/or photometric classifications, and additional data products from 2019 November 24 to 2021 December 20. YSE DR1 spans discoveries and observations from young and fast-rising supernovae (SNe) to transients that persist for over a year, with a redshift distribution reachingz≈ 0.5. We present relative SN rates from YSE’s magnitude- and volume-limited surveys, which are consistent with previously published values within estimated uncertainties for untargeted surveys. We combine YSE and ZTF data, and create multisurvey SN simulations to train the ParSNIP and SuperRAENN photometric classification algorithms; when validating our ParSNIP classifier on 472 spectroscopically classified YSE DR1 SNe, we achieve 82% accuracy across three SN classes (SNe Ia, II, Ib/Ic) and 90% accuracy across two SN classes (SNe Ia, core-collapse SNe). Our classifier performs particularly well on SNe Ia, with high (>90%) individual completeness and purity, which will help build an anchor photometric SNe Ia sample for cosmology. We then use our photometric classifier to characterize our photometric sample of 1483 SNe, labeling 1048 (∼71%) SNe Ia, 339 (∼23%) SNe II, and 96 (∼6%) SNe Ib/Ic. YSE DR1 provides a training ground for building discovery, anomaly detection, and classification algorithms, performing cosmological analyses, understanding the nature of red and rare transients, exploring tidal disruption events and nuclear variability, and preparing for the forthcoming Vera C. Rubin Observatory Legacy Survey of Space and Time. 
    more » « less
  5. Abstract We present Lightcurve Anomaly Identification and Similarity Search (LAISS), an automated pipeline to detect anomalous astrophysical transients in real-time data streams. We deploy our anomaly detection model on the nightly Zwicky Transient Facility (ZTF) Alert Stream via the ANTARES broker, identifying a manageable ∼1–5 candidates per night for expert vetting and coordinating follow-up observations. Our method leverages statistical light-curve and contextual host galaxy features within a random forest classifier, tagging transients of rare classes (spectroscopicanomalies), of uncommon host galaxy environments (contextualanomalies), and of peculiar or interaction-powered phenomena (behavioralanomalies). Moreover, we demonstrate the power of a low-latency (∼ms) approximate similarity search method to find transient analogs with similar light-curve evolution and host galaxy environments. We use analogs for data-driven discovery, characterization, (re)classification, and imputation in retrospective and real-time searches. To date, we have identified ∼50 previously known and previously missed rare transients from real-time and retrospective searches, including but not limited to superluminous supernovae (SLSNe), tidal disruption events, SNe IIn, SNe IIb, SNe I-CSM, SNe Ia-91bg-like, SNe Ib, SNe Ic, SNe Ic-BL, and M31 novae. Lastly, we report the discovery of 325 total transients, all observed between 2018 and 2021 and absent from public catalogs (∼1% of all ZTF Astronomical Transient reports to the Transient Name Server through 2021). These methods enable a systematic approach to finding the “needle in the haystack” in large-volume data streams. Because of its integration with the ANTARES broker,LAISSis built to detect exciting transients in Rubin data. 
    more » « less