null
(Ed.)
Abstract. We trained two Random Forest (RF) machine learning models for cloud mask andcloud thermodynamic-phase detection using spectral observations from Visible InfraredImaging Radiometer Suite (VIIRS)on board Suomi National Polar-orbiting Partnership (SNPP). Observations from Cloud-Aerosol Lidarwith Orthogonal Polarization (CALIOP) were carefully selected toprovide reference labels. The two RF models were trained for all-day anddaytime-only conditions using a 4-year collocated VIIRS and CALIOP dataset from2013 to 2016. Due to the orbit difference, the collocated CALIOP and SNPPVIIRS training samples cover a broad-viewing zenith angle range, which is agreat benefit to overall model performance. The all-day model uses three VIIRSinfrared (IR) bands (8.6, 11, and 12 µm), and the daytime model uses fiveNear-IR (NIR) and Shortwave-IR (SWIR) bands (0.86, 1.24, 1.38, 1.64, and 2.25 µm) together with the three IR bands to detect clear, liquid water, and icecloud pixels. Up to seven surface types, i.e., ocean water, forest, cropland,grassland, snow and ice, barren desert, and shrubland, were consideredseparately to enhance performance for both models. Detection of cloudypixels and thermodynamic phase with the two RF models was compared againstcollocated CALIOP products from 2017. It is shown that, when using a conservativescreening process that excludes the most challenging cloudy pixels forpassive remote sensing, the two RF models have high accuracy rates incomparison to the CALIOP reference for both cloud detection andthermodynamic phase. Other existing SNPP VIIRS and Aqua MODIS cloud mask andphase products are also evaluated, with results showing that the two RFmodels and the MODIS MYD06 optical property phase product are the top threealgorithms with respect to lidar observations during the daytime. During thenighttime, the RF all-day model works best for both cloud detection andphase, particularly for pixels over snow and ice surfaces. The present RFmodels can be extended to other similar passive instruments if trainingsamples can be collected from CALIOP or other lidars. However, the qualityof reference labels and potential sampling issues that may impact modelperformance would need further attention.
more »
« less
An official website of the United States government

