This content will become publicly available on December 13, 2025

Title: AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery
Clouds in satellite imagery pose a significant challenge for downstream applications. A major obstacle in current cloud removal research is the absence of a comprehensive benchmark and a sufficiently large and diverse training dataset. To address this problem, we introduce AllClear, the largest public dataset for cloud removal, featuring 23,742 globally distributed regions of interest (ROIs) with diverse land-use patterns and comprising 4 million images in total. Each ROI includes complete temporal captures from the year 2022, with (1) multi-spectral optical imagery from Sentinel-2 and Landsat 8/9, (2) synthetic aperture radar (SAR) imagery from Sentinel-1, and (3) auxiliary remote sensing products such as cloud masks and land cover maps. We validate the effectiveness of our dataset by benchmarking performance, demonstrating a scaling law (PSNR rises from 28.47 to 33.87 with 30× more data), and conducting ablation studies on temporal length and the importance of individual modalities. This dataset aims to provide comprehensive coverage of the Earth's surface and promote better cloud removal results.
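The benchmark above reports reconstruction quality as PSNR. As a point of reference, here is a minimal sketch of how PSNR between a predicted cloud-free image and a clear target might be computed; the function and the synthetic arrays are illustrative and are not taken from the AllClear code.

```python
import numpy as np

def psnr(prediction: np.ndarray, reference: np.ndarray, max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio (dB) between a predicted cloud-free image and
    a clear reference, both scaled to [0, max_value]."""
    mse = np.mean((prediction.astype(np.float64) - reference.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)

# Illustrative 4-band 256x256 chip with reflectance scaled to [0, 1]
rng = np.random.default_rng(0)
reference = rng.random((4, 256, 256))
prediction = np.clip(reference + rng.normal(0.0, 0.02, reference.shape), 0.0, 1.0)
print(f"PSNR: {psnr(prediction, reference):.2f} dB")
```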
Award ID(s):
2144117
PAR ID:
10566036
Author(s) / Creator(s):
Publisher / Repository:
NeurIPS 2024
Date Published:
Format(s):
Medium: X
Location:
Vancouver
Sponsoring Org:
National Science Foundation
More Like this
  1. Mapping crop types and land cover in smallholder farming systems in sub-Saharan Africa remains a challenge due to data costs, high cloud cover, and poor temporal resolution of satellite data. With improvement in satellite technology and image processing techniques, there is a potential for integrating data from sensors with different spectral characteristics and temporal resolutions to effectively map crop types and land cover. In our Malawi study area, it is common that there are no cloud-free images available for the entire crop growth season. The goal of this experiment is to produce detailed crop type and land cover maps in agricultural landscapes using the Sentinel-1 (S-1) radar data, Sentinel-2 (S-2) optical data, S-2 and PlanetScope data fusion, and S-1 C2 matrix and S-1 H/α polarimetric decomposition. We evaluated the ability to combine these data to map crop types and land cover in two smallholder farming locations. The random forest algorithm, trained with crop and land cover type data collected in the field, complemented with samples digitized from Google Earth Pro and DigitalGlobe, was used for the classification experiments. The results show that the S-2 and PlanetScope fused image + S-1 covariance (C2) matrix + H/α polarimetric decomposition (an entropy-based decomposition method) fusion outperformed all other image combinations, producing higher overall accuracies (OAs) (>85%) and Kappa coefficients (>0.80). These OAs represent a 13.53% and 11.7% improvement on the Sentinel-2-only (OAs < 80%) experiment for Thimalala and Edundu, respectively. The experiment also provided accurate insights into the distribution of crop and land cover types in the area. The findings suggest that in cloud-dense and resource-poor locations, fusing high temporal resolution radar data with available optical data presents an opportunity for operational mapping of crop types and land cover to support food security and environmental management decision-making. 
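As a rough illustration of the classification step described in item 1, the sketch below trains scikit-learn's RandomForestClassifier on a per-pixel feature stack and reports overall accuracy and the Kappa coefficient. The feature columns, class count, and random data are assumptions for the example only; the study's actual fusion and sampling workflow is not reproduced.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, cohen_kappa_score
from sklearn.model_selection import train_test_split

# Hypothetical per-pixel feature stack: fused S-2/PlanetScope bands plus
# S-1 covariance (C2) and H/alpha decomposition features (columns are assumed).
rng = np.random.default_rng(42)
X = rng.random((5000, 12))        # 5000 labeled pixels, 12 stacked features
y = rng.integers(0, 6, 5000)      # 6 illustrative crop / land cover classes

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

clf = RandomForestClassifier(n_estimators=500, random_state=0)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)

print(f"Overall accuracy: {accuracy_score(y_test, y_pred):.3f}")
print(f"Kappa coefficient: {cohen_kappa_score(y_test, y_pred):.3f}")
```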
  2. Grassland monitoring can be challenging because it is time-consuming and expensive to measure grass condition at large spatial scales. Remote sensing offers a time- and cost-effective method for mapping and monitoring grassland condition at both large spatial extents and fine temporal resolutions. Combinations of remotely sensed optical and radar imagery are particularly promising because together they can measure differences in moisture, structure, and reflectance among land cover types. We combined multi-date radar (PALSAR-2 and Sentinel-1) and optical (Sentinel-2) imagery with field data and visual interpretation of aerial imagery to classify land cover in the Masai Mara National Reserve, Kenya using machine learning (Random Forests). This study area comprises a diverse array of land cover types and changes over time due to seasonal changes in precipitation, seasonal movements of large herds of resident and migratory ungulates, fires, and livestock grazing. We classified twelve land cover types with user’s and producer’s accuracies ranging from 66%–100% and an overall accuracy of 86%. These methods were able to distinguish among short, medium, and tall grass cover at user’s accuracies of 83%, 82%, and 85%, respectively. By yielding a highly accurate, fine-resolution map that distinguishes among grasses of different heights, this work not only outlines a viable method for future grassland mapping efforts but also will help inform local management decisions and research in the Masai Mara National Reserve. 
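Item 2 reports per-class user's and producer's accuracies, which are read directly off the confusion matrix between reference and mapped labels. A small sketch of that bookkeeping follows; the labels are placeholders, not the Masai Mara classification.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Placeholder reference (ground truth) and mapped labels for three classes
y_true = np.array([0, 0, 1, 1, 1, 2, 2, 2, 2, 0])   # e.g. short/medium/tall grass
y_pred = np.array([0, 1, 1, 1, 2, 2, 2, 2, 1, 0])

cm = confusion_matrix(y_true, y_pred)        # rows: reference, columns: mapped
producers = np.diag(cm) / cm.sum(axis=1)     # correct / reference total (omission view)
users = np.diag(cm) / cm.sum(axis=0)         # correct / mapped total (commission view)
overall = np.diag(cm).sum() / cm.sum()

for k, (p, u) in enumerate(zip(producers, users)):
    print(f"class {k}: producer's {p:.2%}, user's {u:.2%}")
print(f"overall accuracy: {overall:.2%}")
```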
  3. Coastal mangrove forests provide important ecosystem goods and services, including carbon sequestration, biodiversity conservation, and hazard mitigation. However, they are being destroyed at an alarming rate by human activities. To characterize mangrove forest changes, evaluate their impacts, and support relevant protection and restoration decision making, accurate and up-to-date mangrove extent mapping at large spatial scales is essential. Available large-scale mangrove extent data products use a single machine learning method commonly with 30 m Landsat imagery, and significant inconsistencies remain among these data products. With huge amounts of satellite data involved and the heterogeneity of land surface characteristics across large geographic areas, finding the most suitable method for large-scale high-resolution mangrove mapping is a challenge. The objective of this study is to evaluate the performance of a machine learning ensemble for mangrove forest mapping at 20 m spatial resolution across West Africa using Sentinel-2 (optical) and Sentinel-1 (radar) imagery. The machine learning ensemble integrates three commonly used machine learning methods in land cover and land use mapping, including Random Forest (RF), Gradient Boosting Machine (GBM), and Neural Network (NN). The cloud-based big geospatial data processing platform Google Earth Engine (GEE) was used for pre-processing Sentinel-2 and Sentinel-1 data. Extensive validation has demonstrated that the machine learning ensemble can generate mangrove extent maps at high accuracies for all study regions in West Africa (92%–99% Producer’s Accuracy, 98%–100% User’s Accuracy, 95%–99% Overall Accuracy). This is the first time that mangrove extent has been mapped at a 20 m spatial resolution across West Africa. The machine learning ensemble has the potential to be applied to other regions of the world and is therefore capable of producing high-resolution mangrove extent maps at global scales periodically.
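A compact sketch of combining the three learners named in item 3 (RF, GBM, NN) with a soft-voting scikit-learn ensemble is given below. The feature stack, class labels, and hyperparameters are assumptions for illustration; the study itself pre-processed Sentinel-1/2 data on Google Earth Engine rather than with this snippet.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier, VotingClassifier
from sklearn.neural_network import MLPClassifier

# Placeholder per-pixel features from Sentinel-2 (optical) and Sentinel-1 (radar)
rng = np.random.default_rng(7)
X = rng.random((2000, 10))
y = rng.integers(0, 2, 2000)             # 1 = mangrove, 0 = non-mangrove (illustrative)

ensemble = VotingClassifier(
    estimators=[
        ("rf", RandomForestClassifier(n_estimators=300, random_state=0)),
        ("gbm", GradientBoostingClassifier(random_state=0)),
        ("nn", MLPClassifier(hidden_layer_sizes=(32, 16), max_iter=500, random_state=0)),
    ],
    voting="soft",                        # average the class probabilities of the three models
)
ensemble.fit(X, y)
print(ensemble.predict(X[:5]))
```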
  4. This dataset describes measurements of inter-annual to sub-seasonal riverbank erosion rates on the Koyukuk River, Alaska, over the period 2016-2022. The data are used in the paper: “Geyman, E., Douglas, M., Avouac, J.-P. and Lamb, M. Permafrost slows Arctic riverbank erosion, in review (2024).” The dataset contains two sets of measurements: (1) riverbank displacement estimated from Sentinel-2 optical satellite imagery (10 meter (m) resolution) over the period 30-Aug-2016 to 13-Jul-2022, and (2) riverbank displacement estimated from Planet optical satellite imagery (3 m resolution) over the period 31-Aug-2016 to 01-Oct-2022. The first dataset is based on comparison of Sentinel-2 satellite acquisitions from the start and end of the study interval. The second dataset analyzes 65 PlanetScope image mosaics (for an average of 9 observations per year). The Matlab code used to analyze the Sentinel-2 and PlanetScope imagery, as well as to process the sub-seasonal displacement estimates, is included in the file “Code.zip”. 
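The code released with the dataset in item 4 is MATLAB; as a language-neutral illustration of the underlying arithmetic, the sketch below converts a bank displacement measured between the two Sentinel-2 acquisition dates quoted above into an average annual erosion rate. The displacement value is a made-up placeholder.

```python
from datetime import date

# Sentinel-2 comparison interval from the dataset description
start, end = date(2016, 8, 30), date(2022, 7, 13)
years = (end - start).days / 365.25

bank_displacement_m = 42.0   # placeholder riverbank retreat measured between the two images
rate_m_per_yr = bank_displacement_m / years
print(f"{rate_m_per_yr:.2f} m/yr over {years:.2f} years")
```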
  5. Land surface phenology (LSP) products currently have large uncertainties due to cloud contamination and other impacts in temporal satellite observations, and they have been poorly validated because of the lack of spatially comparable ground measurements. This study provided a reference dataset of gap-free time series and phenological dates by fusing the Harmonized Landsat 8 and Sentinel-2 (HLS) observations with near-surface PhenoCam time series for 78 regions of 10 × 10 km² across ecosystems in North America during 2019 and 2020. The HLS-PhenoCam LSP (HP-LSP) reference dataset at 30 m pixels is composed of: (1) 3-day synthetic gap-free EVI2 (two-band Enhanced Vegetation Index) time series that are physically meaningful to monitor the vegetation development across heterogeneous levels, train models (e.g., machine learning) for land surface mapping, and extract phenometrics from various methods; and (2) four key phenological dates (accuracy ≤5 days) that are spatially continuous and scalable, which are applicable to validate various satellite-based phenology products (e.g., global MODIS/VIIRS LSP), develop phenological models, and analyze climate impacts on terrestrial ecosystems.
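EVI2, the index distributed in item 5, needs only the red and near-infrared bands. The sketch below applies the standard two-band formula to placeholder reflectances; it is not the HP-LSP processing code.

```python
import numpy as np

def evi2(nir: np.ndarray, red: np.ndarray) -> np.ndarray:
    """Two-band Enhanced Vegetation Index: 2.5 * (NIR - Red) / (NIR + 2.4 * Red + 1)."""
    return 2.5 * (nir - red) / (nir + 2.4 * red + 1.0)

# Placeholder surface reflectances (0-1) for one 30 m pixel across a growing season
red = np.array([0.06, 0.05, 0.04, 0.05, 0.07])
nir = np.array([0.20, 0.30, 0.45, 0.35, 0.22])
print(np.round(evi2(nir, red), 3))
```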