skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Generalized deep learning model for photovoltaic module segmentation from satellite and aerial imagery
As solar photovoltaic (PV) has emerged as a dominant player in the energy market, there has been an exponential surge in solar deployment and investment within this sector. With the rapid growth of solar energy adoption, accurate and efficient detection of PV panels has become crucial for effective solar energy mapping and planning. This paper presents the application of the Mask2Former model for segmenting PV panels from a diverse, multi-resolution dataset of satellite and aerial imagery. Our primary objective is to harness Mask2Former’s deep learning capabilities to achieve precise segmentation of PV panels in real-world scenarios. We fine-tune the pre-existing Mask2Former model on a carefully curated multi-resolution dataset and a crowdsourced dataset of satellite and aerial images, showcasing its superiority over other deep learning models like U-Net and DeepLabv3+. Most notably, Mask2Former establishes a new state-of-the-art in semantic segmentation by achieving over 95% IoU scores. Our research contributes significantly to the advancement solar energy mapping and sets a benchmark for future studies in this field.  more » « less
Award ID(s):
2050731
PAR ID:
10539735
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Elsevier
Date Published:
Journal Name:
Solar Energy
Volume:
274
Issue:
C
ISSN:
0038-092X
Page Range / eLocation ID:
112539
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. State-of-the-art deep learning technology has been successfully applied to relatively small selected areas of very high spatial resolution (0.15 and 0.25 m) optical aerial imagery acquired by a fixed-wing aircraft to automatically characterize ice-wedge polygons (IWPs) in the Arctic tundra. However, any mapping of IWPs at regional to continental scales requires images acquired on different sensor platforms (particularly satellite) and a refined understanding of the performance stability of the method across sensor platforms through reliable evaluation assessments. In this study, we examined the transferability of a deep learning Mask Region-Based Convolutional Neural Network (R-CNN) model for mapping IWPs in satellite remote sensing imagery (~0.5 m) covering 272 km2 and unmanned aerial vehicle (UAV) (0.02 m) imagery covering 0.32 km2. Multi-spectral images were obtained from the WorldView-2 satellite sensor and pan-sharpened to ~0.5 m, and a 20 mp CMOS sensor camera onboard a UAV, respectively. The training dataset included 25,489 and 6022 manually delineated IWPs from satellite and fixed-wing aircraft aerial imagery near the Arctic Coastal Plain, northern Alaska. Quantitative assessments showed that individual IWPs were correctly detected at up to 72% and 70%, and delineated at up to 73% and 68% F1 score accuracy levels for satellite and UAV images, respectively. Expert-based qualitative assessments showed that IWPs were correctly detected at good (40–60%) and excellent (80–100%) accuracy levels for satellite and UAV images, respectively, and delineated at excellent (80–100%) level for both images. We found that (1) regardless of spatial resolution and spectral bands, the deep learning Mask R-CNN model effectively mapped IWPs in both remote sensing satellite and UAV images; (2) the model achieved a better accuracy in detection with finer image resolution, such as UAV imagery, yet a better accuracy in delineation with coarser image resolution, such as satellite imagery; (3) increasing the number of training data with different resolutions between the training and actual application imagery does not necessarily result in better performance of the Mask R-CNN in IWPs mapping; (4) and overall, the model underestimates the total number of IWPs particularly in terms of disjoint/incomplete IWPs. 
    more » « less
  2. Recently, there has been a growing interest in automatically collecting distributed solar photovoltaic (PV) installation information in smart grid systems, including the quantity and locations of solar PV deployments, as well as their profiling information across a given geospatial region. Most recent approaches are still suffering low detection accuracy due to insufficient sample and principal feature learning when building their models and also separation of rooftop object segmentation and identification during their detection processes. In addition, they cannot report accurate multi-deployment results. To address these problems, we design a new system-SolarDetector+, which can automatically and accurately detect and profile distributed solar PV arrays without any extra cost. In essence, SolarDetector+first leverages multiple data augmentation techniques, including Cycle-Consistent Adversarial Networks, Latent Diffusion Models, and Generative Adversarial networks, to build a large rooftop satellite imagery dataset (RSID). Then, SolarDetector+employs Mask R-convolutional neural networks algorithm to accurately identify rooftop solar PV arrays and learn the detailed installation information for each solar PV array simultaneously. We find that pre-trained SolarDetector+yields an average Matthews correlation coefficient of 0.862 to detect solar PV arrays over RSID, which is ∼50% better than the most recent open source detection system—SolarFinder. 
    more » « less
  3. Among major energy conversion methods, photovoltaic (PV) solar cells have been the most popular and widely employed for a variety of applications. Although a PV solar panel has been shown as one of the most efficient green energy sources, its 2D surface solar light harvesting has reached great limitations as it requires large surface areas. There is, therefore, an increasing need to seek solar harvest in a three-dimensional fashion for enhanced energy density. In addition to a conventional 2D solar panel in the x-y area, we extend another dimension of solar harvesting in the z-axis through multiple CdTe solar panels arranged in parallel. The high transparency allows sunlight to partially penetrate multiple solar panels, resulting in significantly increased solar harvesting surface area in a 3D fashion. The advantages of the 3D multi-panel solar harvesting system include: i) enlarged solar light collecting surface area, therefore increased energy density, ii) the total output power from multiple panels can exceed that of the single panel, and iii) significantly reduced surface area needed for densely populated cities. With five CdTe solar panels of different transparencies in parallel, the multilayer system can produce collective output power 233% higher than that of the single solar panel under the same surface area when arranged in descending (i.e., PV panel with the highest transparency on top and lowest at bottom). The PCE of the multi-panel system has also increased 233% in descending order indicating the viability of 3D solar harvesting. The multi-panel system will dimensionally transform solar harvesting from 2D to 3D for more efficient energy generation. 
    more » « less
  4. High resolution mapping of coastal habitats is invaluable for resource inventory, change detection, and inventory of aquaculture applications. However, coastal areas, especially the interior of mangroves, are often difficult to access. An Unmanned Aerial Vehicle (UAV), equipped with a multispectral sensor, affords an opportunity to improve upon satellite imagery for coastal management because of the very high spatial resolution, multispectral capability, and opportunity to collect real-time observations. Despite the recent and rapid development of UAV mapping applications, few articles have quantitatively compared how much improvement there is of UAV multispectral mapping methods compared to more conventional remote sensing data such as satellite imagery. The objective of this paper is to quantitatively demonstrate the improvements of a multispectral UAV mapping technique for higher resolution images used for advanced mapping and assessing coastal land cover. We performed multispectral UAV mapping fieldwork trials over Indian River Lagoon along the central Atlantic coast of Florida. Ground Control Points (GCPs) were collected to generate a rigorous geo-referenced dataset of UAV imagery and support comparison to geo-referenced satellite and aerial imagery. Multi-spectral satellite imagery (Sentinel-2) was also acquired to map land cover for the same region. NDVI and object-oriented classification methods were used for comparison between UAV and satellite mapping capabilities. Compared with aerial images acquired from Florida Department of Environmental Protection, the UAV multi-spectral mapping method used in this study provided advanced information of the physical conditions of the study area, an improved land feature delineation, and a significantly better mapping product than satellite imagery with coarser resolution. The study demonstrates a replicable UAV multi-spectral mapping method useful for study sites that lack high quality data. 
    more » « less
  5. In this article, we propose a deep learning based semantic segmentation model that identifies and segments defects in electroluminescence (EL) images of silicon photovoltaic (PV) cells. The proposed model can differentiate between cracks, contact interruptions, cell interconnect failures, and contact corrosion for both multicrystalline and monocrystalline silicon cells. Our model utilizes a segmentation Deeplabv3 model with a ResNet-50 backbone. It was trained on 17,064 EL images including 256 physically realistic simulated images of PV cells generated to deal with class imbalance. While performing semantic segmentation for five defect classes, this model achieves a weighted F1-score of 0.95, an unweighted F1-score of 0.69, a pixel-level global accuracy of 95.4%, and a mean intersection over union score of 57.3%. In addition, we introduce the UCF EL Defect dataset, a large-scale dataset consisting of 17,064 EL images, which will be publicly available for use by the PV and computer vision research communities. 
    more » « less