skip to main content


Title: Solar-TK: A Data-Driven Toolkit for Solar PV Performance Modeling and Forecasting
Solar energy capacity is continuing to increase. The key challenge with integrating solar into buildings and the electric grid is its high power generation variability, which is a function of many factors, including a site's location, time, weather, and numerous physical attributes. There has been significant prior work on solar performance modeling and forecasting that infers a site's current and future solar generation based on these factors. Accurate solar performance models and forecasts are also a pre-requisite for conducting a wide range of building and grid energy-efficiency research. Unfortunately, much of the prior work is not accessible to researchers, either because it has not been released as open source, is time-consuming to re-implement, or requires access to proprietary data sources. To address the problem, we present Solar-TK, a data-driven toolkit for solar performance modeling and forecasting that is simple, extensible, and publicly accessible. Solar-TK's simple approach models and forecasts a site's solar output given only its location and a small amount of historical generation data. Solar-TK's extensible design includes a small collection of independent modules that connect together to implement basic modeling and forecasting, while also enabling users to implement new energy analytics. We plan to release Solar-TK as open source to enable research that requires realistic solar models and forecasts, and to serve as a baseline for comparing new solar modeling and forecasting techniques. We compare Solar-TK's simple approach with PVlib and show that it yields comparable accuracy. We present three case studies showing how Solar-TK can advance energy-efficiency research.  more » « less
Award ID(s):
1645952
NSF-PAR ID:
10163745
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
IEEE 16th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)
Page Range / eLocation ID:
456 to 466
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Variation in solar irradiance causes power generation fluctuations in solar power plants. Power grid operators need accurate irradiance forecasts to manage this variability. Many factors affect irradiance, including the time of year, weather and time of day. Cloud cover is one of the most important variables that affects solar power generation, but is also characterized by a high degree of variability and uncertainty. Deep learning methods have the ability to learn long-term dependencies within sequential data. We investigate the application of Gated Recurrent Units (GRU) to forecast solar irradiance and present the results of applying multivariate GRU to forecast hourly solar irradiance in Phoenix, Arizona. We compare and evaluate the performance of GRU against Long Short-Term Memory (LSTM) using strictly historical solar irradiance data as well as the addition of exogenous weather variables and cloud cover data. Based on our results, we found that the addition of exogenous weather variables and cloud cover data in both GRU and LSTM significantly improved forecasting accuracy, performing better than univariate and statistical models. 
    more » « less
  2. Abstract

    Access to accurate, generalizable and scalable solar irradiance prediction is critical for smooth solar-grid integration, especially in the light of the accelerated global adoption of solar energy production. Both physical and statistical prediction models of solar irradiance have been proposed in the literature. Physical models require meteorological forecasts—generated by computationally expensive models—to predict solar irradiance, with limited accuracy in sub-daily predictions. Statistical models leveragein-situmeasurements which require expensive equipment and do not account for meso-scale atmospheric dynamics. We address these fundamental gaps by developing a convolutional global horizontal irradiance prediction model, using convolutional neural networks and publicly accessible satellite cloud images. Our proposed model predicts solar irradiance in 12 different locations in the US for various prediction time horizons. Our model yields up to 24% improvement in an hour-ahead predictions and 26% in a day-ahead predictions compared to a persistence forecast. Moreover, using saliency maps and target-location-focused cropping, we demonstrate the benefits of incorporating meso-scale atmospheric dynamics for prediction performance. Our results are critical for energy systems planners, utility managers and electricity market participants to ensure efficient harvesting of the solar energy and reliable operation of the grid.

     
    more » « less
  3. null (Ed.)
    The falling cost of solar energy deployments has resulted in ever-increasing growth in solar capacity worldwide. The primary challenge posed by increasing grid-tied solar capacity is handling its variability due to continuously changing conditions. Thus, prior work has developed highly sophisticated models to estimate and forecast solar power output based on many characteristics, including location, elevation, time, weather, shading, module type, wiring, etc. These models are highly accurate for estimating solar power, especially over long periods, for sites at low latitudes, i.e., closer to the equator. However, models for sites at higher latitudes are less accurate due to the effect of snow on solar output, since even a small amount of snow can cover panels and reduce power to zero. Improving the accuracy of these models for annual solar output by even 2--3% is significant, as power translates directly into revenue, which compounds over the system's lifetime. Thus, if a site produces 2--3% less power on average per year due to snow than a model predicts, it can mean the difference between a positive or negative return-on-investment. To address the problem, we develop DeepSnow, a data-driven approach that models the effect of snow on solar power generation. DeepSnow integrates with existing solar modeling frameworks, and uses publicly available snow data to learn its effect on solar generation. We leverage deep learning to quantify the effect of different snow variables on solar power using 4 million hourly readings from 40 solar sites. We evaluate our approach on 10 solar sites, and show that it yields a higher accuracy than the current approach for modeling snow effects used by the U.S. Department of Energy's System Advisor Model (SAM), a popular solar modeling framework. 
    more » « less
  4. Abstract Background West Nile virus (WNV) is the leading cause of mosquito-borne illness in the continental USA. WNV occurrence has high spatiotemporal variation, and current approaches to targeted control of the virus are limited, making forecasting a public health priority. However, little research has been done to compare strengths and weaknesses of WNV disease forecasting approaches on the national scale. We used forecasts submitted to the 2020 WNV Forecasting Challenge, an open challenge organized by the Centers for Disease Control and Prevention, to assess the status of WNV neuroinvasive disease (WNND) prediction and identify avenues for improvement. Methods We performed a multi-model comparative assessment of probabilistic forecasts submitted by 15 teams for annual WNND cases in US counties for 2020 and assessed forecast accuracy, calibration, and discriminatory power. In the evaluation, we included forecasts produced by comparison models of varying complexity as benchmarks of forecast performance. We also used regression analysis to identify modeling approaches and contextual factors that were associated with forecast skill. Results Simple models based on historical WNND cases generally scored better than more complex models and combined higher discriminatory power with better calibration of uncertainty. Forecast skill improved across updated forecast submissions submitted during the 2020 season. Among models using additional data, inclusion of climate or human demographic data was associated with higher skill, while inclusion of mosquito or land use data was associated with lower skill. We also identified population size, extreme minimum winter temperature, and interannual variation in WNND cases as county-level characteristics associated with variation in forecast skill. Conclusions Historical WNND cases were strong predictors of future cases with minimal increase in skill achieved by models that included other factors. Although opportunities might exist to specifically improve predictions for areas with large populations and low or high winter temperatures, areas with high case-count variability are intrinsically more difficult to predict. Also, the prediction of outbreaks, which are outliers relative to typical case numbers, remains difficult. Further improvements to prediction could be obtained with improved calibration of forecast uncertainty and access to real-time data streams (e.g. current weather and preliminary human cases). Graphical Abstract 
    more » « less
  5. To make a future run by renewable energy possible, we must design our power system to seamlessly collect, store, and transport the Earth's naturally occurring flows of energy – namely the sun and the wind. Such a future will require that accurate representations of wind and solar resources and their associated variability permeate power systems planning and operational tools. Practically speaking, we must merge weather and power systems modeling. Although many meteorological phenomena that affect wind and solar power production are well-studied in isolation, no coordinated effort has sought to improve medium- and long-term power systems planning using numerical weather prediction (NWP) models. One modern open-source NWP tool – the weather research and forecasting (WRF) model – offers the complexity and flexibility required to integrate weather prediction with a power systems model in any region. However, there are over one million distinct ways to set up WRF. Here, we present a methodology for optimizing the WRF model physics for forecasting wind power density and solar irradiance using a genetic algorithm. The top five setups created by our algorithm outperform all of the recommended setups. Using the simulation results, we train a random forest model to identify which WRF parameters contribute to the lowest forecast errors and produce plots depicting the performance of key physics options to guide energy researchers in quickly setting up an accurate WRF model. 
    more » « less