skip to main content

Title: Predicting Solar Proton Events of Solar Cycles 22–24 Using GOES Proton and Soft-X-Ray Flux Features

Solar energetic particle (SEP) events and their major subclass, solar proton events (SPEs), can have unfavorable consequences on numerous aspects of life and technology, making them one of the most harmful effects of solar activity. Garnering knowledge preceding such events by studying operational data flows is essential for their forecasting. Considering only solar cycle (SC) 24 in our previous study, we found that it may be sufficient to only utilize proton and soft X-ray (SXR) parameters for SPE forecasts. Here, we report a catalog recording ≥10 MeV ≥10 particle flux unit SPEs with their properties, spanning SCs 22–24, using NOAA’s Geostationary Operational Environmental Satellite flux data. We report an additional catalog of daily proton and SXR flux statistics for this period, employing it to test the application of machine learning (ML) on the prediction of SPEs using a support vector machine (SVM) and extreme gradient boosting (XGBoost). We explore the effects of training models with data from oneandtwo SCs, evaluating how transferable a model might be across different time periods. XGBoost proved to be more accurate than SVMs for almost every test considered, while also outperforming operational SWPC NOAA predictions and a persistence forecast. Interestingly, training done with SC 24 produces weaker true skill statistic and Heidke skill scores2, even when paired with SC 22 or SC 23, indicating transferability issues. This work contributes toward validating forecasts using long-spanning data—an understudied area in SEP research that should be considered to verify the cross cycle robustness of ML-driven forecasts.

more » « less
Award ID(s):
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
DOI PREFIX: 10.3847
Date Published:
Journal Name:
The Astrophysical Journal Supplement Series
Medium: X Size: Article No. 15
["Article No. 15"]
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Solar energetic particles (SEPs) are associated with extreme solar events that can cause major damage to space- and ground-based life and infrastructure. High-intensity SEP events, particularly ∼100 MeV SEP events, can pose severe health risks for astronauts owing to radiation exposure and affect Earth’s orbiting satellites (e.g., Landsat and the International Space Station). A major challenge in the SEP event prediction task is the lack of adequate SEP data because of the rarity of these events. In this work, we aim to improve the prediction of ∼30, ∼60, and ∼100 MeV SEP events by synthetically increasing the number of SEP samples. We explore the use of a univariate and multivariate time series of proton flux data as input to machine-learning-based prediction methods, such as time series forest (TSF). Our study covers solar cycles 22, 23, and 24. Our findings show that using data augmentation methods, such as the synthetic minority oversampling technique, remarkably increases the accuracy and F1-score of the classifiers used in this research, especially for TSF, where the average accuracy increased by 20%, reaching around 90% accuracy in the ∼100 MeV SEP prediction task. We also achieved higher prediction accuracy when using the multivariate time series data of the proton flux. Finally, we build a pipeline framework for our best-performing model, TSF, and provide a comprehensive hierarchical classification of the ∼100, ∼60, and ∼30 MeV and non-SEP prediction scenarios.

    more » « less
  2. Abstract

    The flux of energetic particles originating from the Sun fluctuates during the solar cycles. It depends on the number and properties of active regions (ARs) present in a single day and associated solar activities, such as solar flares and coronal mass ejections. Observational records of the Space Weather Prediction Center NOAA enable the creation of time-indexed databases containing information about ARs and particle flux enhancements, most widely known as solar energetic particle (SEP) events. In this work, we utilize the data available for solar cycles 21–24 and the initial phase of cycle 25 to perform a statistical analysis of the correlation between SEPs and properties of ARs inferred from the McIntosh and Hale classifications. We find that the complexity of the magnetic field, longitudinal location, area, and penumbra type of the largest sunspot of ARs are most correlated with the production of SEPs. It is found that most SEPs (≈60%, or 108 out of 181 considered events) were generated from an AR classified with the “k” McIntosh subclass as the second component, and these ARs are more likely to produce SEPs if they fall in a Hale class containing aδcomponent. The resulting database containing information about SEP events and ARs is publicly available and can be used for the development of machine learning models to predict the occurrence of SEPs.

    more » « less
  3. Abstract It is known that the weak state of the heliosphere due to diminished solar activity in cycle 24 backreacted on coronal mass ejections (CMEs) to make them appear wider for a given speed. One of the consequences of the weak state of the heliosphere is that more CMEs appear as halo CMEs (HCMEs), and halos are formed at shorter heliocentric distances. Current predictions for the strength of solar cycle (SC) 25 range from half to twice the strength of SC 24. We compare the HCME occurrence rate and other properties during the rise phase of cycles 23, 24, and 25 to weigh in on the strength of SC 25. We find that HCME and solar wind properties in SC 25 are intermediate between SCs 23 and 24, but closer to SC 24. The HCME occurrence rate, normalized to the sunspot number, is higher in SCs 24 and 25 than in SC 23. The solar wind total pressure in SC 25 is ∼35% smaller than that in SC 23. Furthermore, the occurrence rates of high-energy solar energetic particle events and intense geomagnetic storms are well below the corresponding values in SC 23, but similar to those in SC 24. We conclude that cycle 25 is likely to be similar to or slightly stronger than cycle 24, in agreement with polar-field precursor methods for cycle 25 prediction. 
    more » « less
  4. Abstract

    This paper presents a stochastic three-dimensional focused transport simulation of solar energetic particles (SEPs) produced by a data-driven coronal mass ejection (CME) shock propagating through a data-driven model of coronal and heliospheric magnetic fields. The injection of SEPs at the CME shock is treated using diffusive shock acceleration of post-shock suprathermal solar wind ions. A time-backward stochastic simulation is employed to solve the transport equation to obtain the SEP time–intensity profile at any location, energy, and pitch angle. The model is applied to a SEP event on 2020 May 29, observed by STEREO-A close to ∼1 au and by Parker Solar Probe (PSP) when it was about 0.33 au away from the Sun. The SEP event was associated with a very slow CME with a plane-of-sky speed of 337 km s−1at a height below 6RSas reported in the SOHO/LASCO CME catalog. We compute the time profiles of particle flux at PSP and STEREO-A locations, and estimate both the spectral index of the proton energy spectrum for energies between ∼2 and 16 MeV and the equivalent path length of the magnetic field lines experienced by the first arriving SEPs. We find that the simulation results are well correlated with observations. The SEP event could be explained by the acceleration of particles by a weak CME shock in the low solar corona that is not magnetically connected to the observers.

    more » « less
  5. Abstract

    This study explores the behavior of machine-learning-based flare forecasting models deployed in a simulated operational environment. Using Georgia State University’s Space Weather Analytics for Solar Flares benchmark data set, we examine the impacts of training methodology and the solar cycle on decision tree, support vector machine, and multilayer perceptron performance. We implement our classifiers using three temporal training windows: stationary, rolling, and expanding. The stationary window trains models using a single set of data available before the first forecasting instance, which remains constant throughout the solar cycle. The rolling window trains models using data from a constant time interval before the forecasting instance, which moves with the solar cycle. Finally, the expanding window trains models using all available data before the forecasting instance. For each window, a number of input features (1, 5, 10, 25, 50, and 120) and temporal sizes (5, 8, 11, 14, 17, and 20 months) were tested. To our surprise, we found that, for a window of 20 months, skill scores were comparable regardless of the window type, feature count, and classifier selected. Furthermore, reducing the size of this window only marginally decreased stationary and rolling window performance. This implies that, given enough data, a stationary window can be chosen over other window types, eliminating the need for model retraining. Finally, a moderately strong positive correlation was found to exist between a model’s false-positive rate and the solar X-ray background flux. This suggests that the solar cycle phase has a considerable influence on forecasting.

    more » « less