skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Predicting Solar Proton Events of Solar Cycles 22–24 Using GOES Proton and Soft-X-Ray Flux Features
Abstract Solar energetic particle (SEP) events and their major subclass, solar proton events (SPEs), can have unfavorable consequences on numerous aspects of life and technology, making them one of the most harmful effects of solar activity. Garnering knowledge preceding such events by studying operational data flows is essential for their forecasting. Considering only solar cycle (SC) 24 in our previous study, we found that it may be sufficient to only utilize proton and soft X-ray (SXR) parameters for SPE forecasts. Here, we report a catalog recording ≥10 MeV ≥10 particle flux unit SPEs with their properties, spanning SCs 22–24, using NOAA’s Geostationary Operational Environmental Satellite flux data. We report an additional catalog of daily proton and SXR flux statistics for this period, employing it to test the application of machine learning (ML) on the prediction of SPEs using a support vector machine (SVM) and extreme gradient boosting (XGBoost). We explore the effects of training models with data from oneandtwo SCs, evaluating how transferable a model might be across different time periods. XGBoost proved to be more accurate than SVMs for almost every test considered, while also outperforming operational SWPC NOAA predictions and a persistence forecast. Interestingly, training done with SC 24 produces weaker true skill statistic and Heidke skill scores2, even when paired with SC 22 or SC 23, indicating transferability issues. This work contributes toward validating forecasts using long-spanning data—an understudied area in SEP research that should be considered to verify the cross cycle robustness of ML-driven forecasts.  more » « less
Award ID(s):
1743321 2320147 1916509 1936361
PAR ID:
10500948
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
IOPSCIENCE
Date Published:
Journal Name:
The Astrophysical Journal Supplement Series
Volume:
270
Issue:
1
ISSN:
0067-0049
Page Range / eLocation ID:
15
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Solar energetic particle (SEP) events, in particular high-energy-range SEP events, pose significant risks to space missions, astronauts, and technological infrastructure. Accurate prediction of these high-impact events is crucial for mitigating potential hazards. In this study, we present an end-to-end ensemble machine learning (ML) framework for the prediction of high-impact ∼100 MeV SEP events. Our approach leverages diverse data modalities sourced from the Solar and Heliospheric Observatory and the Geostationary Operational Environmental Satellite integrating extracted active region polygons from solar extreme ultraviolet (EUV) imagery, time-series proton flux measurements, sunspot activity data, and detailed active region characteristics. To quantify the predictive contribution of each data modality (e.g., EUV or time series), we independently evaluate them using a range of ML models to assess their performance in forecasting SEP events. Finally, to enhance the SEP predictive performance, we train an ensemble learning model that combines all the models trained on individual data modalities, leveraging the strengths of each data modality. Our proposed ensemble approach shows promising performance, achieving a recall of 0.80 and 0.75 in balanced and imbalanced settings, respectively, underscoring the effectiveness of multimodal data integration for robust SEP event prediction and enhanced forecasting capabilities. 
    more » « less
  2. Abstract It is known that the weak state of the heliosphere due to diminished solar activity in cycle 24 backreacted on coronal mass ejections (CMEs) to make them appear wider for a given speed. One of the consequences of the weak state of the heliosphere is that more CMEs appear as halo CMEs (HCMEs), and halos are formed at shorter heliocentric distances. Current predictions for the strength of solar cycle (SC) 25 range from half to twice the strength of SC 24. We compare the HCME occurrence rate and other properties during the rise phase of cycles 23, 24, and 25 to weigh in on the strength of SC 25. We find that HCME and solar wind properties in SC 25 are intermediate between SCs 23 and 24, but closer to SC 24. The HCME occurrence rate, normalized to the sunspot number, is higher in SCs 24 and 25 than in SC 23. The solar wind total pressure in SC 25 is ∼35% smaller than that in SC 23. Furthermore, the occurrence rates of high-energy solar energetic particle events and intense geomagnetic storms are well below the corresponding values in SC 23, but similar to those in SC 24. We conclude that cycle 25 is likely to be similar to or slightly stronger than cycle 24, in agreement with polar-field precursor methods for cycle 25 prediction. 
    more » « less
  3. Abstract Solar energetic particles (SEPs) are associated with extreme solar events that can cause major damage to space- and ground-based life and infrastructure. High-intensity SEP events, particularly ∼100 MeV SEP events, can pose severe health risks for astronauts owing to radiation exposure and affect Earth’s orbiting satellites (e.g., Landsat and the International Space Station). A major challenge in the SEP event prediction task is the lack of adequate SEP data because of the rarity of these events. In this work, we aim to improve the prediction of ∼30, ∼60, and ∼100 MeV SEP events by synthetically increasing the number of SEP samples. We explore the use of a univariate and multivariate time series of proton flux data as input to machine-learning-based prediction methods, such as time series forest (TSF). Our study covers solar cycles 22, 23, and 24. Our findings show that using data augmentation methods, such as the synthetic minority oversampling technique, remarkably increases the accuracy and F1-score of the classifiers used in this research, especially for TSF, where the average accuracy increased by 20%, reaching around 90% accuracy in the ∼100 MeV SEP prediction task. We also achieved higher prediction accuracy when using the multivariate time series data of the proton flux. Finally, we build a pipeline framework for our best-performing model, TSF, and provide a comprehensive hierarchical classification of the ∼100, ∼60, and ∼30 MeV and non-SEP prediction scenarios. 
    more » « less
  4. Abstract Solar energetic particle (SEP) events pose significant risks to both space and ground-level infrastructure, as well as to human health in space. Understanding and predicting these events are critical for mitigating their potential impacts. In this paper, we address the challenge of predicting SEP events using proton flux data. We leverage some of the most recent advances in time series data mining, such as shapelets and the matrix profile, to propose a simple and easily understandable prediction approach. Our objective is to mitigate the interpretability challenges inherent to most machine learning models and to show that other methods exist that can not only yield accurate forecasts but also facilitate exploration and insight generation within the data domain. For this purpose, we construct a multivariate time series data set consisting of proton flux data recorded by the National Oceanic and Atmospheric Administration's geosynchronous orbit Earth-observing satellite. Then, we use our proposed approach to mine shapelets and make predictions using a random forest classifier. We demonstrate that our approach rivals state-of-the-art SEP prediction, offering superior interpretability and the ability to predict SEP events before their parent eruptive flares. 
    more » « less
  5. Abstract The flux of energetic particles originating from the Sun fluctuates during the solar cycles. It depends on the number and properties of active regions (ARs) present in a single day and associated solar activities, such as solar flares and coronal mass ejections. Observational records of the Space Weather Prediction Center NOAA enable the creation of time-indexed databases containing information about ARs and particle flux enhancements, most widely known as solar energetic particle (SEP) events. In this work, we utilize the data available for solar cycles 21–24 and the initial phase of cycle 25 to perform a statistical analysis of the correlation between SEPs and properties of ARs inferred from the McIntosh and Hale classifications. We find that the complexity of the magnetic field, longitudinal location, area, and penumbra type of the largest sunspot of ARs are most correlated with the production of SEPs. It is found that most SEPs (≈60%, or 108 out of 181 considered events) were generated from an AR classified with the “k” McIntosh subclass as the second component, and these ARs are more likely to produce SEPs if they fall in a Hale class containing aδcomponent. The resulting database containing information about SEP events and ARs is publicly available and can be used for the development of machine learning models to predict the occurrence of SEPs. 
    more » « less