Title: Predicting Solar Proton Events of Solar Cycles 22–24 Using GOES Proton and Soft-X-Ray Flux Features
Abstract: Solar energetic particle (SEP) events and their major subclass, solar proton events (SPEs), can have unfavorable consequences on numerous aspects of life and technology, making them one of the most harmful effects of solar activity. Garnering knowledge preceding such events by studying operational data flows is essential for their forecasting. Considering only solar cycle (SC) 24 in our previous study, we found that it may be sufficient to only utilize proton and soft X-ray (SXR) parameters for SPE forecasts. Here, we report a catalog recording ≥10 MeV, ≥10 particle flux unit SPEs with their properties, spanning SCs 22–24, using NOAA’s Geostationary Operational Environmental Satellite flux data. We report an additional catalog of daily proton and SXR flux statistics for this period, employing it to test the application of machine learning (ML) on the prediction of SPEs using a support vector machine (SVM) and extreme gradient boosting (XGBoost). We explore the effects of training models with data from one and two SCs, evaluating how transferable a model might be across different time periods. XGBoost proved to be more accurate than SVMs for almost every test considered, while also outperforming operational SWPC NOAA predictions and a persistence forecast. Interestingly, training done with SC 24 produces weaker true skill statistic and Heidke skill scores, even when paired with SC 22 or SC 23, indicating transferability issues. This work contributes toward validating forecasts using long-spanning data, an understudied area in SEP research that should be considered to verify the cross-cycle robustness of ML-driven forecasts.
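The abstract compares models by true skill statistic (TSS) and Heidke skill score (HSS). As a minimal sketch of how these two scores are commonly computed from a 2x2 contingency table of daily event/no-event forecasts, the example below uses the standard textbook definitions. The function name `skill_scores` and the counts in the usage lines are hypothetical and are not taken from the paper.

```python
def skill_scores(tp, fp, fn, tn):
    """Return (TSS, HSS) for a 2x2 contingency table.

    tp: events correctly forecast (hits)
    fp: events forecast that did not occur (false alarms)
    fn: events that occurred but were not forecast (misses)
    tn: non-events correctly forecast (correct negatives)
    """
    events = tp + fn        # days on which an event occurred
    non_events = fp + tn    # days on which no event occurred
    if events == 0 or non_events == 0:
        raise ValueError("need at least one event and one non-event")

    # TSS = probability of detection - probability of false detection
    tss = tp / events - fp / non_events

    # HSS = improvement over the number of correct forecasts expected by chance
    hss = 2 * (tp * tn - fp * fn) / (
        events * (fn + tn) + (tp + fp) * non_events
    )
    return tss, hss


# Hypothetical counts for illustration only (not results from the paper).
tss, hss = skill_scores(tp=8, fp=2, fn=2, tn=88)
print(f"TSS = {tss:.3f}, HSS = {hss:.3f}")  # TSS = 0.778, HSS = 0.778
```

TSS does not depend on the ratio of event to non-event days, which is one reason it is often reported alongside HSS for rare-event forecasts such as SPEs.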
Award ID(s):
1743321 2320147 1916509 1936361
PAR ID:
10485449
Publisher / Repository:
DOI PREFIX: 10.3847
Journal Name:
The Astrophysical Journal Supplement Series
Volume:
270
Issue:
1
ISSN:
0067-0049
Format(s):
Medium: X
Size(s):
Article No. 15
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Solar energetic particle (SEP) events, in particular high-energy-range SEP events, pose significant risks to space missions, astronauts, and technological infrastructure. Accurate prediction of these high-impact events is crucial for mitigating potential hazards. In this study, we present an end-to-end ensemble machine learning (ML) framework for the prediction of high-impact ∼100 MeV SEP events. Our approach leverages diverse data modalities sourced from the Solar and Heliospheric Observatory and the Geostationary Operational Environmental Satellite, integrating extracted active region polygons from solar extreme ultraviolet (EUV) imagery, time-series proton flux measurements, sunspot activity data, and detailed active region characteristics. To quantify the predictive contribution of each data modality (e.g., EUV or time series), we independently evaluate them using a range of ML models to assess their performance in forecasting SEP events. Finally, to enhance the SEP predictive performance, we train an ensemble learning model that combines all the models trained on individual data modalities, leveraging the strengths of each data modality. Our proposed ensemble approach shows promising performance, achieving a recall of 0.80 and 0.75 in balanced and imbalanced settings, respectively, underscoring the effectiveness of multimodal data integration for robust SEP event prediction and enhanced forecasting capabilities.
  2. Abstract Solar energetic particles (SEPs) are associated with extreme solar events that can cause major damage to space- and ground-based life and infrastructure. High-intensity SEP events, particularly ∼100 MeV SEP events, can pose severe health risks for astronauts owing to radiation exposure and affect Earth’s orbiting satellites (e.g., Landsat and the International Space Station). A major challenge in the SEP event prediction task is the lack of adequate SEP data because of the rarity of these events. In this work, we aim to improve the prediction of ∼30, ∼60, and ∼100 MeV SEP events by synthetically increasing the number of SEP samples. We explore the use of a univariate and multivariate time series of proton flux data as input to machine-learning-based prediction methods, such as time series forest (TSF). Our study covers solar cycles 22, 23, and 24. Our findings show that using data augmentation methods, such as the synthetic minority oversampling technique, remarkably increases the accuracy and F1-score of the classifiers used in this research, especially for TSF, where the average accuracy increased by 20%, reaching around 90% accuracy in the ∼100 MeV SEP prediction task. We also achieved higher prediction accuracy when using the multivariate time series data of the proton flux. Finally, we build a pipeline framework for our best-performing model, TSF, and provide a comprehensive hierarchical classification of the ∼100, ∼60, and ∼30 MeV and non-SEP prediction scenarios. 
  3. Abstract It is known that the weak state of the heliosphere due to diminished solar activity in cycle 24 backreacted on coronal mass ejections (CMEs) to make them appear wider for a given speed. One of the consequences of the weak state of the heliosphere is that more CMEs appear as halo CMEs (HCMEs), and halos are formed at shorter heliocentric distances. Current predictions for the strength of solar cycle (SC) 25 range from half to twice the strength of SC 24. We compare the HCME occurrence rate and other properties during the rise phase of cycles 23, 24, and 25 to weigh in on the strength of SC 25. We find that HCME and solar wind properties in SC 25 are intermediate between SCs 23 and 24, but closer to SC 24. The HCME occurrence rate, normalized to the sunspot number, is higher in SCs 24 and 25 than in SC 23. The solar wind total pressure in SC 25 is ∼35% smaller than that in SC 23. Furthermore, the occurrence rates of high-energy solar energetic particle events and intense geomagnetic storms are well below the corresponding values in SC 23, but similar to those in SC 24. We conclude that cycle 25 is likely to be similar to or slightly stronger than cycle 24, in agreement with polar-field precursor methods for cycle 25 prediction. 
  4. Abstract Solar energetic particle (SEP) events pose significant risks to both space and ground-level infrastructure, as well as to human health in space. Understanding and predicting these events are critical for mitigating their potential impacts. In this paper, we address the challenge of predicting SEP events using proton flux data. We leverage some of the most recent advances in time series data mining, such as shapelets and the matrix profile, to propose a simple and easily understandable prediction approach. Our objective is to mitigate the interpretability challenges inherent to most machine learning models and to show that other methods exist that can not only yield accurate forecasts but also facilitate exploration and insight generation within the data domain. For this purpose, we construct a multivariate time series data set consisting of proton flux data recorded by the National Oceanic and Atmospheric Administration's geosynchronous orbit Earth-observing satellite. Then, we use our proposed approach to mine shapelets and make predictions using a random forest classifier. We demonstrate that our approach rivals state-of-the-art SEP prediction, offering superior interpretability and the ability to predict SEP events before their parent eruptive flares. 
  5. Abstract Solar energetic particle (SEP) events, originating from solar flares and coronal mass ejections, present significant hazards to space exploration and technology on Earth. Accurate prediction of these high‐energy events is essential for safeguarding astronauts, spacecraft, and electronic systems. In this study, we conduct an in‐depth investigation into the application of multimodal data fusion techniques for the prediction of high‐energy SEP events, particularly ∼100 MeV events. Our research utilizes six machine learning (ML) models, each finely tuned for time series analysis, including Univariate Time Series (UTS), Image‐based model (Image), Univariate Feature Concatenation (UFC), Univariate Deep Concatenation (UDC), Univariate Deep Merge (UDM), and Univariate Score Concatenation (USC). By combining time series proton flux data with solar X‐ray images, we exploit complementary insights into the underlying solar phenomena responsible for SEP events. Rigorous evaluation metrics, including accuracy, F1‐score, and other established measures, are applied, along with K‐fold cross‐validation, to ensure the robustness and generalization of our models. Additionally, we explore the influence of observation window sizes on classification accuracy.