NSF PAR Search | NSF Public Access Repository

H-Alpha anomalyzer: An anomaly detector for H-Alpha solar observations using a grid-based approach

https://doi.org/10.1016/j.softx.2025.102120

Khazaei, Mahsa; Mahdi, Heba; Chaurasiya, Kartik; Ahmadzadeh, Azim (May 2025, SoftwareX)

Free, publicly-accessible full text available May 1, 2026
Outlier Detection and Removal in Multivariate Time Series for a More Robust Machine Learning–based Solar Flare Prediction

https://doi.org/10.3847/1538-4365/adb9e3

Wen, Junzhi; Ahmadzadeh, Azim; Georgoulis, Manolis K; Sadykov, Viacheslav M; Angryk, Rafal A (April 2025, The Astrophysical Journal Supplement Series)

Abstract: Timely and accurate prediction of solar flares is a crucial task due to the danger they pose to human life and infrastructure beyond Earth’s atmosphere. Although various machine learning algorithms have been employed to improve solar flare prediction, there has been limited focus on improving performance using outlier detection. In this study, we propose the use of a tree-based outlier detection algorithm, Isolation Forest (iForest), to identify multivariate time-series instances within the flare-forecasting benchmark data set, Space Weather Analytics for Solar Flares (SWAN-SF). By removing anomalous samples from the nonflaring class (N-class) data, we observe a significant improvement in both the true skill score and the updated Heidke skill score in two separate experiments. We focus on analyzing outliers detected by iForest at a 2.4% contamination rate, considered the most effective overall. Our analysis reveals a co-occurrence between the outliers we discovered and strong flares. Additionally, we investigate the similarity between the outliers and the strong-flare data and quantify it using Kullback–Leibler divergence. This analysis demonstrates a higher similarity between our outliers and strong-flare data when compared to the similarity between the outliers and the rest of the N-class data, supporting our rationale for using outlier detection to enhance SWAN-SF data for flare prediction. Furthermore, we explore a novel approach by treating our outliers as if they belong to flaring-class data in the training phase of our machine learning, resulting in further enhancements to our models’ performance.
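As a rough illustration of the similarity comparison described in this abstract, the sketch below computes a Kullback–Leibler divergence between histograms of a single feature. Everything in it is an assumption made for illustration: the helper names `histogram` and `kl_divergence`, the binning, the smoothing constant, and the toy values, none of which come from the paper or from SWAN-SF.

```python
import math

def histogram(values, lo, hi, bins, eps=1e-9):
    """Normalized histogram with a small additive constant so no bin is exactly zero."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        # Clamp to the valid bin range so values at or beyond the edges still count.
        idx = max(0, min(int((v - lo) / width), bins - 1))
        counts[idx] += 1
    total = len(values) + eps * bins
    return [(c + eps) / total for c in counts]

def kl_divergence(p, q):
    """D_KL(P || Q) for two discrete distributions over the same bins."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Toy, made-up feature values (hypothetical, NOT taken from the paper).
outliers = [0.7, 0.8, 0.9, 0.85]
strong_flares = [0.75, 0.8, 0.95, 0.9, 0.7]
rest_of_n_class = [0.1, 0.2, 0.15, 0.3, 0.25, 0.1]

p_out = histogram(outliers, 0.0, 1.0, bins=5)
d_strong = kl_divergence(p_out, histogram(strong_flares, 0.0, 1.0, bins=5))
d_rest = kl_divergence(p_out, histogram(rest_of_n_class, 0.0, 1.0, bins=5))

# A smaller divergence means the two distributions are more alike.
print(f"KL(outliers || strong flares) = {d_strong:.4f}")
print(f"KL(outliers || rest of N-class) = {d_rest:.4f}")
```

In this toy setup the outliers sit closer to the strong-flare values, so the first divergence is the smaller of the two. How the authors actually binned or aggregated the multivariate time series is not stated in the abstract.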
Free, publicly-accessible full text available April 1, 2026
MAGFILO: Manually Annotated GONG Filaments in H-Alpha Observations

https://doi.org/10.7910/DVN/J6JNVK

Ahmadzadeh, Azim; Adhyapak, Rohan; Chaurasiya, Kartik; Nagubandi, Laxmi Alekhya; Aparna, V; Martens, Petrus C; Pevtsov, Alexei; Bertello, Luca; Pevtsov, Alexander; Douglas, Naomi; et al (January 2024, Harvard Dataverse)

MAGFiLO is a dataset of manually annotated solar filaments from H-Alpha observations captured by the Global Oscillation Network Group (GONG). This dataset includes over ten thousand annotated filaments, spanning the years 2011 through 2022. Each annotation details one filament's segmentation, minimum bounding box, spine, and magnetic field chirality. MAGFiLO is the first dataset of its size, enabling advanced deep learning models to identify filaments and their features with unprecedented precision. It also provides a testbed for solar physicists interested in large-scale analysis of filaments.
Measuring Class-Imbalance Sensitivity of Deterministic Performance Evaluation Metrics

https://doi.org/10.1109/ICIP46576.2022.9897445

Ahmadzadeh, Azim; Angryk, Rafal A. (October 2022, 2022 IEEE International Conference on Image Processing (ICIP))

The class-imbalance issue is intrinsic to many real-world machine learning tasks, particularly to rare-event classification problems. Although the impact and treatment of imbalanced data are widely known, the magnitude of a metric’s sensitivity to class imbalance has attracted little attention. As a result, sensitive metrics are often dismissed even though their sensitivity may be only marginal. In this paper, we introduce an intuitive evaluation framework that quantifies metrics’ sensitivity to class imbalance. Moreover, we reveal an interesting fact: there is a logarithmic behavior in metrics’ sensitivity, meaning that higher imbalance ratios are associated with lower sensitivity of metrics. Our framework builds an intuitive understanding of the impact of class imbalance on metrics. We believe this can help avoid many common mistakes, especially the less-emphasized and incorrect assumption that all metrics’ quantities are comparable under different class-imbalance ratios.
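To make the comparability point concrete, here is a minimal sketch that holds a classifier's true-positive and true-negative rates fixed and varies only the imbalance ratio. It is not the paper's evaluation framework; the helper names, the rates (0.8 and 0.9), and the class sizes are assumptions chosen for illustration.

```python
def confusion_from_rates(tpr, tnr, n_pos, n_neg):
    """Expected confusion-matrix cells for fixed per-class rates."""
    tp = tpr * n_pos
    fn = n_pos - tp
    tn = tnr * n_neg
    fp = n_neg - tn
    return tp, fn, fp, tn

def precision(tp, fp):
    return tp / (tp + fp)

def tss(tp, fn, fp, tn):
    """True skill statistic: true-positive rate minus false-positive rate."""
    return tp / (tp + fn) - fp / (fp + tn)

results = {}
for ratio in (1, 10, 100):  # negatives per positive (hypothetical ratios)
    tp, fn, fp, tn = confusion_from_rates(0.8, 0.9, n_pos=100, n_neg=100 * ratio)
    results[ratio] = (precision(tp, fp), tss(tp, fn, fp, tn))
    print(f"{ratio:>3}:1  precision={results[ratio][0]:.3f}  TSS={results[ratio][1]:.3f}")
```

Precision falls as negatives are added even though the classifier's per-class behavior is unchanged, while TSS stays the same. That is why a precision value reported at one imbalance ratio cannot be compared directly with one reported at another.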
Full Text Available
CGAN-based synthetic multivariate time-series generation: a solution to data scarcity in solar flare forecasting

https://doi.org/10.1007/s00521-022-07361-8

Chen, Yang; Kempton, Dustin J.; Ahmadzadeh, Azim; Wen, Junzhi; Ji, Anli; Angryk, Rafal A. (August 2022, Neural Computing and Applications)

Full Text Available
Multiscale IOU: A Metric for Evaluation of Salient Object Detection with Fine Structures

https://doi.org/10.1109/ICIP42928.2021.9506337

Ahmadzadeh, Azim; Kempton, Dustin J.; Chen, Yang; Angryk, Rafal A. (September 2021, 2021 IEEE International Conference on Image Processing (ICIP))

Full Text Available
Feature Selection on a Flare Forecasting Testbed: A Comparative Study of 24 Methods

https://doi.org/10.1109/ICDMW53433.2021.00138

Yeolekar, Atharv; Patel, Sagar; Talla, Shreejaa; Puthucode, Krishna Rukmini; Ahmadzadeh, Azim; Sadykov, Viacheslav M.; Angryk, Rafal A. (December 2021, Systematic Feature Engineering for Time-Series Data Mining (SFE-TSDM) workshop)

Full Text Available
How to Train Your Flare Prediction Model: Revisiting Robust Sampling of Rare Events

https://doi.org/10.3847/1538-4365/abec88

Ahmadzadeh, Azim; Aydin, Berkay; Georgoulis, Manolis K.; Kempton, Dustin J.; Mahajan, Sushant S.; Angryk, Rafal A. (May 2021, The Astrophysical Journal Supplement Series)

Abstract: We present a case study of solar flare forecasting by means of metadata feature time series, by treating it as a prominent class-imbalance and temporally coherent problem. Taking full advantage of pre-flare time series in solar active regions is made possible via the Space Weather Analytics for Solar Flares (SWAN-SF) benchmark data set, a partitioned collection of multivariate time series of active region properties comprising 4075 regions and spanning over 9 yr of the Solar Dynamics Observatory period of operations. We showcase the general concept of temporal coherence triggered by the demand of continuity in time series forecasting and show that lack of proper understanding of this effect may spuriously enhance models’ performance. We further address another well-known challenge in rare-event prediction, namely, the class-imbalance issue. The SWAN-SF is an appropriate data set for this, with a 60:1 imbalance ratio for GOES M- and X-class flares and an 800:1 imbalance ratio for X-class flares against flare-quiet instances. We revisit the main remedies for these challenges and present several experiments to illustrate the exact impact that each of these remedies may have on performance. Moreover, we acknowledge that some basic data manipulation tasks such as data normalization and cross validation may also impact the performance; we discuss these problems as well. In this framework we also review the primary advantages and disadvantages of using true skill statistic and Heidke skill score, two widely used performance verification metrics for the flare-forecasting task. In conclusion, we show and advocate for the benefits of time series versus point-in-time forecasting, provided that the above challenges are measurably and quantitatively addressed.
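For readers unfamiliar with the two verification metrics named in this abstract, the sketch below computes both from a binary confusion matrix. It assumes one common two-class form of the Heidke skill score; the abstract does not say which variant the authors use, and the example counts are made up for illustration.

```python
def true_skill_statistic(tp, fn, fp, tn):
    """TSS = TP / (TP + FN) - FP / (FP + TN)."""
    return tp / (tp + fn) - fp / (fp + tn)

def heidke_skill_score(tp, fn, fp, tn):
    """Two-class HSS: improvement over the agreement expected by chance.

    HSS = 2 (TP*TN - FN*FP) / ((TP+FN)(FN+TN) + (TP+FP)(FP+TN))
    """
    numerator = 2 * (tp * tn - fn * fp)
    denominator = (tp + fn) * (fn + tn) + (tp + fp) * (fp + tn)
    return numerator / denominator

# Hypothetical counts with a 60:1 negative-to-positive ratio (illustrative only).
tp, fn, fp, tn = 40, 10, 100, 2900

example_tss = true_skill_statistic(tp, fn, fp, tn)
example_hss = heidke_skill_score(tp, fn, fp, tn)
print(f"TSS = {example_tss:.4f}")
print(f"HSS = {example_hss:.4f}")
```

With these counts the TSS is noticeably higher than the HSS, because the 100 false positives are a small fraction of the 3000 negatives yet outnumber the 40 true positives.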
Full Text Available
On The Effectiveness of Imaging of Time Series for Flare Forecasting Problem

https://doi.org/10.1109/BigData50022.2020.9378006

Chen, Yang; Ji, Anli; Babajiyavar, Pavan Ajit; Ahmadzadeh, Azim; Angryk, Rafal A. (December 2020, 2020 IEEE International Conference on Big Data (Big Data))

Full Text Available
Contingency Space: A Semimetric Space for Classification Evaluation

https://doi.org/10.1109/TPAMI.2022.3167007

Ahmadzadeh, Azim; Kempton, Dustin J.; Martens, Petrus C.; Angryk, Rafal A. (February 2023, IEEE Transactions on Pattern Analysis and Machine Intelligence)
