Abstract We present FlareDB, a database that provides comprehensive magnetic field information, ultraviolet/extreme ultraviolet (UV/EUV) emissions, and white light continuum images for solar active regions (ARs) associated with 151 significant flares from May 2010 to May 2025. The data, sourced from the Solar Dynamics Observatory (SDO) via the Joint Science Operations Center (JSOC), were processed with SunPy and stored in standardized JSOC FITS format. FlareDB includes all M5.0 and larger flares within 50° of the solar disk center. Key features include (1) Atmospheric Imaging Assembly (AIA) AR patches in Helioprojective Cartesian(HPC) and Lambert Cylindrical Equal-Area (CEA) projections, aligned with corresponding HMI magnetogram patches; (2) quick-look movies with uniform value ranges that ensure consistent visualization, maintain data uniformity, and enhance readiness for machine learning studies; (3) a supplementary web interface that allows the entire dataset of a flare to be downloaded for large flare analysis. One of FlareDB’s primary objectives is to support scientists in predicting and understanding the onset of solar eruptions, including flares and coronal mass ejections. The data set is machine-learning ready for this purpose.
more »
« less
Bitmap Filtered Line of Sight HMI Active Region Patches with Augmentations
Dataset Description This dataset consists of processed Line-of-Sight (LoS) magnetogram images of Active Regions (ARs) from the Helioseismic and Magnetic Imager (HMI) onboard the Solar Dynamics Observatory (SDO). The images are derived from the Space-Weather HMI Active Region Patches (SHARP) data product definitive series and cover the period from May 2010 to 2018, sampled hourly. Dataset Contents: Processed Magnetogram Images: Each image represents a cropped and standardized view of an AR patch, extracted and adjusted from the original magnetograms. These images have been filtered and normalized to a size of 512×512 pixels. Processing Steps: Cropping: Magnetograms are cropped using bitmaps that define the region of interest within the AR patches. Regions smaller than 70 pixels in width are excluded. Flux Adjustment: Magnetic flux values are capped at ±256 G, with values within ±25 G set to 0 to minimize noise. Standardization: Patches are resized to 512×512 pixels using zero-padding for smaller patches or a 512×512 kernel to select regions with the maximum total unsigned flux (USFLUX) for larger patches. Normalization: Final images are scaled to fit within the range of 0-255. Data Dictionary: harp_N1_N2: These tar files contains folders where the AR patches with harp number N1 to N2 are included. complete_hourly_dataset.csv: This includes the list of hourly sampled magnetograms along with their associated goes flare class, assuming a 24 hour forecast horizon. augmentations: Five different augmentations of AR patches corresponding to GOES flare classes greater than C, assuming a 24 hour forecast horizon are listed as 5 different tar files. Look for: horizontal flip, vertical_flip, add noise, polarity change, and gaussian blur.
more »
« less
- Award ID(s):
- 2104004
- PAR ID:
- 10526392
- Publisher / Repository:
- Harvard Dataverse
- Date Published:
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Context.High-resolution magnetograms are crucial for studying solar flare dynamics because they enable the precise tracking of magnetic structures and rapid field changes. The Helioseismic and Magnetic Imager on board the Solar Dynamics Observatory (SDO/HMI) has been an essential provider of vector magnetograms. However, the spatial resolution of the HMI magnetograms is limited and hence is not able to capture the fine structures that are essential for understanding flare precursors. The Near InfraRed Imaging Spectropolarimeter on the 1.6 m Goode Solar Telescope (GST/NIRIS) at Big Bear Solar Observatory (BBSO) provides a better spatial resolution and is therefore more suitable to track the fine magnetic features and their connection to flare precursors. Aims.We propose DeepHMI, a machine-learning method for solar image super-resolution, to enhance the transverse and line-of-sight magnetograms of solar active regions (ARs) collected by SDO/HMI to better capture the fine-scale magnetic structures that are crucial for understanding solar flare dynamics. The enhanced HMI magnetograms can also be used to study spicules, sunspot light bridges and magnetic outbreaks, for which high-resolution data are essential. Methods.DeepHMI employs a conditional diffusion model that is trained using ground-truth images obtained by an inversion analysis of Stokes measurements collected by GST/NIRIS. Results.Our experiments show that DeepHMI performs better than the commonly used bicubic interpolation method in terms of four evaluation metrics. In addition, we demonstrate the ability of DeepHMI through a case study of the enhancement of SDO/HMI transverse and line-of-sight magnetograms of AR 12371 to GST/NIRIS data.more » « less
-
Supervised Machine Learning (ML) models for solar flare prediction rely on accurate labels for a given input data set, commonly obtained from the GOES/XRS X-ray flare catalog. With increasing interest in utilizing ultraviolet (UV) and extreme ultraviolet (EUV) image data as input to these models, we seek to understand if flaring activity can be defined and quantified using EUV data alone. This would allow us to move away from the GOES single pixel measurement definition of flares and use the same data we use for flare prediction for label creation. In this work, we present a Solar Dynamics Observatory (SDO) Atmospheric Imaging Assembly (AIA)-based flare catalog covering flare of GOES X-ray magnitudes C, M and X from 2010 to 2017. We use active region (AR) cutouts of full disk AIA images to match the corresponding SDO/Helioseismic and Magnetic Imager (HMI) SHARPS (Space weather HMI Active Region Patches) that have been extensively used in ML flare prediction studies, thus allowing for labeling of AR number as well as flare magnitude and timing. Flare start, peak, and end times are defined using a peak-finding algorithm on AIA time series data obtained by summing the intensity across the AIA cutouts. An extremely randomized trees (ERT) regression model is used to map SDO/AIA flare magnitudes to GOES X-ray magnitude, achieving a low-variance regression. We find an accurate overlap on 85% of M/X flares between our resulting AIA catalog and the GOES flare catalog. However, we also discover a number of large flares unrecorded or mislabeled in the GOES catalog.more » « less
-
Magnetic polarity inversion lines (PILs) in solar active regions are key to triggering flares and eruptions. Recently, engineered PIL features have been used for predicting solar eruptions. Derived from the original PIL dataset, using line-of-sight (LoS) magnetograms provided by the Solar Dynamics Observatory's (SDO) Helioseismic and Magnetic Imager (HMI) Active Region Patches (HARPs), we provide a publicly available comprehensive dataset in a supervised format, where each instance includes a raster of Polarity Inversion Lines (PILs), one of the polarity convex hull, and a multivariate time-series of properties related to PILs. Using SDO-GOES integrated flares historical data covering May 2010 to January 2019, we have assigned each of the instances their corresponding class of flare, FQ, C, M or X. By integrating these diverse data modalities, our approach aims to improve the accuracy of solar flare predictions. Initial findings suggest that the multimodal approach can uncover new patterns and relationships, potentially leading to breakthroughs in predictive accuracy and more effective mitigation strategies against the impacts of solar activities.more » « less
-
Using non-linear force free field (NLFFF) extrapolation, 3D magnetic fields were modeled from the 12-min cadence Solar Dynamics Observatory Helioseismic and Magnetic Imager (HMI) photospheric vector magnetograms, spanning a time period of 1 hour before through 1 hour after the start of 18 X-class and 12 M-class solar flares. Several magnetic field parameters were calculated from the modeled fields directly, as well as from the power spectrum of surface maps generated by summing the fields along the vertical axis, for two different regions: areas with photospheric |Bz|≥ 300 G (active region—AR) and areas above the photosphere with the magnitude of the non-potential field (BNP) greater than three standard deviations above of the AR field and either the unsigned twist number |Tw| ≥ 1 turn or the shear angle Ψ ≥ 80° (non-potential region—NPR). Superposed epoch (SPE) plots of the magnetic field parameters were analyzed to investigate the evolution of the 3D solar field during the solar flare events and discern consistent trends across all solar flare events in the dataset, as well as across subsets of flare events categorized by their magnetic and sunspot classifications. The relationship between different flare properties and the magnetic field parameters was quantitatively described by the Spearman ranking correlation coefficient, rs. The parameters that showed the most consistent and discernable trends among the flare events, particularly for the hour leading up to the eruption, were the total unsigned fluxϕ), free magnetic energy (EFree), total unsigned magnetic twist (τTot), and total unsigned free magnetic twist (ρTot). Strong (|rs| ∈ [0.6, 0.8)) to very strong (|rs| ∈ [0.8, 1.0]) correlations were found between the magnetic field parameters and the following flare properties: peak X-ray flux, duration, rise time, decay time, impulsiveness, and integrated flux; the strongest correlation coefficient calculated for each flare property was 0.62, 0.85, 0.73, 0.82, −0.81, and 0.82, respectively.more » « less
An official website of the United States government
