<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>A Machine Learning Ecosystem for Filament Analysis - Phase I: A Manually Annotated Dataset of Filaments</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>04/17/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10427981</idno>
					<idno type="doi"></idno>
					<title level='j'>Space Weather Workshop 2023</title>
<idno></idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Rohan Adhyapak Samuel McDonald</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Detecting and classifying solar filaments is critical in forecasting Earth-affecting transient solar events, including large solar flares and coronal mass ejections. Undetected, these space weather events can cause catastrophic geomagnetic storms resulting in substantial economic damage and death. A network of ground-based observatories, named the Global Oscillation Network Group, was created to provide continuous observation of solar activity. However, parsing the large volume of image data continuously streaming in from GONG tests the limitations of human analysis and presents challenges for space weather research. To address this, we proposed the Machine Learning Ecosystem for Filament Detection (MLEcoFi), an NSF-funded, multiyear project that will produce an open-source collection of filament data and computer vision software for space weather research. In cooperation with NSO, MLEcoFi will assist in automatically detecting, classifying, localizing, and segmenting solar filaments in full-disk H-alpha images. The present phase of research aims to produce a dataset of thousands of GONG H-alpha images, with all filaments’ chirality, bounding box, and segmentation mask manually annotated following strong quality assurance standards, advancing research of filaments and filament-related topics. This dataset, as the first MLEcoFi product, will aid in ongoing and future development of products, including a chirality-aware filament data augmentation engine, high-precision image segmentation loss function, deep neural network segmentation and classification model, and filament detection module that is planned for deployment into NSO’s live infrastructure for the research community. The MLEcoFi team is eager to present its progress and future milestones to obtain valuable feedback from the future users of this ecosystem.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Project Overview</head><p>The present phase of research aims to produce a dataset of thousands of GONG H-alpha images, with all solar filaments' magnetic chirality, bounding box, and segmentation mask manually annotated through morphological analysis following strong quality assurance and data validation standards. This high-fidelity dataset will help advance research of filaments and filament-related topics. This dataset, as the first ML Eco Fi product, will facilitate the development of upcoming ML Eco Fi products, including a browser-based tool for easily viewing and analyzing GONG H-alpha images in cadence, chirality-aware filament data augmentation engine, highprecision image segmentation machine learning loss function, deep neural network segmentation and classification model, and filament detection module that is planned for deployment into NSO's live infrastructure for the research community.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head><p>The data annotation phase is ongoing, but sampled here are examples of annotations in the final dataset. The ML Eco Fi annotations include instance segmentation polygons and masks, bounding boxes, spine polygons, and magnetic chirality labels, for all filaments in focus in each full-disk H-alpha image. As shown in the second figure, care is taken to ensure the fine structural details of each filaments such as the size and shape of its barbs, are accurately captured in the segmentation mask. Existing research datasets for the morphological analysis of filaments often lack this information. It is the ML Eco Fi team's objective that this additional effort will yield a dataset that meets and exceeds the quality of current data sources.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusion and Future Work</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>References and Acknowledgements</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Background and Motivation</head><p>Timely detection and classification of solar filaments is critical in forecasting Earth-affecting transient solar events, including large solar flares and coronal mass ejections. Undetected, these space weather events can cause catastrophic geomagnetic storms resulting in substantial economic damage and death. NSO deployed a network of six ground-based solar observatories, GONG, in 1995 to provide continuous observation of solar activity and aid in space weather research efforts <ref type="bibr">[1]</ref>. 2010, GONG began archiving opensource full-disk H-alpha FITS images for the research community, collected year-round at an uninterrupted one-minute cadence. Based on previous investigations of filaments <ref type="bibr">[2]</ref>[3], and identifying that existing automatic segmentation models fail to capture fine structure information <ref type="bibr">[4]</ref>, the ML Eco Fi team set out to develop a new automatic segmentation model to aid in NSO's mission.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Abstract</head><p>The ML Eco Fi team's ongoing annotation and validation efforts are proceeding well. When the dataset is completed, it will be made available publicly, alongside detailed documentation of its production and an opensource repository of related software. A novel machine learning segmentation loss function based on a fine-structure-sensitive object similarity metric called Multiscale Intersection over Union (MIoU) is currently being developed by the ML Eco Fi team. The MIoU segmentation loss function will be integrated into a deep neural network filamentdetection architecture, trained using the ML Eco Fi filament dataset augmented by our chirality-aware data augmentation engine. Once implemented, this model will be able to automatically classify and segment filaments in new H-alpha images from GONG. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>This project has been</head></div></body>
		</text>
</TEI>
