NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Spatiotemporal Data Augmentation of MODIS‐Landsat Water Bodies Using Adversarial Networks

https://doi.org/10.1029/2023WR036342

Filali_Boubrahimi, Soukaina; Neema, Ashit; Nassar, Ayman; Hosseinzadeh, Pouya; Hamdi, Shah Muhammad (March 2024, Water Resources Research)

Abstract With increasing demands for precise water resource management, there is a growing need for advanced techniques in mapping water bodies. The currently deployed satellites provide complementary data that are either of high spatial or high temporal resolutions. As a result, there is a clear trade‐off between space and time when considering a single data source. For the efficient monitoring of multiple environmental resources, various Earth science applications need data at high spatial and temporal resolutions. To address this need, many data fusion methods have been described in the literature, that rely on combining data snapshots from multiple sources. Traditional methods face limitations due to sensitivity to atmospheric disturbances and other environmental factors, resulting in noise, outliers, and missing data. This paper introduces Hydrological Generative Adversarial Network (Hydro‐GAN), a novel machine learning‐based method that utilizes modified GANs to enhance boundary accuracy when mapping low‐resolution MODIS data to high‐resolution Landsat‐8 images. We propose a new non‐saturating loss function for the Hydro‐GAN generator, which maximizes the log of discriminator probabilities to promote stable updates and aid convergence. By focusing on reducing squared differences between real and synthetic images, our approach enhances training stability and overall performance. We specifically focus on mapping water bodies using MODIS and Landsat‐8 imagery due to their relevance in water resource management tasks. Our experimental results demonstrate the effectiveness of Hydro‐GAN in generating high‐resolution water body maps, outperforming traditional methods in terms of boundary accuracy and overall quality.
more » « less
Full Text Available
Classification of Major Solar Flares from Extremely Imbalanced Multivariate Time Series Data Using Minimally Random Convolutional Kernel Transform

https://doi.org/10.3390/universe10060234

Saini, Kartik; Alshammari, Khaznah; Hamdi, Shah Muhammad; Filali_Boubrahimi, Soukaina (June 2024, Universe)

Solar flares are characterized by sudden bursts of electromagnetic radiation from the Sun’s surface, and are caused by the changes in magnetic field states in active solar regions. Earth and its surrounding space environment can suffer from various negative impacts caused by solar flares, ranging from electronic communication disruption to radiation exposure-based health risks to astronauts. In this paper, we address the solar flare prediction problem from magnetic field parameter-based multivariate time series (MVTS) data using multiple state-of-the-art machine learning classifiers that include MINImally RandOm Convolutional KErnel Transform (MiniRocket), Support Vector Machine (SVM), Canonical Interval Forest (CIF), Multiple Representations Sequence Learner (Mr-SEQL), and a Long Short-Term Memory (LSTM)-based deep learning model. Our experiment is conducted on the Space Weather Analytics for Solar Flares (SWAN-SF) benchmark data set, which is a partitioned collection of MVTS data of active region magnetic field parameters spanning over nine years of operation of the Solar Dynamics Observatory (SDO). The MVTS instances of the SWAN-SF dataset are labeled by GOES X-ray flux-based flare class labels, and attributed to extreme class imbalance because of the rarity of the major flaring events (e.g., X and M). As a performance validation metric in this class-imbalanced dataset, we used the True Skill Statistic (TSS) score. Finally, we demonstrate the advantages of the MVTS learning algorithm MiniRocket, which outperformed the aforementioned classifiers without the need for essential data preprocessing steps such as normalization, statistical summarization, and class imbalance handling heuristics.
more » « less
Full Text Available
Enhancing Monthly Streamflow Prediction Using Meteorological Factors and Machine Learning Models in the Upper Colorado River Basin

https://doi.org/10.3390/hydrology11050066

Thota, Saichand; Nassar, Ayman; Filali_Boubrahimi, Soukaina; Hamdi, Shah Muhammad; Hosseinzadeh, Pouya (May 2024, Hydrology)

Streamflow prediction is crucial for planning future developments and safety measures along river basins, especially in the face of changing climate patterns. In this study, we utilized monthly streamflow data from the United States Bureau of Reclamation and meteorological data (snow water equivalent, temperature, and precipitation) from the various weather monitoring stations of the Snow Telemetry Network within the Upper Colorado River Basin to forecast monthly streamflow at Lees Ferry, a specific location along the Colorado River in the basin. Four machine learning models—Random Forest Regression, Long short-term memory, Gated Recurrent Unit, and Seasonal AutoRegresive Integrated Moving Average—were trained using 30 years of monthly data (1991–2020), split into 80% for training (1991–2014) and 20% for testing (2015–2020). Initially, only historical streamflow data were used for predictions, followed by including meteorological factors to assess their impact on streamflow. Subsequently, sequence analysis was conducted to explore various input-output sequence window combinations. We then evaluated the influence of each factor on streamflow by testing all possible combinations to identify the optimal feature combination for prediction. Our results indicate that the Random Forest Regression model consistently outperformed others, especially after integrating all meteorological factors with historical streamflow data. The best performance was achieved with a 24-month look-back period to predict 12 months of streamflow, yielding a Root Mean Square Error of 2.25 and R-squared (R2) of 0.80. Finally, to assess model generalizability, we tested the best model at other locations—Greenwood Springs (Colorado River), Maybell (Yampa River), and Archuleta (San Juan) in the basin.
more » « less
Full Text Available
METFORC: Classification with Meta-Learning and Multimodal Stratified Time Series Forest

https://doi.org/10.1109/ICMLA58977.2023.00188

Hosseinzadeh, Pouya; Bahri, Omar; Li, Peiyu; Boubrahimi, Soukaina Filali; Hamdi, Shah Muhammad (December 2023, 2023 International Conference on Machine Learning and Applications (ICMLA))
Adversarial Attack Driven Data Augmentation for Time Series Classification

https://doi.org/10.1109/ICMLA58977.2023.00096

Li, Peiyu; Hosseinzadeh, Pouya; Bahri, Omar; Boubrahimi, Soukaina Filali; Hamdi, Shah Muhammad (December 2023, 2023 International Conference on Machine Learning and Applications (ICMLA))
CELS: Counterfactual Explanations for Time Series Data via Learned Saliency Maps

https://doi.org/10.1109/BigData59044.2023.10386229

Li, Peiyu; Bahri, Omar; Boubrahimi, Soukaïna Filali; Hamdi, Shah Muhammad (December 2023, IEEE)
Shapelet-Preserving Bootstrapping For Time Series Data Augmentation

https://doi.org/10.1109/ICMLA58977.2023.00069

Bahri, Omar; Li, Peiyu; Hosseinzadeh, Pouya; Boubrahimi, Soukaïna Filali; Hamdi, Shah Muhammad (December 2023, IEEE)
ML-Based Streamflow Prediction in the Upper Colorado River Basin Using Climate Variables Time Series Data

https://doi.org/10.3390/hydrology10020029

Hosseinzadeh, Pouya; Nassar, Ayman; Boubrahimi, Soukaina Filali; Hamdi, Shah Muhammad (February 2023, Hydrology)

Streamflow prediction plays a vital role in water resources planning in order to understand the dramatic change of climatic and hydrologic variables over different time scales. In this study, we used machine learning (ML)-based prediction models, including Random Forest Regression (RFR), Long Short-Term Memory (LSTM), Seasonal Auto- Regressive Integrated Moving Average (SARIMA), and Facebook Prophet (PROPHET) to predict 24 months ahead of natural streamflow at the Lees Ferry site located at the bottom part of the Upper Colorado River Basin (UCRB) of the US. Firstly, we used only historic streamflow data to predict 24 months ahead. Secondly, we considered meteorological components such as temperature and precipitation as additional features. We tested the models on a monthly test dataset spanning 6 years, where 24-month predictions were repeated 50 times to ensure the consistency of the results. Moreover, we performed a sensitivity analysis to identify our best-performing model. Later, we analyzed the effects of considering different span window sizes on the quality of predictions made by our best model. Finally, we applied our best-performing model, RFR, on two more rivers in different states in the UCRB to test the model’s generalizability. We evaluated the performance of the predictive models using multiple evaluation measures. The predictions in multivariate time-series models were found to be more accurate, with RMSE less than 0.84 mm per month, R-squared more than 0.8, and MAPE less than 0.25. Therefore, we conclude that the temperature and precipitation of the UCRB increases the accuracy of the predictions. Ultimately, we found that multivariate RFR performs the best among four models and is generalizable to other rivers in the UCRB.
more » « less
Full Text Available
Shapelet-based Temporal Association Rule Mining for Multivariate Time Series Classification

https://doi.org/10.1109/BigData55660.2022.10020478

Bahri, Omar; Li, Peiyu; Boubrahimi, Soukaina Filali; Hamdi, Shah Muhammad (January 2023, 2022 IEEE International Conference on Big Data (Big Data))

Full Text Available
SG-CF: Shapelet-Guided Counterfactual Explanation for Time Series Classification

https://doi.org/10.1109/BigData55660.2022.10020866

Li, Peiyu; Bahri, Omar; Boubrahimi, Soukaina Filali; Hamdi, Shah Muhammad (December 2022, 2022 IEEE International Conference on Big Data (Big Data))

Full Text Available

« Prev Next »

Search for: All records