NSF PAR Search | NSF Public Access Repository


Task Aware Modulation using Representation Learning: An Approach for Few Shot Learning in Environmental Systems

Renganathan, Arvind; Ghosh, Rahul; Khandelwal, Ankush; Kumar, Vipin (May 2025, Rundbrief)

TAM-RL (Task Aware Modulation using Representation Learning) is a novel multimodal meta-learning framework for few-shot learning in heterogeneous systems, designed for science and engineering problems where entities share a common underlying forward model but exhibit heterogeneity due to entity-specific characteristics. TAM-RL leverages an amortized training process with a modulation network and a base network to learn task-specific modulation parameters, enabling efficient adaptation to new tasks with limited data. We evaluate TAM-RL on two real-world environmental datasets: Gross Primary Product (GPP) prediction and streamflow forecasting, demonstrating significant improvements over existing meta-learning methods. On the FLUXNET dataset, TAM-RL improves RMSE by 18.9% over MMAML with just one month of few-shot data, while for streamflow prediction, it achieves an 8.21% improvement with one year of data. Synthetic data experiments further validate TAM-RL’s superior performance in heterogeneous task distributions, outperforming the baselines in the most heterogeneous setting. Notably, TAM-RL offers substantial computational efficiency, with at least 3x faster training times compared to gradient-based meta-learning approaches while being much simpler to train due to reduced complexity. Ablation studies highlight the importance of pretraining and adaptation mechanisms in TAM-RL’s performance.
Keywords: Representation Learning, meta-learning, few-shot learning, environmental applications, time-series.
DOI: 10.1137/1.9781611978520.2
Free, publicly-accessible full text available May 1, 2026
Task Aware Modulation using Representation Learning: An Approach for Few Shot Learning in Environmental Systems

Renganathan, Arvind; Ghosh, Rahul; Khandelwal, Ankush; Kumar, Vipin (May 2025, SIAM)

Free, publicly-accessible full text available May 1, 2026
Combining self-supervision and privileged information for representation learning from tabular data

https://doi.org/10.1007/s10115-025-02418-1

Yang, Haoyu; Steinbach, Michael; Melton, Genevieve; Kumar, Vipin; Simon, Gyorgy (April 2025, Knowledge and Information Systems)

When building predictive models for real-world applications, many data are discarded because conventional learning algorithms cannot utilize them, although such data could be very informative. This paper focuses on representation learning using two types of additional data: privileged information (PI) and unlabeled data. PI refers to data available only during training but not at test time. Existing methods transfer the knowledge embedded in PI via supervised mechanisms, making them unable to use unlabeled data. In contrast, self-supervised learning methods can use unlabeled data but cannot learn from PI. While these techniques appear complementary, as we demonstrate, combining them is non-trivial. This paper introduces the privileged information regularized (PIReg) self-supervised learning framework, which utilizes both PI and unlabeled data to learn better representations.
Time series predictions in unmonitored sites: a survey of machine learning techniques in water resources

https://doi.org/10.1017/eds.2024.14

Willard, Jared D; Varadharajan, Charuleka; Jia, Xiaowei; Kumar, Vipin (January 2025, Environmental Data Science)

Prediction of dynamic environmental variables in unmonitored sites remains a long-standing challenge for water resources science. The majority of the world’s freshwater resources have inadequate monitoring of critical environmental variables needed for management. Yet, the need to have widespread predictions of hydrological variables such as river flow and water quality has become increasingly urgent due to climate and land use change over the past decades, and their associated impacts on water resources. Modern machine learning methods increasingly outperform their process-based and empirical model counterparts for hydrologic time series prediction with their ability to extract information from large, diverse data sets. We review relevant state-of-the-art applications of machine learning for streamflow, water quality, and other water resources prediction and discuss opportunities to improve the use of machine learning with emerging methods for incorporating watershed characteristics and process knowledge into classical, deep learning, and transfer learning methodologies. The analysis here suggests most prior efforts have been focused on deep learning frameworks built on many sites for predictions at daily time scales in the United States, but that comparisons between different classes of machine learning methods are few and inadequate. We identify several open questions for time series predictions in unmonitored sites that include incorporating dynamic inputs and site characteristics, mechanistic understanding and spatial context, and explainable AI techniques in modern machine learning frameworks.
Free, publicly-accessible full text available January 1, 2026
Vacuum-assisted extrusion to reduce internal porosity in large-format additive manufacturing

https://doi.org/10.1016/j.addma.2024.104612

Mattingly, Frye; Kumar, Vipin; Chawla, Komal; Bras, Wim; Kunc, Vlastimil; Duty, Chad (January 2025, Additive Manufacturing)

Free, publicly-accessible full text available January 5, 2026
ExoTST: Exogenous-Aware Temporal Sequence Transformer for Time Series Prediction

https://doi.org/10.1109/ICDM59182.2024.00105

Tayal, Kshitij; Renganathan, Arvind; Jia, Xiaowei; Kumar, Vipin; Lu, Dan (December 2024, Proceedings)

Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. Traditional time series approaches for prediction often focus on either autoregressive modeling, which relies solely on past observations of the target “endogenous variables”, or forward modeling, which considers only current covariate drivers “exogenous variables”. However, effectively integrating past endogenous and past exogenous with current exogenous variables remains a significant challenge. In this paper, we propose ExoTST, a novel transformer-based framework that effectively incorporates current exogenous variables alongside past context for improved time series prediction. To integrate exogenous information efficiently, ExoTST leverages the strengths of attention mechanisms and introduces a novel cross-temporal modality fusion module. This module enables the model to jointly learn from both past and current exogenous series, treating them as distinct modalities. By considering these series separately, ExoTST provides robustness and flexibility in handling data uncertainties that arise from the inherent distribution shift between historical and current exogenous variables. Extensive experiments on real-world carbon flux datasets and time series benchmarks demonstrate ExoTST's superior performance compared to state-of-the-art baselines, with improvements of up to 10% in prediction accuracy. Moreover, ExoTST exhibits strong robustness against missing values and noise in exogenous drivers, maintaining consistent performance in real-world situations where these imperfections are common.
Free, publicly-accessible full text available December 9, 2025
Effect of plasma treatment on LMPAEK/CF tape and composites manufactured by automated tape placement (ATP)

https://doi.org/10.1016/j.compositesa.2024.108540

Chahine, Georges; Marathe, Umesh; Collins, Liam; Thomas, Vinoy; Kumar, Vipin; Arabi Hassen, Ahmed; Tekinalp, Halil; Ozcan, Soydan; Vaidya, Uday (October 2024, Composites Part A: Applied Science and Manufacturing)

Full Text Available
Message Propagation Through Time: An Algorithm for Sequence Dependency Retention in Time Series Modeling

https://doi.org/10.1137/1.9781611978032.35

Xu, Shaoming; Khandelwal, Ankush; Renganathan, Arvind; Kumar, Vipin (April 2024, SIAM International Conference on Data Mining (SDM24))

Time series modeling, a crucial area in science, often encounters challenges when training Machine Learning (ML) models like Recurrent Neural Networks (RNNs) using the conventional mini-batch training strategy that assumes independent and identically distributed (IID) samples and initializes RNNs with zero hidden states. The IID assumption ignores temporal dependencies among samples, resulting in poor performance. This paper proposes the Message Propagation Through Time (MPTT) algorithm to effectively incorporate long temporal dependencies while preserving faster training times relative to the stateful algorithms. MPTT utilizes two memory modules to asynchronously manage initial hidden states for RNNs, fostering seamless information exchange between samples and allowing diverse mini-batches throughout epochs. MPTT further implements three policies to filter outdated and preserve essential information in the hidden states to generate informative initial hidden states for RNNs, facilitating robust training. Experimental results demonstrate that MPTT outperforms seven strategies on four climate datasets with varying levels of temporal dependencies.
Full Text Available
Knowledge-guided machine learning can improve carbon cycle quantification in agroecosystems

https://doi.org/10.1038/s41467-023-43860-5

Liu, Licheng; Zhou, Wang; Guan, Kaiyu; Peng, Bin; Xu, Shaoming; Tang, Jinyun; Zhu, Qing; Till, Jessica; Jia, Xiaowei; Jiang, Chongya; et al (December 2024, Nature Communications)

Accurate and cost-effective quantification of the carbon cycle for agroecosystems at decision-relevant scales is critical to mitigating climate change and ensuring sustainable food production. However, conventional process-based or data-driven modeling approaches alone have large prediction uncertainties due to the complex biogeochemical processes to model and the lack of observations to constrain many key state and flux variables. Here we propose a Knowledge-Guided Machine Learning (KGML) framework that addresses the above challenges by integrating knowledge embedded in a process-based model, high-resolution remote sensing observations, and machine learning (ML) techniques. Using the U.S. Corn Belt as a testbed, we demonstrate that KGML can outperform conventional process-based and black-box ML models in quantifying carbon cycle dynamics. Our high-resolution approach quantitatively reveals 86% more spatial detail of soil organic carbon changes than conventional coarse-resolution approaches. Moreover, we outline a protocol for improving KGML via various paths, which can be generalized to develop hybrid models to better predict complex earth system dynamics.
Free, publicly-accessible full text available December 1, 2025
Combining Satellite and Weather Data for Crop Type Mapping: An Inverse Modelling Approach

Ghosh, Rahul; Khandelwal, Ankush; Jia, Xiaowei; Mulla, David; Kumar, Vipin (April 2024, 2024 SIAM International Conference on Data Mining (SDM))

Accurate and timely crop mapping is essential for yield estimation, insurance claims, and conservation efforts. Over the years, many successful machine learning models for crop mapping have been developed that use just the multispectral imagery from satellites to predict crop type over the area of interest. However, these traditional methods do not account for the physical processes that govern crop growth. At a high level, crop growth can be envisioned as physical parameters, such as weather and soil type, acting upon the plant, leading to crop growth, which can be observed via satellites. In this paper, we propose a Weather-based Spatio-Temporal segmentation network with ATTention (WSTATT), a deep learning model that leverages this understanding of crop growth by formulating it as an inverse model that combines weather (Daymet) and satellite imagery (Sentinel-2) to generate accurate crop maps. We show that our approach provides significant improvements over existing algorithms that solely rely on spectral imagery by comparing segmentation maps and F1 classification scores. Furthermore, effective use of attention in WSTATT architecture enables the detection of crop types earlier in the season (up to 5 months in advance), which is very useful for improving food supply projections. We finally discuss the impact of weather by correlating our results with crop phenology to show that WSTATT is able to capture the physical properties of crop growth.
Full Text Available
