NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ExoTST: Exogenous-Aware Temporal Sequence Transformer for Time Series Prediction

https://doi.org/10.1109/ICDM59182.2024.00105

Tayal, Kshitij; Renganathan, Arvind; Jia, Xiaowei; Kumar, Vipin; Lu, Dan (December 2024, Proceedings)

Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. Traditional time series approaches for prediction often focus on either autoregressive modeling, which relies solely on past observations of the target “endogenous variables”, or forward modeling, which considers only current covariate drivers “exogenous variables”. However, effectively integrating past endogenous and past exogenous with current exogenous variables remains a significant challenge. In this paper, we propose ExoTST, a novel transformer-based framework that effectively incorporates current exogenous variables alongside past context for improved time series prediction. To integrate exogenous information efficiently, ExoTST leverages the strengths of attention mechanisms and introduces a novel cross-temporal modality fusion module. This module enables the model to jointly learn from both past and current exogenous series, treating them as distinct modalities. By considering these series separately, ExoTST provides robustness and flexibility in handling data uncertainties that arise from the inherent distribution shift between historical and current exogenous variables. Extensive experiments on real-world carbon flux datasets and time series benchmarks demonstrate ExoTST's superior performance compared to state-of-the-art baselines, with improvements of up to 10% in prediction accuracy. Moreover, ExoTST exhibits strong robustness against missing values and noise in exogenous drivers, maintaining consistent performance in real-world situations where these imperfections are common.
more » « less
Free, publicly-accessible full text available December 9, 2025
Koopman Invertible Autoencoder: Leveraging Forward and Backward Dynamics for Temporal Modeling

https://doi.org/10.1109/ICDM58522.2023.00068

Tayal, Kshitij; Renganathan, Arvind; Ghosh, Rahul; Jia, Xiaowei; Kumar, Vipin (December 2023, IEEE ICDM 2024, 23rd IEEE International Conference on Data Mining)

Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. However, building accurate long-term prediction models remains challenging due to the limitations of existing temporal models like recurrent neural networks (RNNs), as they capture only the statistical connections in the training data and may fail to learn the underlying dynamics of the target system. To tackle this challenge, we propose a novel machine learning model based on Koopman operator theory, which we call Koopman Invertible Autoencoders (KIA), that captures the inherent characteristic of the system by modeling both forward and backward dynamics in the infinite-dimensional Hilbert space. This enables us to efficiently learn low-dimensional representations, resulting in more accurate predictions of long-term system behavior. Moreover, our method’s invertibility design enforces reversibility and consistency in both forward and inverse operations. We illustrate the utility of KIA on pendulum and climate datasets, demonstrating 300% improvements in long-term prediction capability for pendulum while maintaining robustness against noise. Additionally, our method demonstrates the ability to better comprehend the intricate dynamics of the climate system when compared to existing Koopman-based methods.
more » « less
Full Text Available
Koopman Invertible Autoencoder: Leveraging Forward and Backward Dynamics for Temporal Modeling (Selected as one of the best-ranked papers for possible publication in the journal Knowledge and Information Systems.)

Tayal, Kshitij; Renganathan, Arvind; Ghosh, Rahul; Jia, Xiaowei; Kumar, Vipin (December 2023, IEEE International Conference on Data Mining (ICDM))

Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. However, building accurate long-term prediction models remains challenging due to the limitations of existing temporal models like recurrent neural networks (RNNs), as they capture only the statistical connections in the training data and may fail to learn the underlying dynamics of the target system. To tackle this challenge, we propose a novel machine learning model based on Koopman operator theory, which we call Koopman Invertible Autoencoders (KIA), that captures the inherent characteristic of the system by modeling both forward and backward dynamics in the infinite-dimensional Hilbert space. This enables us to efficiently learn low-dimensional representations, resulting in more accurate predictions of long-term system behavior. Moreover, our method’s invertibility design enforces reversibility and consistency in both forward and inverse operations. We illustrate the utility of KIA on pendulum and climate datasets, demonstrating 300% improvements in long-term prediction capability for pendulum while maintaining robustness against noise. Additionally, our method demonstrates the ability to better comprehend the intricate dynamics of the climate system when compared to existing Koopman-based methods.
more » « less
Full Text Available
Koopman Invertible Autoencoder: Leveraging Forward and Backward Dynamics for Temporal Modeling

https://doi.org/10.1109/ICDM58522.2023.00068

Tayal, Kshitij; Renganathan, Arvind; Ghosh, Rahul; Jia, Xiaowei; Kumar, Vipin (December 2023, IEEE)
Meta-Transfer Learning: An application to Streamflow modeling in River-streams

https://doi.org/10.1109/ICDM54844.2022.00026

Ghosh, Rahul; Li, Bangyan; Tayal, Kshitij; Kumar, Vipin; Jia, Xiaowei (November 2022, 2022 IEEE International Conference on Data Mining (ICDM))

Full Text Available
Robust Inverse Framework using Knowledge-guided Self-Supervised Learning: An application to Hydrology

https://doi.org/10.1145/3534678.3539448

Ghosh, Rahul; Renganathan, Arvind; Tayal, Kshitij; Li, Xiang; Khandelwal, Ankush; Jia, Xiaowei; Duffy, Christopher; Nieber, John; Kumar, Vipin (August 2022, KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Machine Learning is beginning to provide state-of-the-art performance in a range of environmental applications such as streamflow prediction in a hydrologic basin. However, building accurate broad-scale models for streamflow remains challenging in practice due to the variability in the dominant hydrologic processes, which are best captured by sets of process-related basin characteristics. Existing basin characteristics suffer from noise and uncertainty, among many other things, which adversely impact model performance. To tackle the above challenges, in this paper, we propose a novel Knowledge-guided Self-Supervised Learning (KGSSL) inverse framework to extract system characteristics from driver(input) and response(output) data. This first-of-its-kind framework achieves robust performance even when characteristics are corrupted or missing. We evaluate the KGSSL framework in the context of stream flow modeling using CAMELS (Catchment Attributes and MEteorology for Large-sample Studies) which is a widely used hydrology benchmark dataset. Specifically, KGSSL outperforms baseline by 16% in predicting missing characteristics. Furthermore, in the context of forward modelling, KGSSL inferred characteristics provide a 35% improvement in performance over a standard baseline when the static characteristic are unknown.
more » « less
Full Text Available
Model-agnostic Methods for Text Classification with Inherent Noise

https://doi.org/10.18653/v1/2020.coling-industry.19

Tayal, Kshitij; Ghosh, Rahul; Kumar, Vipin (December 2020, 28th International Conference on Computational Linguistics: Industry Track)
null (Ed.)
Text classification is a fundamental problem, and recently, deep neural networks (DNN) have shown promising results in many natural language tasks. However, their human-level performance relies on high-quality annotations, which are time-consuming and expensive to collect. As we move towards large inexpensive datasets, the inherent label noise degrades the generalization of DNN. While most machine learning literature focuses on building complex networks to handle noise, in this work, we evaluate model-agnostic methods to handle inherent noise in large scale text classification that can be easily incorporated into existing machine learning workflows with minimal interruption. Specifically, we conduct a point-by-point comparative study between several noise-robust methods on three datasets encompassing three popular classification models. To our knowledge, this is the first time such a comprehensive study in text classification encircling popular models and model-agnostic loss methods has been conducted. In this study, we describe our learning and demonstrate the application of our approach, which outperformed baselines by up to 10% in classification accuracy while requiring no network modifications.
more » « less
Full Text Available
End to End learning for Phase Retrieval

Manekar, Raunak; Tayal, Kshitij; Kumar, Vipin; Sun, Ju (July 2020, ICML workshop on ML Interpretability for Scientific Discovery)

We consider the end-to-end deep learning approach for phase retrieval, a central problem in scientific imaging. We highlight a fundamental difficulty for learning that previous work has neglected, likely due to the biased datasets they use for training and evaluation. We propose a simple yet different formulation for PR that seems to overcome the difficulty and return consistently better qualitative results.
more » « less
Full Text Available
Inverse Problems, Deep Learning, and Symmetry Breaking

Tayal, Kshitij; Lai, Chieh-Hsin; Manekar, Raunak; Kumar, Vipin; Sun, Ju (July 2020, ICML workshop on ML Interpretability for Scientific Discovery)

In many physical systems, inputs related by intrinsic system symmetries are mapped to the same output. When inverting such physical systems, i.e., solving the associated inverse problems, there is no unique solution. This causes fundamental difficulty in deploying the emerging end-to-end deep learning approach. Using the generalized phase retrieval problem as an illustrative example, we show that careful symmetry breaking on training data can help remove the difficulty and significantly improve the learning performance. We also extract and highlight the underlying mathematical principle of the proposed solution, which is directly applicable to other inverse problems. A full-length version of this paper can be found at https://arxiv.org/abs/2003.09077.
more » « less
Full Text Available
Regionalization in a Global Hydrologic Deep Learning Model: From Physical Descriptors to Random Vectors

https://doi.org/10.1029/2021WR031794

Li, Xiang; Khandelwal, Ankush; Jia, Xiaowei; Cutler, Kelly; Ghosh, Rahul; Renganathan, Arvind; Xu, Shaoming; Tayal, Kshitij; Nieber, John; Duffy, Christopher; et al (August 2022, Water Resources Research)

Abstract Streamflow prediction is a long‐standing hydrologic problem. Development of models for streamflow prediction often requires incorporation of catchment physical descriptors to characterize the associated complex hydrological processes. Across different scales of catchments, these physical descriptors also allow models to extrapolate hydrologic information from one catchment to others, a process referred to as “regionalization”. Recently, in gauged basin scenarios, deep learning models have been shown to achieve state of the art regionalization performance by building a global hydrologic model. These models predict streamflow given catchment physical descriptors and weather forcing data. However, these physical descriptors are by their nature uncertain, sometimes incomplete, or even unavailable in certain cases, which limits the applicability of this approach. In this paper, we show that by assigning a vector of random values as a surrogate for catchment physical descriptors, we can achieve robust regionalization performance under a gauged prediction scenario. Our results show that the deep learning model using our proposed random vector approach achieves a predictive performance comparable to that of the model using actual physical descriptors. The random vector approach yields robust performance under different data sparsity scenarios and deep learning model selections. Furthermore, based on the use of random vectors, high‐dimensional characterization improves regionalization performance in gauged basin scenario when physical descriptors are uncertain, or insufficient.
more » « less

Search for: All records