NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Breaking Time Invariance: Assorted-Time Normalization for RNNs

https://doi.org/10.1007/s11063-024-11442-1

Pospisil, Cole; Zadorozhnyy, Vasily; Ye, Qiang (March 2024, Neural Processing Letters)

Abstract Methods such as Layer Normalization (LN) and Batch Normalization have proven to be effective in improving the training of Recurrent Neural Networks (RNNs). However, existing methods normalize using only the instantaneous information at one particular time step, and the result of the normalization is a preactivation state with a time-independent distribution. This implementation fails to account for certain temporal differences inherent in the inputs and the architecture of RNNs. Since these networks share weights across time steps, it may also be desirable to account for the connections between time steps in the normalization scheme. In this paper, we propose a normalization method called Assorted-Time Normalization (ATN), which preserves information from multiple consecutive time steps and normalizes using them. This setup allows us to introduce longer time dependencies into the traditional normalization methods without introducing any new trainable parameters. We present theoretical derivations for the gradient propagation and prove the weight scaling invariance property. Our experiments applying ATN to LN demonstrate consistent improvement on various tasks, such as Adding, Copying, and Denoise Problems and Language Modeling Problems.
more » « less
PROTECT: Protein circadian time prediction using unsupervised learning

Ogholbake, AA; Cheng, Q (August 2025, iScience)

accepted
more » « less
Free, publicly-accessible full text available August 14, 2026
CausalGeD: Blending Causality and Diffusion for Spatial Gene Expression Generation, ACM Knowledge Discovery and Data Mining

https://doi.org/10.1145/3711896.3736875

Sadia, Rabeya Tus; Ahamed, Md Atik; Cheng, Qiang (August 2025, ACM)

Free, publicly-accessible full text available August 3, 2026
Generative adversarial networks (GAN) model for dynamically adjusted weld pool image toward human-based model predictive control (MPC)

https://doi.org/10.1016/j.jmapro.2025.02.053

Li, Tianpu; Cao, Yue; Ye, Qiang; Zhang, YuMing (May 2025, Journal of Manufacturing Processes)

Gas Metal Arc Welding (GMAW) is a critical industrial technique known for its high productivity, flexibility, and adaptability to automation. Despite the significant advancements in robotic welding, challenges remain in fully automating the arc welding process, particularly due to the complex dynamics of the weld pool associated with GMAW. A human-robot collaborative (HRC) system where humans operate robots may conveniently provide the needed adaptive control to the complex GMAW. While in conventional HRC systems humans receive process feedback to make adaptive adjustments, we propose provide humans with predictive future feedback to further ease the human decision and reduce the needed skills/trainings. To this end, this study explores the integration of deep learning models, specifically Generative Adversarial Networks (GANs) combined with Gated Recurrent Units (GRUs), to model and predict the dynamic behavior of the weld pool during GMAW. By leveraging time-series data of torch movement and corresponding weld pool images, the proposed GRU-GAN model generates high-fidelity weld pool images, capturing the intricate relationship between speed variations and weld pool morphology. Through extensive experimentation, including the design of an acceptable Encoder-Decoder structure for the GAN, we demonstrate that incorporating both temporal and speed sequence information significantly enhances the model's predictive capabilities. The findings validate the hypothesis that dynamic torch speed adjustments, akin to those performed by skilled human welders, can be effectively modeled to improve the quality of automated welding processes. Future work will be devoted to human-based model predictive control (MPC) in an HRC environment.
more » « less
Free, publicly-accessible full text available May 1, 2026
CorrGAN: Simultaneous Learning of Speech Enhancement and Perceptual Quality Loss Functions

https://doi.org/10.1109/ICASSP49660.2025.10887633

Zadorozhnyy, Vasily; Amizadeh, Saeed; Ye, Qiang; Koishida, Kazuhito (April 2025, IEEE)

Deep-learning models have allowed effective end-to-end SE systems in the Speech Enhancement (SE) field. Most of these methods are trained using a fixed reconstruction loss in a supervised setting. Often these losses do not perfectly represent the desired perceptual quality metrics, resulting in sub-optimal performance. Recently, there have been efforts to learn the behavior of those metrics directly via neural nets for training SE models. However, an accurate estimation of the true metric function introduces statistical complexity for training because it attempts to capture the exact value of the metric. We propose an adversarial training strategy based on statistical correlation that avoids the complexity of estimating the SE metric while learning to mimic its overall behavior. We call this framework CorrGAN and show its significant improvement over standard losses of the SOTA baselines and achieve SOTA performance on the VoiceBank+DEMAND dataset.
more » « less
Free, publicly-accessible full text available April 6, 2026
TSCMamba: Mamba meets multi-view learning for time series classification

https://doi.org/10.1016/j.inffus.2025.103079

Ahamed, M A; Cheng, Q (March 2025, Information Fusion)

Free, publicly-accessible full text available March 20, 2026
Human-robot collaborative assembly and welding: A review and analysis of the state of the art

https://doi.org/10.1016/j.jmapro.2024.09.044

Cao, Yue; Zhou, Quan; Yuan, Wei; Ye, Qiang; Popa, Dan; Zhang, YuMing (December 2024, Journal of Manufacturing Processes)

Free, publicly-accessible full text available December 1, 2025
Orthogonal Gated Recurrent Unit With Neumann-Cayley Transformation

https://doi.org/10.1162/neco_a_01710

Zadorozhnyy, Vasily; Mucllari, Edison; Pospisil, Cole; Nguyen, Duc; Ye, Qiang (November 2024, Neural Computation)

In recent years, using orthogonal matrices has been shown to be a promising approach to improving recurrent neural networks (RNNs) with training, stability, and convergence, particularly to control gradients. While gated recurrent unit (GRU) and long short-term memory (LSTM) architectures address the vanishing gradient problem by using a variety of gates and memory cells, they are still prone to the exploding gradient problem. In this work, we analyze the gradients in GRU and propose the use of orthogonal matrices to prevent exploding gradient problems and enhance long-term memory. We study where to use orthogonal matrices and propose a Neumann series–based scaled Cayley transformation for training orthogonal matrices in GRU, which we call Neumann-Cayley orthogonal GRU (NC-GRU). We present detailed experiments of our model on several synthetic and real-world tasks, which show that NC-GRU significantly outperforms GRU and several other RNNs.
more » « less
Free, publicly-accessible full text available November 19, 2025
TimeMachine: A Time Series is Worth 4 Mambas for Long-Term Forecasting

https://doi.org/10.3233/FAIA240677

Ahamed, Md Atik; Cheng, Qiang (October 2024, IOS Press)

Long-term time-series forecasting remains challenging due to the difficulty in capturing long-term dependencies, achieving linear scalability, and maintaining computational efficiency. We introduce TimeMachine, an innovative model that leverages Mamba, a state-space model, to capture long-term dependencies in multivariate time series data while maintaining linear scalability and small memory footprints. TimeMachine exploits the unique properties of time series data to produce salient contextual cues at multi-scales and leverage an innovative integrated quadruple-Mamba architecture to unify the handling of channel-mixing and channel-independence situations, thus enabling effective selection of contents for prediction against global and local contexts at different scales. Experimentally, TimeMachine achieves superior performance in prediction accuracy, scalability, and memory efficiency, as extensively validated using benchmark datasets. Code availability: https://github.com/Atik-Ahamed/TimeMachine
more » « less
Full Text Available
MambaTab: A Plug-and-Play Model for Learning Tabular Data

https://doi.org/10.1109/MIPR62202.2024.00065

Ahamed, Md Atik; Cheng, Qiang (August 2024, IEEE)

Full Text Available

« Prev Next »

Search for: All records