NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Breaking Time Invariance: Assorted-Time Normalization for RNNs

https://doi.org/10.1007/s11063-024-11442-1

Pospisil, Cole; Zadorozhnyy, Vasily; Ye, Qiang (March 2024, Neural Processing Letters)

Abstract Methods such as Layer Normalization (LN) and Batch Normalization have proven to be effective in improving the training of Recurrent Neural Networks (RNNs). However, existing methods normalize using only the instantaneous information at one particular time step, and the result of the normalization is a preactivation state with a time-independent distribution. This implementation fails to account for certain temporal differences inherent in the inputs and the architecture of RNNs. Since these networks share weights across time steps, it may also be desirable to account for the connections between time steps in the normalization scheme. In this paper, we propose a normalization method called Assorted-Time Normalization (ATN), which preserves information from multiple consecutive time steps and normalizes using them. This setup allows us to introduce longer time dependencies into the traditional normalization methods without introducing any new trainable parameters. We present theoretical derivations for the gradient propagation and prove the weight scaling invariance property. Our experiments applying ATN to LN demonstrate consistent improvement on various tasks, such as Adding, Copying, and Denoise Problems and Language Modeling Problems.
more » « less
CorrGAN: Simultaneous Learning of Speech Enhancement and Perceptual Quality Loss Functions

https://doi.org/10.1109/ICASSP49660.2025.10887633

Zadorozhnyy, Vasily; Amizadeh, Saeed; Ye, Qiang; Koishida, Kazuhito (April 2025, IEEE)

Deep-learning models have allowed effective end-to-end SE systems in the Speech Enhancement (SE) field. Most of these methods are trained using a fixed reconstruction loss in a supervised setting. Often these losses do not perfectly represent the desired perceptual quality metrics, resulting in sub-optimal performance. Recently, there have been efforts to learn the behavior of those metrics directly via neural nets for training SE models. However, an accurate estimation of the true metric function introduces statistical complexity for training because it attempts to capture the exact value of the metric. We propose an adversarial training strategy based on statistical correlation that avoids the complexity of estimating the SE metric while learning to mimic its overall behavior. We call this framework CorrGAN and show its significant improvement over standard losses of the SOTA baselines and achieve SOTA performance on the VoiceBank+DEMAND dataset.
more » « less
Free, publicly-accessible full text available April 6, 2026
Human-robot collaborative assembly and welding: A review and analysis of the state of the art

https://doi.org/10.1016/j.jmapro.2024.09.044

Cao, Yue; Zhou, Quan; Yuan, Wei; Ye, Qiang; Popa, Dan; Zhang, YuMing (December 2024, Journal of Manufacturing Processes)

Free, publicly-accessible full text available December 1, 2025
Orthogonal Gated Recurrent Unit With Neumann-Cayley Transformation

https://doi.org/10.1162/neco_a_01710

Zadorozhnyy, Vasily; Mucllari, Edison; Pospisil, Cole; Nguyen, Duc; Ye, Qiang (November 2024, Neural Computation)

In recent years, using orthogonal matrices has been shown to be a promising approach to improving recurrent neural networks (RNNs) with training, stability, and convergence, particularly to control gradients. While gated recurrent unit (GRU) and long short-term memory (LSTM) architectures address the vanishing gradient problem by using a variety of gates and memory cells, they are still prone to the exploding gradient problem. In this work, we analyze the gradients in GRU and propose the use of orthogonal matrices to prevent exploding gradient problems and enhance long-term memory. We study where to use orthogonal matrices and propose a Neumann series–based scaled Cayley transformation for training orthogonal matrices in GRU, which we call Neumann-Cayley orthogonal GRU (NC-GRU). We present detailed experiments of our model on several synthetic and real-world tasks, which show that NC-GRU significantly outperforms GRU and several other RNNs.
more » « less
Free, publicly-accessible full text available November 19, 2025
Modeling imaged welding process dynamic behaviors using Generative Adversarial Network (GAN) for a new foundation to monitor weld penetration using deep learning

https://doi.org/10.1016/j.jmapro.2024.05.081

Mucllari, Edison; Cao, Yue; Ye, Qiang; Zhang, YuMing (June 2024, Journal of Manufacturing Processes)

Full Text Available
SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks

https://doi.org/10.21437/Interspeech.2023-456

Zadorozhnyy, Vasily; Ye, Qiang; Koishida, Kazuhito (August 2023, ISCA)

Full Text Available
Do We Need a New Foundation to Use Deep Learning to Monitor Weld Penetration?

https://doi.org/10.1109/LRA.2023.3270038

Mucllari, Edison; Yu, Rui; Cao, Yue; Ye, Qiang; Zhang, YuMing (June 2023, IEEE Robotics and Automation Letters)

Full Text Available
Novel Molecular Representations Using Neumann-Cayley Orthogonal Gated Recurrent Unit

https://doi.org/10.1021/acs.jcim.2c01526

Mucllari, Edison; Zadorozhnyy, Vasily; Ye, Qiang; Nguyen, Duc Duy (May 2023, Journal of Chemical Information and Modeling)

Full Text Available
Deep learning based real-time and in-situ monitoring of weld penetration: Where we are and what are needed revolutionary solutions?

https://doi.org/10.1016/j.jmapro.2023.03.011

Yu, Rui; Cao, Yue; Chen, Heping; Ye, Qiang; Zhang, YuMing (May 2023, Journal of Manufacturing Processes)

Full Text Available
Improving Deep Neural Networks’ Training for Image Classification With Nonlinear Conjugate Gradient-Style Adaptive Momentum

https://doi.org/10.1109/TNNLS.2023.3255783

Wang, Bao; Ye, Qiang (March 2023, IEEE Transactions on Neural Networks and Learning Systems)

Full Text Available

« Prev Next »

Search for: All records