NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Preconditioning for accurate solutions of ill‐conditioned linear systems

https://doi.org/10.1002/nla.2315

Ye, Qiang (June 2020, Numerical Linear Algebra with Applications)

Summary This article develops the preconditioning technique as a method to address the accuracy issue caused by ill‐conditioning. Given a preconditionerMfor an ill‐conditioned linear systemAx=b, we show that, if the inverse of the preconditionerM⁻¹can be applied to vectorsaccurately, then the linear system can be solvedaccurately. A stability concept calledinverse‐equivalentaccuracy is introduced to describe the high accuracy that is achieved and an error analysis will be presented. Numerical examples are presented to illustrate the error analysis and the performance of the methods.
more » « less
SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks

https://doi.org/10.21437/Interspeech.2023-456

Zadorozhnyy, Vasily; Ye, Qiang; Koishida, Kazuhito (August 2023, ISCA)

Full Text Available
A method for computing a few eigenpairs of large generalized eigenvalue problems

https://doi.org/10.1016/j.apnum.2022.08.018

Alkilayh, Maged; Reichel, Lothar; Ye, Qiang (January 2023, Applied Numerical Mathematics)

Full Text Available
FiberSim: A flexible open-source model of myofilament-level contraction

https://doi.org/10.1016/j.bpj.2021.12.021

Kosta, Sarah; Colli, Dylan; Ye, Qiang; Campbell, Kenneth S. (January 2022, Biophysical Journal)

Full Text Available
AUTM Flow: Atomic Unrestricted Time Machine for Monotonic Normalizing Flows

Cai, D.; Ji, Y.; He, H.; Ye, Q. (January 2022, Uncertainty in artificial intelligence)

Nonlinear monotone transformations are used extensively in normalizing flows to construct invertible triangular mappings from simple distributions to complex ones. In existing literature, monotonicity is usually enforced by restricting function classes or model parameters and the inverse transformation is often approximated by root-finding algorithms as a closed-form inverse is unavailable. In this paper, we introduce a new integral-based approach termed: Atomic Unrestricted Time Machine (AUTM), equipped with unrestricted integrands and easy-to-compute explicit inverse. AUTM offers a versatile and efficient way to the design of normalizing flows with explicit inverse and unrestricted function classes or parameters. Theoretically, we present a constructive proof that AUTM is universal: all monotonic normalizing flows can be viewed as limits of AUTM flows. We provide a concrete example to show how to approximate any given monotonic normalizing flow using AUTM flows with guaranteed convergence. Our result implies that AUTM can be used to transform an existing flow into a new one equipped with explicit inverse and unrestricted parameters. The performance of the new approach is evaluated on high dimensional density estimation, variational inference and image generation.
more » « less
Full Text Available
Batch Normalization Preconditioning for Neural Network Training

Lange, S.; Helfrich, K.; Ye, Q. (January 2022, Journal of machine learning research)

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well understood. It is not suitable for use with very small mini-batch sizes or online learning. In this paper, we propose a new method called Batch Normalization Preconditioning (BNP). Instead of applying normalization explicitly through a batch normalization layer as is done in BN, BNP applies normalization by conditioning the parameter gradients directly during training. This is designed to improve the Hessian matrix of the loss function and hence convergence during training. One benefit is that BNP is not constrained on the mini-batch size and works in the online learning setting. Furthermore, its connection to BN provides theoretical insights on how BN improves training and how BN is applied to special architectures such as convolutional neural networks. For a theoretical foundation, we also present a novel Hessian condition number based convergence theory for a locally convex but not strong-convex loss, which is applicable to networks with a scale-invariant property.
more » « less
Full Text Available
Adaptive Weighted Discriminator for Training Generative Adversarial Networks

Zadorozhnyy, Vasily; Cheng, Qiang; Ye, Qiang (June 2021, IEEE Conference on Computer Vision and Pattern Recognition)
null (Ed.)
Full Text Available
A robust deep learning approach for automatic classification of seizures against non-seizures

https://doi.org/10.1016/j.bspc.2020.102215

Yao, Xinghua; Li, Xiaojin; Ye, Qiang; Huang, Yan; Cheng, Qiang; Zhang, Guo-Qiang (February 2021, Biomedical Signal Processing and Control)

Full Text Available
On the regularization of convolutional kernel tensors in neural networks

https://doi.org/10.1080/03081087.2020.1795058

Guo, Pei-Chang; Ye, Qiang (July 2020, Linear and Multilinear Algebra)

Full Text Available
Eigenvalue Normalized Recurrent Neural Networks for Short Term Memory

https://doi.org/10.1609/aaai.v34i04.5831

Helfrich, Kyle; Ye, Qiang (June 2020, Proceedings of the AAAI Conference on Artificial Intelligence)

Several variants of recurrent neural networks (RNNs) with orthogonal or unitary recurrent matrices have recently been developed to mitigate the vanishing/exploding gradient problem and to model long-term dependencies of sequences. However, with the eigenvalues of the recurrent matrix on the unit circle, the recurrent state retains all input information which may unnecessarily consume model capacity. In this paper, we address this issue by proposing an architecture that expands upon an orthogonal/unitary RNN with a state that is generated by a recurrent matrix with eigenvalues in the unit disc. Any input to this state dissipates in time and is replaced with new inputs, simulating short-term memory. A gradient descent algorithm is derived for learning such a recurrent matrix. The resulting method, called the Eigenvalue Normalized RNN (ENRNN), is shown to be highly competitive in several experiments.
more » « less
Full Text Available

« Prev Next »

Search for: All records