Search for: All records

Award ID contains: 1845799

« Prev Next »

Total Resources

15

Resource Type
Conference Paper

2

Conference Proceeding

0

Dataset

0

Journal Article

13

Workshop Report

0

Availability
Full Text / Resource Available

13

Citation Only

2

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A unified and constructive framework for the universality of neural networks

https://doi.org/10.1093/imamat/hxad032

Bui-Thanh, Tan ( November 2023 , IMA Journal of Applied Mathematics)

Abstract
One of the reasons why many neural networks are capable of replicating complicated tasks or functions is their universal approximation property. Though the past few decades have seen tremendous advances in theories of neural networks, a single constructive and elementary framework for neural network universality remains unavailable. This paper is an effort to provide a unified and constructive framework for the universality of a large class of activation functions including most of the existing ones. At the heart of the framework is the concept of neural network approximate identity (nAI). The main result is as follows: any nAI activation function is universal in the space of continuous functions on compacta. It turns out that most of the existing activation functions are nAI, and thus universal. The framework induces several advantages over the contemporary counterparts. First, it is constructive with elementary means from functional analysis, probability theory, and numerical analysis. Second, it is one of the first unified and constructive attempts that is valid for most of the existing activation functions. Third, it provides new proofs for most activation functions. Fourth, for a given activation and error tolerance, the framework provides precisely the architecture of the corresponding one-hidden neural network with a predetermined number of neurons and the values of weights/biases. Fifth, the framework allows us to abstractly present the first universal approximation with a favorable non-asymptotic rate. Sixth, our framework also provides insights into the developments, and hence providing constructive derivations, of some of the existing approaches.

more » « less
An autoencoder compression approach for accelerating large-scale inverse problems

https://doi.org/10.1088/1361-6420/acfbe1

Wittmer, Jonathan ; Badger, Jacob ; Sundar, Hari ; Bui-Thanh, Tan ( October 2023 , Inverse Problems)
N/A (Ed.)
Abstract
Partial differential equation (PDE)-constrained inverse problems are some of the most challenging and computationally demanding problems in computational science today. Fine meshes required to accurately compute the PDE solution introduce an enormous number of parameters and require large-scale computing resources such as more processors and more memory to solve such systems in a reasonable time. For inverse problems constrained by time-dependent PDEs, the adjoint method often employed to compute gradients and higher order derivatives efficiently requires solving a time-reversed, so-called adjoint PDE that depends on the forward PDE solution at each timestep. This necessitates the storage of a high-dimensional forward solution vector at every timestep. Such a procedure quickly exhausts the available memory resources. Several approaches that trade additional computation for reduced memory footprint have been proposed to mitigate the memory bottleneck, including checkpointing and compression strategies. In this work, we propose a close-to-ideal scalable compression approach using autoencoders to eliminate the need for checkpointing and substantial memory storage, thereby reducing the time-to-solution and memory requirements. We compare our approach with checkpointing and an off-the-shelf compression approach on an earth-scale ill-posed seismic inverse problem. The results verify the expected close-to-ideal speedup for the gradient and Hessian-vector product using the proposed autoencoder compression approach. To highlight the usefulness of the proposed approach, we combine the autoencoder compression with the data-informed active subspace (DIAS) prior showing how the DIAS method can be affordably extended to large-scale problems without the need for checkpointing and large memory.

more » « less
Free, publicly-accessible full text available October 12, 2024
On unifying randomized methods for inverse problems

https://doi.org/10.1088/1361-6420/acd36e

Wittmer, Jonathan ; Krishnanunni, C G ; Nguyen, Hai V ; Bui-Thanh, Tan ( June 2023 , Inverse Problems)
N/A (Ed.)
Abstract
This work unifies the analysis of various randomized methods for solving linear and nonlinear inverse problems with Gaussian priors by framing the problem in a stochastic optimization setting. By doing so, we show that many randomized methods are variants of a sample average approximation (SAA). More importantly, we are able to prove a single theoretical result that guarantees the asymptotic convergence for a variety of randomized methods. Additionally, viewing randomized methods as an SAA enables us to prove, for the first time, a single non-asymptotic error result that holds for randomized methods under consideration. Another important consequence of our unified framework is that it allows us to discover new randomization methods. We present various numerical results for linear, nonlinear, algebraic, and PDE-constrained inverse problems that verify the theoretical convergence results and provide a discussion on the apparently different convergence rates and the behavior for various randomized methods.

more » « less
Free, publicly-accessible full text available June 9, 2024
A multilevel block preconditioner for the HDG trace system applied to incompressible resistive MHD

https://doi.org/10.1016/j.cma.2022.115775

Muralikrishnan, Sriramkrishnan ; Shannon, Stephen ; Bui-Thanh, Tan ; Shadid, John N. ( February 2023 , Computer Methods in Applied Mechanics and Engineering)

Full Text Available
Forward and inverse modeling of fault transmissibility in subsurface flows

https://doi.org/10.1016/j.camwa.2022.09.013

Lee, Jeonghun J. ; Bui-Thanh, Tan ; Villa, Umberto ; Ghattas, Omar ( December 2022 , Computers & Mathematics with Applications)

Full Text Available
Probabilistic constrained Bayesian inversion for transpiration cooling

https://doi.org/10.1002/fld.5135

Steins, Ella ; Bui‐Thanh, Tan ; Herty, Michael ; Müller, Siegfried ( December 2022 , International Journal for Numerical Methods in Fluids)

Full Text Available
A Model-Constrained Tangent Slope Learning Approach for Dynamical Systems

https://doi.org/10.1080/10618562.2022.2146677

Nguyen, Hai V. ; Bui-Thanh, Tan ( August 2022 , International Journal of Computational Fluid Dynamics)

Full Text Available
DIAS: A Data-Informed Active Subspace Regularization Framework for Inverse Problems

https://doi.org/10.3390/computation10030038

Nguyen, Hai ; Wittmer, Jonathan ; Bui-Thanh, Tan ( March 2022 , Computation)

This paper presents a regularization framework that aims to improve the fidelity of Tikhonov inverse solutions. At the heart of the framework is the data-informed regularization idea that only data-uninformed parameters need to be regularized, while the data-informed parameters, on which data and forward model are integrated, should remain untouched. We propose to employ the active subspace method to determine the data-informativeness of a parameter. The resulting framework is thus called a data-informed (DI) active subspace (DIAS) regularization. Four proposed DIAS variants are rigorously analyzed, shown to be robust with the regularization parameter and capable of avoiding polluting solution features informed by the data. They are thus well suited for problems with small or reasonably small noise corruptions in the data. Furthermore, the DIAS approaches can effectively reuse any Tikhonov regularization codes/libraries. Though they are readily applicable for nonlinear inverse problems, we focus on linear problems in this paper in order to gain insights into the framework. Various numerical results for linear inverse problems are presented to verify theoretical findings and to demonstrate advantages of the DIAS framework over the Tikhonov, truncated SVD, and the TSVD-based DI approaches.
more » « less
Full Text Available
A scalable exponential-DG approach for nonlinear conservation laws: With application to Burger and Euler equations

https://doi.org/10.1016/j.cma.2021.114031

Kang, Shinhoo ; Bui-Thanh, Tan ( November 2021 , Computer Methods in Applied Mechanics and Engineering)
null (Ed.)
Full Text Available
Solving Bayesian Inverse Problems via Variational Autoencoders

Goh, H. ( October 2021 , Proceeding of Machine Learning Research, 2nd Annual Conference on Mathematical and Scientific Machine Learning)

In recent years, the field of machine learning has made phenomenal progress in the pursuit of simulating real-world data generation processes. One notable example of such success is the variational autoencoder (VAE). In this work, with a small shift in perspective, we leverage and adapt VAEs for a different purpose: uncertainty quantification in scientific inverse problems. We introduce UQ-VAE: a flexible, adaptive, hybrid data/model-informed framework for training neural networks capable of rapid modelling of the posterior distribution representing the unknown parameter of interest. Specifically, from divergence-based variational inference, our framework is derived such that most of the information usually present in scientific inverse problems is fully utilized in the training procedure. Additionally, this framework includes an adjustable hyperparameter that allows selection of the notion of distance between the posterior model and the target distribution. This introduces more flexibility in controlling how optimization directs the learning of the posterior model. Further, this framework possesses an inherent adaptive optimization property that emerges through the learning of the posterior uncertainty.
more » « less
Full Text Available

« Prev Next »