NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A tutorial on the Bayesian statistical approach to inverse problems

https://doi.org/10.1063/5.0154773

Waqar, Faaiq_G; Patel, Swati; Simon, Cory_M (November 2023, APL Machine Learning)

Inverse problems are ubiquitous in science and engineering. Two categories of inverse problems concerning a physical system are (1) estimate parameters in a model of the system from observed input–output pairs and (2) given a model of the system, reconstruct the input to it that caused some observed output. Applied inverse problems are challenging because a solution may (i) not exist, (ii) not be unique, or (iii) be sensitive to measurement noise contaminating the data. Bayesian statistical inversion (BSI) is an approach to tackle ill-posed and/or ill-conditioned inverse problems. Advantageously, BSI provides a “solution” that (i) quantifies uncertainty by assigning a probability to each possible value of the unknown parameter/input and (ii) incorporates prior information and beliefs about the parameter/input. Herein, we provide a tutorial of BSI for inverse problems by way of illustrative examples dealing with heat transfer from ambient air to a cold lime fruit. First, we use BSI to infer a parameter in a dynamic model of the lime temperature from measurements of the lime temperature over time. Second, we use BSI to reconstruct the initial condition of the lime from a measurement of its temperature later in time. We demonstrate the incorporation of prior information, visualize the posterior distributions of the parameter/initial condition, and show posterior samples of lime temperature trajectories from the model. Our Tutorial aims to reach a wide range of scientists and engineers.
more » « less
Data-Driven Imputation of Miscibility of Aqueous Solutions via Graph-Regularized Logistic Matrix Factorization

https://doi.org/10.1021/acs.jpcb.3c03789

Behnoudfar, Diba; Simon, Cory M.; Schrier, Joshua (September 2023, The Journal of Physical Chemistry B)

Aqueous, two-phase systems (ATPSs) may form upon mixing two solutions of independently water-soluble compounds. Many separation, purification, and extraction processes rely on ATPSs. Predicting the miscibility of solutions can accelerate and reduce the cost of the discovery of new ATPSs for these applications. Whereas previous machine learning approaches to ATPS prediction used physicochemical properties of each solute as a descriptor, in this work, we show how to impute missing miscibility outcomes directly from an incomplete collection of pairwise miscibility experiments. We use graph-regularized logistic matrix factorization (GR-LMF) to learn a latent vector of each solution from (i) the observed entries in the pairwise miscibility matrix and (ii) a graph where each node is a solution and edges are relationships indicating the general category of the solute (i.e., polymer, surfactant, salt, protein). For an experimental data set of the pairwise miscibility of 68 solutions from Peacock et al. [ACS Appl. Mater. Interfaces 2021, 13, 11449–11460], we find that GR-LMF more accurately predicts missing (im)miscibility outcomes of pairs of solutions than ordinary logistic matrix factorization and random forest classifiers that use physicochemical features of the solutes. GR-LMF obviates the need for features of the solutions and solutions to impute missing miscibility outcomes, but it cannot predict the miscibility of a new solution without some observations of its miscibility with other solutions in the training data set.
more » « less
Full Text Available
Classifying the toxicity of pesticides to honey bees via support vector machines with random walk graph kernels

https://doi.org/10.1063/5.0090573

Yang, Ping; Henle, E. Adrian; Fern, Xiaoli Z.; Simon, Cory M. (July 2022, The Journal of Chemical Physics)

Pesticides benefit agriculture by increasing crop yield, quality, and security. However, pesticides may inadvertently harm bees, which are valuable as pollinators. Thus, candidate pesticides in development pipelines must be assessed for toxicity to bees. Leveraging a dataset of 382 molecules with toxicity labels from honey bee exposure experiments, we train a support vector machine (SVM) to predict the toxicity of pesticides to honey bees. We compare two representations of the pesticide molecules: (i) a random walk feature vector listing counts of length- L walks on the molecular graph with each vertex- and edge-label sequence and (ii) the Molecular ACCess System (MACCS) structural key fingerprint (FP), a bit vector indicating the presence/absence of a list of pre-defined subgraph patterns in the molecular graph. We explicitly construct the MACCS FPs but rely on the fixed-length- L random walk graph kernel (RWGK) in place of the dot product for the random walk representation. The L-RWGK-SVM achieves an accuracy, precision, recall, and F1 score (mean over 2000 runs) of 0.81, 0.68, 0.71, and 0.69, respectively, on the test data set—with L = 4 being the mode optimal walk length. The MACCS-FP-SVM performs on par/marginally better than the L-RWGK-SVM, lends more interpretability, but varies more in performance. We interpret the MACCS-FP-SVM by illuminating which subgraph patterns in the molecules tend to strongly push them toward the toxic/non-toxic side of the separating hyperplane.
more » « less
Full Text Available
PoreMatMod.jl : Julia Package for in Silico Postsynthetic Modification of Crystal Structure Models

https://doi.org/10.1021/acs.jcim.1c01219

Henle, E. Adrian; Gantzler, Nickolas; Thallapally, Praveen K.; Fern, Xiaoli Z.; Simon, Cory M. (February 2022, Journal of Chemical Information and Modeling)

Full Text Available
Bayesian optimization of nanoporous materials

https://doi.org/10.1039/D1ME00093D

Deshwal, Aryan; Simon, Cory; Doppa, Janardhan Rao (December 2021, Molecular Systems Design and Engineering)

Full Text Available
Recommendation System to Predict Missing Adsorption Properties of Nanoporous Materials

https://doi.org/10.1021/acs.chemmater.1c01201

Sturluson, Arni; Raza, Ali; McConachie, Grant D.; Siderius, Daniel W.; Fern, Xiaoli Z.; Simon, Cory M. (September 2021, Chemistry of Materials)

Full Text Available
Non-injective gas sensor arrays: identifying undetectable composition changes

https://doi.org/10.1088/1361-648X/ac1e49

Gantzler, Nickolas; Henle, E Adrian; Thallapally, Praveen K; Fern, Xiaoli Z; Simon, Cory M (September 2021, Journal of Physics: Condensed Matter)

Full Text Available
Evaluating the Fitness of Combinations of Adsorbents for Quantitative Gas Sensor Arrays

https://doi.org/10.1021/acssensors.0c02014

Sousa, Rachel; Simon, Cory M. (December 2020, ACS Sensors)
null (Ed.)
Full Text Available
Towards explainable message passing networks for predicting carbon dioxide adsorption in metal-organic frameworks

A. Raza, F. Waqar (December 2020, Machine Learning for Molecules Workshop at NeurIPS)
null (Ed.)
Full Text Available
Statistical Mechanical Model of Gas Adsorption in a Metal–Organic Framework Harboring a Rotaxane Molecular Shuttle

https://doi.org/10.1021/acs.langmuir.0c02839

Carney, Jonathan; Roundy, David; Simon, Cory M. (November 2020, Langmuir)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records