NSF PAR Search | NSF Public Access Repository

Machine learning materials properties with accurate predictions, uncertainty estimates, domain guidance, and persistent online accessibility

https://doi.org/10.1088/2632-2153/ad95db

Jacobs, Ryan; Schultz, Lane E; Scourtas, Aristana; Schmidt, KJ; Price-Skelly, Owen; Engler, Will; Foster, Ian; Blaiszik, Ben; Voyles, Paul M; Morgan, Dane (December 2024, Machine Learning: Science and Technology)

Abstract One compelling vision of the future of materials discovery and design involves the use of machine learning (ML) models to predict materials properties and then rapidly find materials tailored for specific applications. However, realizing this vision requires both providing detailed uncertainty quantification (model prediction errors and domain of applicability) and making models readily usable. At present, it is common practice in the community to assess ML model performance only in terms of prediction accuracy (e.g. mean absolute error), while neglecting detailed uncertainty quantification and robust model accessibility and usability. Here, we demonstrate a practical method for realizing both uncertainty and accessibility features with a large set of models. We develop random forest ML models for 33 materials properties spanning an array of data sources (computational and experimental) and property types (electrical, mechanical, thermodynamic, etc). All models have calibrated ensemble error bars to quantify prediction uncertainty and domain of applicability guidance enabled by kernel-density-estimate-based feature distance measures. All data and models are publicly hosted on the Garden-AI infrastructure, which provides an easy-to-use, persistent interface for model dissemination that permits models to be invoked with only a few lines of Python code. We demonstrate the power of this approach by using our models to conduct a fully ML-based materials discovery exercise to search for new stable, highly active perovskite oxide catalyst materials.
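The abstract mentions domain-of-applicability guidance based on kernel-density-estimate feature distances but does not give the formula. The following is a minimal sketch of one way such a measure could work, not the authors' implementation. The function names, the isotropic Gaussian kernel, the fixed bandwidth, and the normalization by the highest training-point density are all assumptions made for illustration.

```python
import math


def gaussian_kde_density(point, train, bandwidth):
    """Gaussian kernel density estimate at `point` from the rows of `train`.

    Assumes an isotropic kernel with one shared bandwidth for all features.
    """
    dim = len(point)
    norm = (2.0 * math.pi * bandwidth ** 2) ** (dim / 2.0)
    total = 0.0
    for row in train:
        sq_dist = sum((p - q) ** 2 for p, q in zip(point, row))
        total += math.exp(-sq_dist / (2.0 * bandwidth ** 2))
    return total / (len(train) * norm)


def kde_dissimilarity(point, train, bandwidth=1.0):
    """Hypothetical distance in [0, 1]: 0 is dense training region, 1 is far away.

    The query density is divided by the largest density found at any training
    point, and the ratio is clipped at 1 before being subtracted from 1.
    """
    reference = max(gaussian_kde_density(row, train, bandwidth) for row in train)
    ratio = gaussian_kde_density(point, train, bandwidth) / reference
    return 1.0 - min(1.0, ratio)


# Toy feature vectors standing in for featurized training materials.
train_features = [(0.0, 0.0), (0.1, -0.1), (-0.2, 0.1), (0.05, 0.2)]
near_query = (0.0, 0.05)   # sits inside the training cluster
far_query = (5.0, 5.0)     # far from every training point

near_score = kde_dissimilarity(near_query, train_features)
far_score = kde_dissimilarity(far_query, train_features)
print(f"near: {near_score:.3f}, far: {far_score:.3f}")
```

Under these assumptions, a prediction for a query with a score near 1 would be flagged as outside the model's domain. In practice the features would normally be scaled first, and the bandwidth and cutoff would need to be chosen from data.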
Community action on FAIR data will fuel a revolution in materials research

https://doi.org/10.1557/s43577-023-00498-4

Brinson, L. Catherine; Bartolo, Laura M.; Blaiszik, Ben; Elbert, David; Foster, Ian; Strachan, Alejandro; Voorhees, Peter W. (March 2023, MRS Bulletin)

Deep Learning Approach for High-accuracy Electron Counting of Monolithic Active Pixel Sensor-type Direct Electron Detectors at Increased Electron Dose

https://doi.org/10.1093/micmic/ozad132

Wei, Jingrui; Moore, Kalani; Bammes, Benjamin; Levin, Barnaby D; Hagopian, Nicholas; Jacobs, Ryan; Morgan, Dane; Voyles, Paul M (December 2023, Microscopy and Microanalysis)

Abstract Electron counting can be performed algorithmically for monolithic active pixel sensor direct electron detectors to eliminate readout noise and Landau noise arising from the variability in the amount of deposited energy for each electron. Errors in existing counting algorithms include mistakenly counting a multielectron strike as a single electron event, and inaccurately locating the incident position of the electron due to lateral spread of deposited energy and dark noise. Here, we report a supervised deep learning (DL) approach based on Faster region-based convolutional neural network (R-CNN) to recognize single electron events at varying electron doses and voltages. The DL approach shows high accuracy according to the near-ideal modulation transfer function (MTF) and detector quantum efficiency for sparse images. It predicts, on average, 0.47 pixel deviation from the incident positions for 200 kV electrons versus 0.59 pixel using the conventional counting method. The DL approach also shows better robustness against coincidence loss as the electron dose increases, maintaining the MTF at half Nyquist frequency above 0.83 as the electron density increases to 0.06 e−/pixel. Thus, the DL model extends the advantages of counting analysis to higher dose rates than conventional methods.
Full Text Available
FAIR principles for AI models with a practical application for accelerated high energy diffraction microscopy

https://doi.org/10.1038/s41597-022-01712-9

Ravi, Nikil; Chaturvedi, Pranshu; Huerta, E. A.; Liu, Zhengchun; Chard, Ryan; Scourtas, Aristana; Schmidt, K. J.; Chard, Kyle; Blaiszik, Ben; Foster, Ian (November 2022, Scientific Data)

Abstract A concise and measurable set of FAIR (Findable, Accessible, Interoperable and Reusable) principles for scientific data is transforming the state-of-practice for data management and stewardship, supporting and enabling discovery and innovation. Learning from this initiative, and acknowledging the impact of artificial intelligence (AI) in the practice of science and engineering, we introduce a set of practical, concise, and measurable FAIR principles for AI models. We showcase how to create and share FAIR data and AI models within a unified computational framework combining the following elements: the Advanced Photon Source at Argonne National Laboratory, the Materials Data Facility, the Data and Learning Hub for Science, funcX, and the Argonne Leadership Computing Facility (ALCF), in particular the ThetaGPU supercomputer and the SambaNova DataScale® system at the ALCF AI Testbed. We describe how this domain-agnostic computational framework may be harnessed to enable autonomous AI-driven discovery.
Experimental and theoretical studies of native deep-level defects in transition metal dichalcogenides

https://doi.org/10.1038/s41699-022-00350-4

Kim, Jun Young; Gelczuk, Łukasz; Polak, Maciej P.; Hlushchenko, Daria; Morgan, Dane; Kudrawiec, Robert; Szlufarska, Izabela (October 2022, npj 2D Materials and Applications)

Abstract Transition metal dichalcogenides (TMDs), especially in two-dimensional (2D) form, exhibit many properties desirable for device applications. However, device performance can be hindered by the presence of defects. Here, we combine state-of-the-art experimental and computational approaches to determine formation energies and charge transition levels of defects in bulk and 2D MX₂ (M = Mo or W; X = S, Se, or Te). We perform deep level transient spectroscopy (DLTS) measurements of bulk TMDs. Simultaneously, we calculate formation energies and defect levels of all native point defects, which enable identification of levels observed in DLTS and extend our calculations to vacancies in 2D TMDs, for which DLTS is challenging. We find that reduction of dimensionality of TMDs to 2D has a significant impact on defect properties. This finding may explain differences in optical properties of 2D TMDs synthesized with different methods and lays the foundation for future developments of more efficient TMD-based devices.
Calibration after bootstrap for accurate uncertainty quantification in regression models

https://doi.org/10.1038/s41524-022-00794-8

Palmer, Glenn; Du, Siqi; Politowicz, Alexander; Emory, Joshua Paul; Yang, Xiyu; Gautam, Anupraas; Gupta, Grishma; Li, Zhelong; Jacobs, Ryan; Morgan, Dane (May 2022, npj Computational Materials)

Abstract Obtaining accurate estimates of machine learning model uncertainties on newly predicted data is essential for understanding the accuracy of the model and whether its predictions can be trusted. A common approach to such uncertainty quantification is to estimate the variance from an ensemble of models, which are often generated by the generally applicable bootstrap method. In this work, we demonstrate that the direct bootstrap ensemble standard deviation is not an accurate estimate of uncertainty but that it can be simply calibrated to dramatically improve its accuracy. We demonstrate the effectiveness of this calibration method for both synthetic data and numerous physical datasets from the field of Materials Science and Engineering. The approach is motivated by applications in physical and biological science but is quite general and should be applicable for uncertainty quantification in a wide range of machine learning regression models.
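The abstract says the bootstrap ensemble standard deviation can be "simply calibrated" but does not state the functional form. The sketch below illustrates the general idea under an assumption of its own: a single multiplicative factor a is fitted on held-out data so that each residual is modelled as Normal(0, (a·σ)²). Setting the derivative of the Gaussian negative log-likelihood to zero gives the closed form a² = mean(residual² / σ²). The function names and the synthetic data are hypothetical, and the paper's actual calibration may use a different form.

```python
import math
import random


def fit_scale_factor(residuals, ensemble_stds):
    """Maximum-likelihood scale a for residual_i ~ Normal(0, (a * std_i)^2).

    Closed form: a = sqrt(mean((residual_i / std_i)^2)).
    """
    if not residuals or len(residuals) != len(ensemble_stds):
        raise ValueError("need equally sized, non-empty inputs")
    squared_z = [(r / s) ** 2 for r, s in zip(residuals, ensemble_stds)]
    return math.sqrt(sum(squared_z) / len(squared_z))


def calibrate(ensemble_stds, scale):
    """Apply the fitted factor to uncalibrated ensemble standard deviations."""
    return [scale * s for s in ensemble_stds]


# Synthetic validation set (assumption): the ensemble reports stds that are
# half as large as the true spread of the prediction errors.
rng = random.Random(0)
raw_stds = [rng.uniform(0.5, 1.5) for _ in range(5000)]
residuals = [rng.gauss(0.0, 2.0 * s) for s in raw_stds]

scale = fit_scale_factor(residuals, raw_stds)
calibrated_stds = calibrate(raw_stds, scale)
print(f"fitted scale factor: {scale:.2f}")
```

Because the synthetic errors were drawn with twice the reported spread, the fitted factor should come out close to 2. With real models, the residuals and ensemble standard deviations would come from a validation split or cross-validation rather than from a random generator.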
A general approach for determining applicability domain of machine learning models

https://doi.org/10.1038/s41524-025-01573-x

Schultz, Lane E; Wang, Yiqi; Jacobs, Ryan; Morgan, Dane (December 2025, npj Computational Materials)

Free, publicly-accessible full text available December 1, 2026
Machine learning metallic glass critical cooling rates through elemental and molecular simulation based featurization

https://doi.org/10.1016/j.jmat.2024.100964

Schultz, Lane E; Afflerbach, Benjamin; Voyles, Paul M; Morgan, Dane (July 2025, Journal of Materiomics)

Free, publicly-accessible full text available July 1, 2026
Accelerating ensemble uncertainty estimates in supervised materials property regression models

https://doi.org/10.1016/j.commatsci.2024.113494

Agrawal, Vidit; Zhang, Shixin; Schultz, Lane; Morgan, Dane (October 2024, Computational Materials Science)

Full Text Available
Regression with Large Language Models for Materials and Molecular Property Prediction

https://doi.org/10.48550/arXiv.2409.06080

Jacobs, Ryan; Polak, Maciej; Schultz, Lane; Mahdavi, Hamed; Honavar, Vasant; Morgan, Dane (September 2024, arXiv.org)

Full Text Available
