Towards overcoming data scarcity in materials science: unifying models and datasets with a mixture of experts framework

Chang, Rees (ORCID:0000000157737183); Wang, Yu-Xiong; Ertekin, Elif

doi:10.1038/s41524-022-00929-x

Citation Details

Towards overcoming data scarcity in materials science: unifying models and datasets with a mixture of experts framework

Abstract While machine learning has emerged in recent years as a useful tool for the rapid prediction of materials properties, generating sufficient data to reliably train models without overfitting is often impractical. Towards overcoming this limitation, we present a general framework for leveraging complementary information across different models and datasets for accurate prediction of data-scarce materials properties. Our approach, based on a machine learning paradigm called mixture of experts, outperforms pairwise transfer learning on 14 of 19 materials property regression tasks, performing comparably on four of the remaining five. The approach is interpretable, model-agnostic, and scalable to combining an arbitrary number of pre-trained models and datasets to any downstream property prediction task. We anticipate the performance of our framework will further improve as better model architectures, new pre-training tasks, and larger materials datasets are developed by the community. more »

Award ID(s):: 2118201 2106825

PAR ID:: 10380747

Author(s) / Creator(s):: Chang, Rees; Wang, Yu-Xiong; Ertekin, Elif

Publisher / Repository:: Nature Publishing Group

Date Published:: 2022-11-18

Journal Name:: npj Computational Materials

Volume:: 8

Issue:: 1

ISSN:: 2057-3960

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1038/s41524-022-00929-x

More Like this