JARVIS-Leaderboard: a large scale benchmark of materials design methods

Choudhary, Kamal (ORCID:0000000197378074); Wines, Daniel (ORCID:0000000338553754); Li, Kangming (ORCID:0000000344718527); Garrity, Kevin F. (ORCID:0000000302634157); Gupta, Vishu (ORCID:0000000249317194); Romero, Aldo H. (ORCID:0000000159680571); Krogel, Jaron T. (ORCID:000000021859181X); Saritas, Kayahan (ORCID:0000000222408520); Fuhr, Addis (ORCID:0000000288198255); Ganesh, Panchapakesan (ORCID:0000000271702902); Kent, Paul R. C. (ORCID:0000000155394017); Yan, Keqiang (ORCID:0009000992869259); Lin, Yuchao (ORCID:0009000362716397); Ji, Shuiwang (ORCID:0000000242054563); Blaiszik, Ben (ORCID:0000000253264902); Reiser, Patrick (ORCID:000000027052696X); Friederich, Pascal (ORCID:0000000344651465); Agrawal, Ankit (ORCID:0000000255190302); Tiwary, Pratyush (ORCID:0000000224126922); Beyerle, Eric; Minch, Peter; Rhone, Trevor David (ORCID:0000000201989952); Takeuchi, Ichiro (ORCID:0000000326250553); Wexler, Robert B. (ORCID:0000000268616421); Mannodi-Kanakkithodi, Arun (ORCID:0000000307801583); Ertekin, Elif (ORCID:0000000278161803); Mishra, Avanish (ORCID:0000000339970445); Mathew, Nithin (ORCID:0000000223163190); Wood, Mitchell (ORCID:0000000158784096); Rohskopf, Andrew Dale (ORCID:0000000227128296); Hattrick-Simpers, Jason (ORCID:0000000329373188); Wang, Shih-Han (ORCID:0000000344182080); Achenie, Luke E. K. (ORCID:0000000198505346); Xin, Hongliang (ORCID:0000000193441697); Williams, Maureen (ORCID:0000000191440551); Biacchi, Adam J. (ORCID:0000000156632048); Tavazza, Francesca (ORCID:000000025602180X)

doi:10.1038/s41524-024-01259-w

Abstract: The lack of rigorous reproducibility and validation is a significant hurdle for scientific development across many fields. Materials science, in particular, encompasses a variety of experimental and theoretical approaches that require careful benchmarking. Leaderboard efforts have been developed previously to mitigate these issues. However, a comprehensive comparison and benchmarking effort on an integrated platform that spans multiple data modalities and covers both perfect and defect materials data is still lacking. This work introduces JARVIS-Leaderboard, an open-source and community-driven platform that facilitates benchmarking and enhances reproducibility. The platform allows users to set up benchmarks with custom tasks and enables contributions in the form of dataset, code, and metadata submissions. We cover the following materials design categories: Artificial Intelligence (AI), Electronic Structure (ES), Force-fields (FF), Quantum Computation (QC), and Experiments (EXP). For AI, we cover several types of input data, including atomic structures, atomistic images, spectra, and text. For ES, we consider multiple ES approaches, software packages, pseudopotentials, materials, and properties, comparing results to experiment. For FF, we compare multiple approaches for material property predictions. For QC, we benchmark Hamiltonian simulations using various quantum algorithms and circuits. Finally, for EXP, we use the inter-laboratory approach to establish benchmarks. There are 1281 contributions to 274 benchmarks using 152 methods with more than 8 million data points, and the leaderboard is continuously expanding. The JARVIS-Leaderboard is available at https://pages.nist.gov/jarvis_leaderboard/
