Achieving GPT-4o level performance in astronomy with a specialized 8B-parameter large language model

de_Haan, Tijmen; Ting, Yuan-Sen; Ghosal, Tirthankar; Nguyen, Tuan Dung; Accomazzi, Alberto; Wells, Azton; Ramachandra, Nesar; Pan, Rui; Sun, Zechang

doi:10.1038/s41598-025-97131-y

This content will become publicly available on December 1, 2026

Abstract: AstroSage-Llama-3.1-8B is a domain-specialized natural-language AI assistant tailored for research in astronomy, astrophysics, cosmology, and astronomical instrumentation. Trained on the complete collection of astronomy-related arXiv papers from 2007 to 2024 along with millions of synthetically-generated question-answer pairs and other astronomical literature, AstroSage-Llama-3.1-8B demonstrates remarkable proficiency on a wide range of questions. AstroSage-Llama-3.1-8B scores 80.9% on the AstroMLab-1 benchmark, greatly outperforming all models, proprietary and open-weight, in the 8-billion parameter class, and performing on par with GPT-4o. This achievement demonstrates the potential of domain specialization in AI, suggesting that focused training can yield capabilities exceeding those of much larger, general-purpose models. AstroSage-Llama-3.1-8B is freely available, enabling widespread access to advanced AI capabilities for astronomical education and research.

Award ID(s): 2406729

PAR ID: 10612805

Author(s) / Creator(s): de_Haan, Tijmen; Ting, Yuan-Sen; Ghosal, Tirthankar; Nguyen, Tuan Dung; Accomazzi, Alberto; Wells, Azton; Ramachandra, Nesar; Pan, Rui; Sun, Zechang

Publisher / Repository: Scientific Reports

Date Published: 2025-12-01

Journal Name: Scientific Reports

Volume: 15

Issue: 1

ISSN: 2045-2322

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.1038/s41598-025-97131-y
