Search for: All records

Creators/Authors contains: "Mishra-Sharma, Siddharth"

Note: Clicking on a Digital Object Identifier (DOI) number will take you to an external site maintained by the publisher. Some full-text articles may not yet be available free of charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from those of this site.

  1. Abstract Analyses of the cosmic 21-cm signal are hampered by astrophysical foregrounds that are far stronger than the signal itself. These foregrounds, typically confined to a wedge-shaped region in Fourier space, often necessitate the removal of the vast majority of modes, thereby degrading the quality of the data anisotropically. To address this challenge, we introduce a novel deep generative model based on stochastic interpolants to reconstruct the 21-cm data lost to wedge filtering. Our method leverages the non-Gaussian nature of the 21-cm signal to effectively map wedge-filtered 3D lightcones to samples from the conditional distribution of wedge-recovered lightcones. We demonstrate that our method effectively restores spatial information under variations in both the cosmological initial conditions and the astrophysical parameters. Furthermore, we discuss a number of future avenues where this approach could be applied in analyses of the 21-cm signal, potentially offering new opportunities to improve our understanding of the Universe during the epochs of cosmic dawn and reionization. (A minimal sketch of the interpolant training and sampling loop appears after this list.)
  2. Abstract A common setting in astronomy is the availability of a small number of high-quality observations and larger amounts of either lower-quality observations or synthetic data from simplified models. Time-domain astrophysics is a canonical example of this imbalance, with the number of supernovae observed photometrically outpacing the number observed spectroscopically by multiple orders of magnitude. At the same time, no data-driven models exist to understand these photometric and spectroscopic observables in a common context. Contrastive learning objectives, which have grown in popularity for aligning distinct data modalities in a shared embedding space, provide a potential solution for extracting information from these modalities. We present Maven, the first foundation model for supernova science. To construct Maven, we first pre-train our model to align photometry and spectroscopy from 0.5 million synthetic supernovae using a contrastive objective. We then fine-tune the model on 4702 observed supernovae from the Zwicky Transient Facility. Maven reaches state-of-the-art performance on both classification and redshift estimation, despite the embeddings not being explicitly optimized for these tasks. Through ablation studies, we show that pre-training with synthetic data improves overall performance. In the upcoming era of the Vera C. Rubin Observatory, Maven will serve as a valuable tool for leveraging large, unlabeled, and multimodal time-domain datasets. (A sketch of the symmetric contrastive objective appears after this list.)
  3. We present PAPERCLIP (Proposal Abstracts Provide an Effective Representation for Contrastive Language-Image Pre-training), a method that associates astronomical observations imaged by telescopes with natural language using a neural network model. The model is fine-tuned from a pre-trained Contrastive Language–Image Pre-training (CLIP) model using successful observing proposal abstracts and corresponding downstream observations, with the abstracts optionally summarized via guided generation using large language models (LLMs). Using observations from the Hubble Space Telescope (HST) as an example, we show that the fine-tuned model embodies a meaningful joint representation between observations and natural language, through quantitative evaluation as well as tests targeting image retrieval (i.e., finding the most relevant observations using natural language queries) and description retrieval (i.e., querying for the astrophysical object classes and use cases most relevant to a given observation). Our study demonstrates the potential of using generalist foundation models, rather than task-specific models, for interacting with astronomical data by leveraging text as an interface. (A sketch of text-based image retrieval with a CLIP-style model appears after this list.)
    Free, publicly-accessible full text available July 10, 2025
  4. Abstract Strong gravitational lensing has emerged as a promising approach for probing dark matter (DM) models on sub-galactic scales. Recent work has proposed the subhalo effective density slope as a more reliable observable than the commonly used subhalo mass function. The subhalo effective density slope is a measurement independent of assumptions about the underlying density profile and can be inferred for individual subhaloes through traditional sampling methods. To go beyond individual subhalo measurements, we leverage recent advances in machine learning and introduce a neural likelihood-ratio estimator to infer an effective density slope for populations of subhaloes. We demonstrate that our method is capable of harnessing the statistical power of multiple subhaloes (within and across multiple images) to distinguish between characteristics of different subhalo populations. The computational efficiency afforded by the neural likelihood-ratio estimator over traditional sampling enables statistical studies of DM perturbers and is particularly useful given the influx of strong lensing systems expected from upcoming surveys. (A sketch of the likelihood-ratio estimation setup appears after this list.)
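For item 1, the following is a minimal sketch of how a conditional stochastic-interpolant model can be trained and sampled: the network learns the velocity of a linear interpolant between Gaussian noise and the target field, conditioned on the wedge-filtered input, and sampling integrates the resulting probability-flow ODE. The tiny MLP, tensor shapes, and function names are illustrative assumptions (the paper would use a 3D convolutional architecture on lightcones), not the authors' implementation.

```python
import torch
import torch.nn as nn

class VelocityNet(nn.Module):
    """Toy conditional velocity field b(x_t, t | y); illustrative only."""
    def __init__(self, dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2 * dim + 1, 256), nn.SiLU(),
            nn.Linear(256, 256), nn.SiLU(),
            nn.Linear(256, dim),
        )

    def forward(self, x_t, t, y):
        # y: wedge-filtered lightcone (flattened); t: per-sample time in [0, 1].
        return self.net(torch.cat([x_t, y, t[:, None]], dim=-1))

def interpolant_loss(model, x1, y):
    """Linear interpolant x_t = (1 - t) x0 + t x1 with x0 ~ N(0, I);
    the network regresses the constant velocity dx_t/dt = x1 - x0."""
    x0 = torch.randn_like(x1)
    t = torch.rand(x1.shape[0], device=x1.device)
    x_t = (1 - t)[:, None] * x0 + t[:, None] * x1
    return ((model(x_t, t, y) - (x1 - x0)) ** 2).mean()

@torch.no_grad()
def sample(model, y, steps=100):
    """Euler integration of the probability-flow ODE from noise at t=0
    to a conditional sample at t=1."""
    x = torch.randn_like(y)
    dt = 1.0 / steps
    for i in range(steps):
        t = torch.full((y.shape[0],), i * dt, device=y.device)
        x = x + dt * model(x, t, y)
    return x
```

Training would minimize `interpolant_loss` over pairs of true and wedge-filtered lightcones; repeated calls to `sample` then draw from the conditional distribution of wedge-recovered fields.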
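For item 2, a contrastive (CLIP-style) pre-training objective of the kind the Maven abstract describes can be written compactly: matched photometry/spectroscopy embeddings of the same supernova sit on the diagonal of a similarity matrix, and all other batch pairs serve as negatives. Function and argument names are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def contrastive_loss(photo_emb, spec_emb, temperature=0.07):
    """Symmetric InfoNCE: align embeddings of matched photometric and
    spectroscopic views; off-diagonal batch pairs act as negatives."""
    p = F.normalize(photo_emb, dim=-1)
    s = F.normalize(spec_emb, dim=-1)
    logits = p @ s.t() / temperature                   # (batch, batch)
    labels = torch.arange(p.shape[0], device=p.device)
    return 0.5 * (F.cross_entropy(logits, labels)
                  + F.cross_entropy(logits.t(), labels))
```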
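For item 3, text-as-interface retrieval with a CLIP-style model reduces to ranking image embeddings by cosine similarity to a query embedding. The sketch below uses the base OpenAI CLIP checkpoint via Hugging Face transformers as a stand-in; the PAPERCLIP fine-tuned weights and HST data handling are not shown.

```python
import torch
from transformers import CLIPModel, CLIPProcessor

# Stand-in checkpoint; PAPERCLIP fine-tunes a model like this on
# HST proposal abstracts and downstream observations.
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

@torch.no_grad()
def retrieve(query, images, top_k=5):
    """Return indices of the images most similar to a natural-language query."""
    inputs = processor(text=[query], images=images,
                       return_tensors="pt", padding=True)
    out = model(**inputs)
    text = out.text_embeds / out.text_embeds.norm(dim=-1, keepdim=True)
    imgs = out.image_embeds / out.image_embeds.norm(dim=-1, keepdim=True)
    scores = (text @ imgs.t()).squeeze(0)              # cosine similarities
    return scores.topk(min(top_k, len(images))).indices.tolist()
```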
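For item 4, a neural likelihood-ratio estimator can be trained as a classifier that separates jointly drawn (observation, slope) pairs from independently shuffled ones; its logit then approximates the per-subhalo log likelihood ratio, and population-level inference sums these ratios across subhaloes and images. The MLP and variable names below are illustrative assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RatioNet(nn.Module):
    """Classifier whose logit approximates log p(x | gamma) - log p(x)."""
    def __init__(self, x_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(x_dim + 1, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, 1),
        )

    def forward(self, x, gamma):
        return self.net(torch.cat([x, gamma[:, None]], dim=-1)).squeeze(-1)

def nre_loss(model, x, gamma):
    """Binary cross-entropy: label 1 for joint pairs (x, gamma), label 0
    for marginal pairs made by shuffling gamma within the batch."""
    joint = model(x, gamma)
    marg = model(x, gamma[torch.randperm(gamma.shape[0])])
    return (F.binary_cross_entropy_with_logits(joint, torch.ones_like(joint))
            + F.binary_cross_entropy_with_logits(marg, torch.zeros_like(marg)))

@torch.no_grad()
def population_log_ratio(model, xs, gamma):
    """Sum per-subhalo log ratios to test a population-level slope gamma
    across many subhaloes (and, by extension, across images)."""
    return model(xs, gamma.expand(xs.shape[0])).sum()
```

Summing the per-subhalo logits is what lets the estimator pool statistical power across subhaloes within and across lensed images, as the abstract describes.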