NSF PAR Search | NSF Public Access Repository

Trial by FIRE: probing the dark matter density profile of dwarf galaxies with GraphNPE

https://doi.org/10.1093/mnras/staf1118

Nguyen, Tri; Read, Justin; Necib, Lina; Mishra-Sharma, Siddharth; Faucher-Giguère, Claude-André; Wetzel, Andrew; Starkenburg, Tjitske K (July 2025, Monthly Notices of the Royal Astronomical Society)

ABSTRACT The dark matter (DM) distribution in dwarf galaxies provides crucial insights into both structure formation and the particle nature of DM. GraphNPE (Graph Neural Posterior Estimator), first introduced in Nguyen et al. (2023), is a novel simulation-based inference framework that combines graph neural networks and normalizing flows to infer the DM density profile from line-of-sight stellar velocities. Here, we apply GraphNPE to satellite dwarf galaxies in the FIRE-2 Latte simulation suite of Milky Way-mass haloes, testing it against both Cold and Self-Interacting DM scenarios. Our method demonstrates superior precision compared to conventional Jeans-based approaches, recovering DM density profiles to within the 95 per cent confidence level even in systems with as few as 30 tracers. Moreover, we present the first evaluation of mass modelling methods in constraining two key parameters from realistic simulations: the peak circular velocity, $$V_\mathrm{max}$$, and the peak virial mass, $$M_\mathrm{200m}^\mathrm{peak}$$. Using only line-of-sight velocities, GraphNPE can reliably recover both $$V_\mathrm{max}$$ and $$M_\mathrm{200m}^\mathrm{peak}$$ within our quoted uncertainties, including those experiencing tidal effects ($$\gtrsim 63~{{\rm per\ cent}}$$ of systems are recovered within our 68 per cent confidence intervals and $$\gtrsim 92~{{\rm per\ cent}}$$ within our 95 per cent confidence intervals). The method achieves $$10-20~{{\rm per\ cent}}$$ accuracy in $$V_\mathrm{max}$$ recovery, while $$M_\mathrm{200m}^\mathrm{peak}$$ is recovered to $$0.1-0.4 \, \mathrm{dex}$$ accuracy. This work establishes GraphNPE as a robust tool for inferring DM density profiles in dwarf galaxies, offering promising avenues for constraining DM models. The framework’s potential extends beyond this study, as it can be adapted to non-spherical and disequilibrium models, showcasing the broader utility of simulation-based inference and graph-based learning in astrophysics.
Free, publicly-accessible full text available July 9, 2026
Noisy Label Learning with Instance-Dependent Outliers: Identifiability via Crowd Wisdom

Nguyen, Tri; Ibrahim, Shahana; Fu, Xiao (December 2024, NeurIPS 2024)

Full Text Available
A Transducers-based Programming Framework for Efficient Data Transformation

https://doi.org/10.1145/3656019.3676891

Nguyen, Tri; Becchi, Michela (October 2024, ACM)

Many data analytics and scientific applications rely on data transformation tasks, such as encoding, decoding, parsing of structured and unstructured data, and conversions between data formats and layouts. Previous work has shown that data transformation can represent a performance bottleneck for data analytics workloads. The transducers computational abstraction can be used to express a wide range of data transformations, and recent efforts have proposed configurable engines implementing various transducer models (from finite state transducers, to pushdown transducers, to extended models). This line of research, however, is still at an early stage. Notably, expressing data transformation using transducers requires a paradigm shift, impacting programmability. To address this problem, we propose a programming framework to map data transformation tasks onto a variety of transducer models. Our framework includes: (1) a platform agnostic programming language (xPTLang) to code transducer programs using intuitive programming constructs, and (2) a compiler that, given an xPTLang program, generates efficient transducer processing engines for CPU and GPU. Our compiler includes a set of optimizations to improve code efficiency. We demonstrate our framework on a diverse set of data transformation tasks on an Intel CPU and an Nvidia GPU.
Full Text Available
Significantly Improving Fixed-Ratio Compression Framework for Resource-limited Applications

https://doi.org/10.1145/3673038.3673092

Nguyen, Tri; Rahman, Md Hasanur; Di, Sheng; Becchi, Michela (August 2024, ACM)

Scientific simulations running on HPC facilities generate massive amounts of data, putting significant pressure on supercomputers’ storage capacity and network bandwidth. To alleviate this problem, there has been a rich body of work on reducing data volumes via error-controlled lossy compression. However, fixed-ratio compression is not well supported, which prevents users from appropriately allocating memory/storage space or knowing the data transfer time over the network in advance. To address this problem, recent ratio-controlled frameworks, such as FXRZ, have incorporated methods to predict the error bound settings required to reach a user-specified compression ratio. However, these approaches fail to achieve fixed-ratio compression in an accurate, efficient and scalable fashion on diverse datasets and compression algorithms. This work proposes an efficient, scalable, ratio-controlled lossy compression framework (CAROL). At the core of CAROL are four optimization strategies that improve prediction accuracy and runtime efficiency over state-of-the-art solutions. First, CAROL uses surrogate-based compression ratio estimation to generate training data. Second, it includes a novel calibration method to improve prediction accuracy across a variety of compressors. Third, it leverages Bayesian optimization to allow for efficient training and incremental model refinement. Fourth, it uses GPU acceleration to speed up prediction. We evaluate CAROL on four compression algorithms and six scientific datasets. On average, when compared to the state-of-the-art FXRZ framework, CAROL achieves 4× speedup in setup time and 36× speedup in inference time, while maintaining less than 1% difference in estimation accuracy.
Full Text Available
FLORAH: a generative model for halo assembly histories

https://doi.org/10.1093/mnras/stae2001

Nguyen, Tri; Modi, Chirag; Yung, L Y Aaron; Somerville, Rachel S (August 2024, Monthly Notices of the Royal Astronomical Society)

ABSTRACT The mass assembly history (MAH) of dark matter haloes plays a crucial role in shaping the formation and evolution of galaxies. MAHs are used extensively in semi-analytic and empirical models of galaxy formation, yet current analytic methods to generate them are inaccurate and unable to capture their relationship with the halo internal structure and large-scale environment. This paper introduces florah (FLOw-based Recurrent model for Assembly Histories), a machine-learning framework for generating assembly histories of ensembles of dark matter haloes. We train florah on the assembly histories from the Gadget at Ultra-high Redshift with Extra Fine Time-steps and vsmdpl N-body simulations and demonstrate its ability to recover key properties such as the time evolution of mass and concentration. We obtain similar results for the galaxy stellar mass versus halo mass relation and its residuals when we run the Santa Cruz semi-analytic model on florah-generated assembly histories and halo formation histories extracted from an N-body simulation. We further show that florah also reproduces the dependence of clustering on properties other than mass (assembly bias), which is not captured by other analytic methods. By combining multiple networks trained on a suite of simulations with different redshift ranges and mass resolutions, we are able to construct accurate main progenitor branches with a wide dynamic mass range from $$z=0$$ up to an ultra-high redshift $$z \approx 20$$, currently far beyond that of a single N-body simulation. florah is the first step towards a machine learning-based framework for planting full merger trees; this will enable the exploration of different galaxy formation scenarios with great computational efficiency at unprecedented accuracy.
Tuning the Magnetic Properties of CrI3 Using Ni Thin Film Deposition for Applications in Spintronic Devices

https://doi.org/10.1021/acsanm.4c06641

Nnokwe, Cynthia; Cunningham, Connor J; Liu, Wenhao; Ye, Gaihua; Nguyen, Tri; Wu, Kai; Hemesath, Colin; Sadler, Caden; Lukashev, Pavel; Zhai, Zixin; et al (January 2025, ACS Applied Nano Materials)

Free, publicly-accessible full text available January 29, 2026
Under-Counted Matrix Completion Without Detection Features

https://doi.org/10.1109/ICASSP49660.2025.10888717

Nguyen, Tri; Ibrahim, Shahana; Hutchinson, Rebecca A; Fu, Xiao (April 2025, IEEE)

Free, publicly-accessible full text available April 6, 2026
Sputtered SnTe Thin Films on Si and Ge as a Plasmonic Material

https://doi.org/10.1021/acsaelm.3c01449

Nguyen, Tri; Nordin, Leland; Mukherjee, Kunal (February 2024, ACS Applied Electronic Materials)

Full Text Available
How Far in Advance Can Deep Learning Predict Tropical Cyclone Formation?

Kieu, C; Nguyen, Quan; Nguyen, Tri (January 2024, AMS 104 Annual Meeting, Baltimore, Maryland)

Full Text Available
Introducing the DREAMS Project: DaRk mattEr and Astrophysics with Machine Learning and Simulations

https://doi.org/10.3847/1538-4357/adb8e5

Rose, Jonah C; Torrey, Paul; Villaescusa-Navarro, Francisco; Lisanti, Mariangela; Nguyen, Tri; Roy, Sandip; Kollmann, Kassidy E; Vogelsberger, Mark; Cyr-Racine, Francis-Yan; Medvedev, Mikhail V; et al (March 2025, The Astrophysical Journal)

Abstract We introduce the DaRk mattEr and Astrophysics with Machine learning and Simulations (DREAMS) project, an innovative approach to understanding the astrophysical implications of alternative dark matter (DM) models and their effects on galaxy formation and evolution. The DREAMS project will ultimately comprise thousands of cosmological hydrodynamic simulations that simultaneously vary over DM physics, astrophysics, and cosmology in modeling a range of systems—from galaxy clusters to ultra-faint satellites. Such extensive simulation suites can provide adequate training sets for machine-learning-based analyses. This paper introduces two new cosmological hydrodynamical suites of warm dark matter (WDM), each comprising 1024 simulations generated using the arepo code. One suite consists of uniform-box simulations covering a $$(25\,h^{-1}\,\mathrm{Mpc})^{3}$$ volume, while the other consists of Milky Way zoom-ins with sufficient resolution to capture the properties of classical satellites. For each simulation, the WDM particle mass is varied along with the initial density field and several parameters controlling the strength of baryonic feedback within the IllustrisTNG model. We provide two examples, separately utilizing emulators and convolutional neural networks, to demonstrate how such simulation suites can be used to disentangle the effects of DM and baryonic physics on galactic properties. The DREAMS project can be extended further to include different DM models, galaxy formation physics, and astrophysical targets. In this way, it will provide an unparalleled opportunity to characterize uncertainties on predictions for small-scale observables, leading to robust predictions for testing the particle physics nature of DM on these scales.
Free, publicly-accessible full text available March 20, 2026
