skip to main content


Title: KilonovaNet : Surrogate models of kilonova spectra with conditional variational autoencoders
ABSTRACT Detailed radiative transfer simulations of kilonova spectra play an essential role in multimessenger astrophysics. Using the simulation results in parameter inference studies requires building a surrogate model from the simulation outputs to use in algorithms requiring sampling. In this work, we present kilonovanet, an implementation of conditional variational autoencoders (cVAEs) for the construction of surrogate models of kilonova spectra. This method can be trained on spectra directly, removing overhead time of pre-processing spectra, and greatly speeds up parameter inference time. We build surrogate models of three state-of-the-art kilonova simulation data sets and present in-depth surrogate error evaluation methods, which can in general be applied to any surrogate construction method. By creating synthetic photometric observations from the spectral surrogate, we perform parameter inference for the observed light-curve data of GW170817 and compare the results with previous analyses. Given the speed with which kilonovanet performs during parameter inference, it will serve as a useful tool in future gravitational wave observing runs to quickly analyse potential kilonova candidates.  more » « less
Award ID(s):
2122312
PAR ID:
10353486
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Monthly Notices of the Royal Astronomical Society
Volume:
516
Issue:
1
ISSN:
0035-8711
Page Range / eLocation ID:
1137 to 1148
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT

    We develop a method to compute synthetic kilonova light curves that combine numerical relativity simulations of neutron star mergers and the SNEC radiation–hydrodynamics code. We describe our implementation of initial and boundary conditions, r-process heating, and opacities for kilonova simulations. We validate our approach by carefully checking that energy conservation is satisfied and by comparing the SNEC results with those of two semi-analytic light-curve models. We apply our code to the calculation of colour light curves for three binaries having different mass ratios (equal and unequal mass) and different merger outcome (short-lived and long-lived remnants). We study the sensitivity of our results to hydrodynamic effects, nuclear physics uncertainties in the heating rates, and duration of the merger simulations. We find that hydrodynamics effects are typically negligible and that homologous expansion is a good approximation in most cases. However, pressure forces can amplify the impact of uncertainties in the radioactive heating rates. We also study the impact of shocks possibly launched into the outflows by a relativistic jet. None of our models match AT2017gfo, the kilonova in GW170817. This points to possible deficiencies in our merger simulations and kilonova models that neglect non-LTE effects and possible additional energy injection from the merger remnant and to the need to go beyond the assumption of spherical symmetry adopted in this work.

     
    more » « less
  2. While Bayesian inference is the gold standard for uncertainty quantification and propagation, its use within physical chemistry encounters formidable computational barriers. These bottlenecks are magnified for modeling data with many independent variables, such as X-ray/neutron scattering patterns and electromagnetic spectra. To address this challenge, we employ local Gaussian process (LGP) surrogate models to accelerate Bayesian optimization over these complex thermophysical properties. The time-complexity of the LGPs scales linearly in the number of independent variables, in stark contrast to the computationally expensive cubic scaling of conventional Gaussian processes. To illustrate the method, we trained a LGP surrogate model on the radial distribution function of liquid neon and observed a 1,760,000-fold speed-up compared to molecular dynamics simulation, beating a conventional GP by three orders-of-magnitude. We conclude that LGPs are robust and efficient surrogate models poised to expand the application of Bayesian inference in molecular simulations to a broad spectrum of experimental data. 
    more » « less
  3. ABSTRACT

    Type Ia supernovae (SNe Ia) are standarizable candles whose observed light curves can be used to infer their distances, which can in turn be used in cosmological analyses. As the quantity of observed SNe Ia grows with current and upcoming surveys, increasingly scalable analyses are necessary to take full advantage of these new data sets for precise estimation of cosmological parameters. Bayesian inference methods enable fitting SN Ia light curves with robust uncertainty quantification, but traditional posterior sampling using Markov Chain Monte Carlo (MCMC) is computationally expensive. We present an implementation of variational inference (VI) to accelerate the fitting of SN Ia light curves using the BayeSN hierarchical Bayesian model for time-varying SN Ia spectral energy distributions. We demonstrate and evaluate its performance on both simulated light curves and data from the Foundation Supernova Survey with two different forms of surrogate posterior–a multivariate normal and a custom multivariate zero-lower-truncated normal distribution–and compare them with the Laplace Approximation and full MCMC analysis. To validate of our variational approximation, we calculate the Pareto-smoothed importance sampling diagnostic, and perform variational simulation-based calibration. The VI approximation achieves similar results to MCMC but with an order-of-magnitude speed-up for the inference of the photometric distance moduli. Overall, we show that VI is a promising method for scalable parameter inference that enables analysis of larger data sets for precision cosmology.

     
    more » « less
  4. Abstract

    A novel modeling framework that simultaneously improves accuracy, predictability, and computational efficiency is presented. It embraces the benefits of three modeling techniques integrated together for the first time: surrogate modeling, parameter inference, and data assimilation. The use of polynomial chaos expansion (PCE) surrogates significantly decreases computational time. Parameter inference allows for model faster convergence, reduced uncertainty, and superior accuracy of simulated results. Ensemble Kalman filters assimilate errors that occur during forecasting. To examine the applicability and effectiveness of the integrated framework, we developed 18 approaches according to how surrogate models are constructed, what type of parameter distributions are used as model inputs, and whether model parameters are updated during the data assimilation procedure. We conclude that (1) PCE must be built over various forcing and flow conditions, and in contrast to previous studies, it does not need to be rebuilt at each time step; (2) model parameter specification that relies on constrained, posterior information of parameters (so‐calledSelectedspecification) can significantly improve forecasting performance and reduce uncertainty bounds compared toRandomspecification using prior information of parameters; and (3) no substantial differences in results exist between single and dual ensemble Kalman filters, but the latter better simulates flood peaks. The use of PCE effectively compensates for the computational load added by the parameter inference and data assimilation (up to ~80 times faster). Therefore, the presented approach contributes to a shift in modeling paradigm arguing that complex, high‐fidelity hydrologic and hydraulic models should be increasingly adopted for real‐time and ensemble flood forecasting.

     
    more » « less
  5. ABSTRACT

    We present an improved version of the 3D Monte Carlo radiative transfer code possis to model kilonovae from neutron star mergers, wherein nuclear heating rates, thermalization efficiencies, and wavelength-dependent opacities depend on local properties of the ejecta and time. Using an axially symmetric two-component ejecta model, we explore how simplistic assumptions on heating rates, thermalization efficiencies, and opacities often found in the literature affect kilonova spectra and light curves. Specifically, we compute five models: one (FIDUCIAL) with an appropriate treatment of these three quantities, one (SIMPLE-HEAT) with uniform heating rates throughout the ejecta, one (SIMPLE-THERM) with a constant and uniform thermalization efficiency, one (SIMPLE-OPAC) with grey opacities, and one (SIMPLE-ALL) with all these three simplistic assumptions combined. We find that deviations from the FIDUCIAL model are of several (∼1–10) magnitudes and are generally larger for the SIMPLE-OPAC and SIMPLE-ALL compared to the SIMPLE-THERM and SIMPLE-HEAT models. The discrepancies generally increase from a face-on to an edge-on view of the system, from early to late epochs and from infrared to ultraviolet/optical wavelengths. This work indicates that kilonova studies using either of these simplistic assumptions ought to be treated with caution and that appropriate systematic uncertainties ought to be added to kilonova light curves when performing inference on ejecta parameters.

     
    more » « less