skip to main content

Title: Testing Modules for Experiments in Stellar Astrophysics (MESA)

Regular, automated testing is a foundational principle of modern software development. Numerous widely used continuous integration systems exist, but they are often not suitable for the unique needs of scientific simulation software. Here we describe the testing infrastructure developed for and used by the Modules for Experiments in Stellar Astrophysics (MESA) project. This system allows the computationally demanding MESA test suite to be regularly run on a heterogeneous set of computers and aggregates and displays the testing results in a form that allows for the rapid identification and diagnosis of regressions. Regularly collecting comprehensive testing data also enables longitudinal studies of the performance of the software and the properties of the models it generates.

more » « less
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
DOI PREFIX: 10.3847
Date Published:
Journal Name:
The Astrophysical Journal Supplement Series
Medium: X Size: Article No. 50
["Article No. 50"]
Sponsoring Org:
National Science Foundation
More Like this
  1. Summary

    Researchers and practitioners have designed and implemented various automated test case generators to support effective software testing. Such generators exist for various languages (e.g., Java, C#, or Python) and various platforms (e.g., desktop, web, or mobile applications). The generators exhibit varying effectiveness and efficiency, depending on the testing goals they aim to satisfy (e.g., unit‐testing of libraries versus system‐testing of entire applications) and the underlying techniques they implement. In this context, practitioners need to be able to compare different generators to identify the most suited one for their requirements, while researchers seek to identify future research directions. This can be achieved by systematically executing large‐scale evaluations of different generators. However, executing such empirical evaluations is not trivial and requires substantial effort to select appropriate benchmarks, setup the evaluation infrastructure, and collect and analyse the results. In this Software Note, we present ourJUnit Generation Benchmarking Infrastructure(JUGE) supporting generators (search‐based, random‐based, symbolic execution, etc.) seeking to automate the production of unit tests for various purposes (validation, regression testing, fault localization, etc.). The primary goal is to reduce the overall benchmarking effort, ease the comparison of several generators, and enhance the knowledge transfer between academia and industry by standardizing the evaluation and comparison process. Since 2013, several editions of a unit testing tool competition, co‐located with the Search‐Based Software Testing Workshop, have taken place whereJUGEwas used and evolved. As a result, an increasing amount of tools (over 10) from academia and industry have been evaluated onJUGE, matured over the years, and allowed the identification of future research directions. Based on the experience gained from the competitions, we discuss the expected impact ofJUGEin improving the knowledge transfer on tools and approaches for test generation between academia and industry. Indeed, theJUGEinfrastructure demonstrated an implementation design that is flexible enough to enable the integration of additional unit test generation tools, which is practical for developers and allows researchers to experiment with new and advanced unit testing tools and approaches.

    more » « less
  2. Abstract

    Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, (: effect of the exposure on the mediator after adjusting for confounders; : effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for large‐scale one at a time mediation hypothesis testing. These methods are commonly used for continuous outcomes and continuous mediators assuming there is no exposure‐mediator interaction so that the product has a causal interpretation as the indirect effect. The first class of methods ignores the impact of different structures under the composite null hypothesis, namely, (1) ; (2) ; and (3) . The second class of methods weights the reference distribution under each case of the null to form a mixture reference distribution. The third class constructs a composite test statistic using the threepvalues obtained under each case of the null so that the reference distribution of the composite statistic is approximately . In addition to these existing methods, we developed the Sobel‐comp method belonging to the second class, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods belonging to these three classes in terms of the false positive rates (FPRs) under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the second class of methods which uses a mixture reference distribution could best maintain the FPRs at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi‐Ethnic Study of Atherosclerosis (MESA). We provide guidelines for choosing the optimal mediation hypothesis testing method in practice and develop an R packagemedScanavailable on the CRAN for implementing all the six methods.

    more » « less
  3. Abstract

    We update the capabilities of the open-knowledge software instrument Modules for Experiments in Stellar Astrophysics (MESA). The newauto_diffmodule implements automatic differentiation inMESA, an enabling capability that alleviates the need for hard-coded analytic expressions or finite-difference approximations. We significantly enhance the treatment of the growth and decay of convection inMESAwith a new model for time-dependent convection, which is particularly important during late-stage nuclear burning in massive stars and electron-degenerate ignition events. We strengthenMESA’s implementation of the equation of state, and we quantify continued improvements to energy accounting and solver accuracy through a discussion of different energy equation features and enhancements. To improve the modeling of stars inMESA, we describe key updates to the treatment of stellar atmospheres, molecular opacities, Compton opacities, conductive opacities, element diffusion coefficients, and nuclear reaction rates. We introduce treatments of starspots, an important consideration for low-mass stars, and modifications for superadiabatic convection in radiation-dominated regions. We describe new approaches for increasing the efficiency of calculating monochromatic opacities and radiative levitation, and for increasing the efficiency of evolving the late stages of massive stars with a new operator-split nuclear burning mode. We close by discussing major updates toMESA’s software infrastructure that enhance source code development and community engagement.

    more » « less
  4. Abstract

    We enhance the treatment of crystallization for models of white dwarfs (WDs) in the stellar evolution software Modules for Experiments in Stellar Astrophysics (MESA) by implementing carbon–oxygen (C/O) phase separation. The phase separation process during crystallization leads to transport of oxygen toward the centers of WDs, resulting in a more compact structure that liberates gravitational energy as additional heating that modestly slows WD cooling timescales. We quantify this cooling delay in MESA C/O WD models over the mass range 0.5–1.0M, finding delays of 0.5–0.8 Gyr for typical C/O interior profiles. MESA WD cooling timescales including this effect are generally comparable to other WD evolution models that make similar assumptions about input physics. When considering phase separation alongside22Ne sedimentation, however, we find that both MESA and BaSTI WD cooling models predict a more modest sedimentation delay than the latest LPCODE models, and this may therefore require a reevaluation of previously proposed solutions to some WD cooling anomalies that were based on LPCODE models of22Ne sedimentation. Our implementation of C/O phase separation in the open-source stellar evolution software MESA provides an important tool for building realistic grids of WD cooling models, as well as a framework for expanding on our implementation to explore additional physical processes related to phase transitions and associated fluid motions in WD interiors.

    more » « less
  5. Abstract

    The dimmest and most numerous outlier of the Type Ia supernova population, the Type Iax event, is increasingly being found in the results of observational campaigns. There is currently no single accepted model to describe these events. This 2D study explores the viability of modeling Type Iax events as a hybrid C/O/Ne white dwarf progenitor undergoing a deflagration using the multiphysics software FLASH. This hybrid was created using the stellar evolution code MESA, and its C-depleted core and mixed structure have demonstrated lower yields than traditional C/O progenitors in previous deflagration-to-detonation studies. To generate a sample, 30 “realizations” of this simulation were performed, the only difference being the shape of the initial match head used to start the deflagration. Consistent with earlier work, these realizations produce the familiar hot dense bound remnant surrounded by sparse ejecta. Our results indicate that the majority of the star remains unburned (∼70%) and bound (>90%). Our realizations produce total ejecta yields on the order of 10−2–10−1M, ejected56Ni yields on the order of 10−4–10−2M, and ejecta kinetic energies on the order of 1048–1049erg. Compared to yields inferred from recent observations of the dimmest Type Iax events—SN 2007qd, SN 2008ha, SN 2010ae, SN 2019gsc, SN 2019muj, SN 2020kyg, and SN 2021fcg—our simulation produces comparable56Ni yields but too-small total yields and kinetic energies. Reignition of the remnant is also seen in some realizations.

    more » « less