skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on October 28, 2026

Title: Generative diffusion model surrogates for mechanistic agent-based biological models
Abstract Mechanistic, multicellular, agent-based models are commonly used to investigate tissue, organ, and organism-scale biology at single-cell resolution. The Cellular-Potts Model (CPM) is a powerful and popular framework for developing and interrogating these models. CPMs become computationally expensive at large space- and time- scales making application and investigation of developed models difficult. Surrogate models may allow for the accelerated evaluation of CPMs of complex biological systems. However, the stochastic nature of these models means each set of parameters may give rise to different model configurations, complicating surrogate model development. In this work, we leverage denoising diffusion probabilistic models (DDPMs) to train a generative AI surrogate of a CPM used to investigatein vitrovasculogenesis. We describe the use of an image classifier to learn the characteristics that define unique areas of a 2-dimensional parameter space. We then apply this classifier to aid in surrogate model selection and verification. Our CPM model surrogate generates model configurations 20,000 timesteps ahead of a reference configuration and demonstrates approximately a 22x reduction in computational time as compared to native code execution. Our work represents a step towards the implementation of DDPMs to develop digital twins of stochastic biological systems.  more » « less
Award ID(s):
2212550
PAR ID:
10651622
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
IOP Publishing
Date Published:
Journal Name:
Machine Learning: Science and Technology
Volume:
6
Issue:
4
ISSN:
2632-2153
Page Range / eLocation ID:
045024
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Maini, Philip K (Ed.)
    The Cellular-Potts model is a powerful and ubiquitous framework for developing computational models for simulating complex multicellular biological systems. Cellular-Potts models (CPMs) are often computationally expensive due to the explicit modeling of interactions among large numbers of individual model agents and diffusive fields described by partial differential equations (PDEs). In this work, we develop a convolutional neural network (CNN) surrogate model using a U-Net architecture that accounts for periodic boundary conditions. We use this model to accelerate the evaluation of a mechanistic CPM previously used to investigatein vitrovasculogenesis. The surrogate model was trained to predict 100 computational steps ahead (Monte-Carlo steps, MCS), accelerating simulation evaluations by a factor of 562 times compared to single-core CPM code execution on CPU. Over short timescales of up to 3 recursive evaluations, or 300 MCS, our model captures the emergent behaviors demonstrated by the original Cellular-Potts model such as vessel sprouting, extension and anastomosis, and contraction of vascular lacunae. This approach demonstrates the potential for deep learning to serve as a step toward efficient surrogate models for CPM simulations, enabling faster evaluation of computationally expensive CPM simulations of biological processes. 
    more » « less
  2. Abstract Sleep is critical to a variety of cognitive functions and insufficient sleep can have negative consequences for mood and behavior across the lifespan. An important open question is how sleep duration is related to functional brain organization which may in turn impact cognition. To characterize the functional brain networks related to sleep across youth and young adulthood, we analyzed data from the publicly available Human Connectome Project (HCP) dataset, which includesn‐back task‐based and resting‐state fMRI data from adults aged 22–35 years (taskn = 896; restn = 898). We applied connectome‐based predictive modeling (CPM) to predict participants' mean sleep duration from their functional connectivity patterns. Models trained and tested using 10‐fold cross‐validation predicted self‐reported average sleep duration for the past month fromn‐back task and resting‐state connectivity patterns. We replicated this finding in data from the 2‐year follow‐up study session of the Adolescent Brain Cognitive Development (ABCD) Study, which also includesn‐back task and resting‐state fMRI for adolescents aged 11–12 years (taskn = 786; restn = 1274) as well as Fitbit data reflecting average sleep duration per night over an average duration of 23.97 days. CPMs trained and tested with 10‐fold cross‐validation again predicted sleep duration fromn‐back task and resting‐state functional connectivity patterns. Furthermore, demonstrating that predictive models are robust across independent datasets, CPMs trained on rest data from the HCP sample successfully generalized to predict sleep duration in the ABCD Study sample and vice versa. Thus, common resting‐state functional brain connectivity patterns reflect sleep duration in youth and young adults. 
    more » « less
  3. Abstract We introduce the concept of decision‐focused surrogate modeling for solving computationally challenging nonlinear optimization problems in real‐time settings. The proposed data‐driven framework seeks to learn a simpler, for example, convex, surrogate optimization model that is trained to minimize thedecision prediction error, which is defined as the difference between the optimal solutions of the original and the surrogate optimization models. The learning problem, formulated as a bilevel program, can be viewed as a data‐driven inverse optimization problem to which we apply a decomposition‐based solution algorithm from previous work. We validate our framework through numerical experiments involving the optimization of common nonlinear chemical processes such as chemical reactors, heat exchanger networks, and material blending systems. We also present a detailed comparison of decision‐focused surrogate modeling with standard data‐driven surrogate modeling methods and demonstrate that our approach is significantly more data‐efficient while producing simple surrogate models with high decision prediction accuracy. 
    more » « less
  4. Abstract BackgroundThe biophysics of an organism span multiple scales from subcellular to organismal and include processes characterized by spatial properties, such as the diffusion of molecules, cell migration, and flow of intravenous fluids. Mathematical biology seeks to explain biophysical processes in mathematical terms at, and across, all relevant spatial and temporal scales, through the generation of representative models. While non-spatial, ordinary differential equation (ODE) models are often used and readily calibrated to experimental data, they do not explicitly represent the spatial and stochastic features of a biological system, limiting their insights and applications. However, spatial models describing biological systems with spatial information are mathematically complex and computationally expensive, which limits the ability to calibrate and deploy them and highlights the need for simpler methods able to model the spatial features of biological systems. ResultsIn this work, we develop a formal method for deriving cell-based, spatial, multicellular models from ODE models of population dynamics in biological systems, and vice versa. We provide examples of generating spatiotemporal, multicellular models from ODE models of viral infection and immune response. In these models, the determinants of agreement of spatial and non-spatial models are the degree of spatial heterogeneity in viral production and rates of extracellular viral diffusion and decay. We show how ODE model parameters can implicitly represent spatial parameters, and cell-based spatial models can generate uncertain predictions through sensitivity to stochastic cellular events, which is not a feature of ODE models. Using our method, we can test ODE models in a multicellular, spatial context and translate information to and from non-spatial and spatial models, which help to employ spatiotemporal multicellular models using calibrated ODE model parameters. We additionally investigate objects and processes implicitly represented by ODE model terms and parameters and improve the reproducibility of spatial, stochastic models. ConclusionWe developed and demonstrate a method for generating spatiotemporal, multicellular models from non-spatial population dynamics models of multicellular systems. We envision employing our method to generate new ODE model terms from spatiotemporal and multicellular models, recast popular ODE models on a cellular basis, and generate better models for critical applications where spatial and stochastic features affect outcomes. 
    more » « less
  5. ABSTRACT Agent-based models (ABMs) have become essential tools for simulating complex biological, ecological, and social systems where emergent behaviors arise from the interactions among individual agents. Quantifying uncertainty through global sensitivity analysis is crucial for assessing the robustness and reliability of ABM predictions. However, most global sensitivity methods demand substantial computational resources, making them impractical for highly complex models. Here, we introduce SMoRe GloS (SurrogateModeling forRecapitulatingGlobalSensitivity), a novel, computationally efficient method for performing global sensitivity analysis of ABMs. By leveraging explicitly formulated surrogate models, SMoRe GloS allows for comprehensive parameter space exploration and uncertainty quantification without sacrificing accuracy. We demonstrate our method’s flexibility by applying it to two biological ABMs: a simple 2D cell proliferation assay and a complex 3D vascular tumor growth model. Our results show that SMoRe GloS is compatible with simpler methods like the Morris one-at-a-time method, and more computationally intensive variance-based methods like eFAST. SMoRe GloS accurately recovered global sensitivity indices in each case while achieving substantial speedups, completing analyses in minutes. In contrast, direct implementation of eFAST amounted to several days of CPU time for the complex ABM. Remarkably, our method also estimates sensitivities for ABM parameters representing processes not explicitly included in the surrogate model, further enhancing its utility. By making global sensitivity analysis feasible for computationally expensive models, SMoRe GloS opens up new opportunities for uncertainty quantification in complex systems, allowing for more in depth exploration of model behavior, thereby increasing confidence in model predictions. 
    more » « less