Abstract MotivationCell function is regulated by gene regulatory networks (GRNs) defined by protein-mediated interaction between constituent genes. Despite advances in experimental techniques, we can still measure only a fraction of the processes that govern GRN dynamics. To infer the properties of GRNs using partial observation, unobserved sequential processes can be replaced with distributed time delays, yielding non-Markovian models. Inference methods based on the resulting model suffer from the curse of dimensionality. ResultsWe develop a simulation-based Bayesian MCMC method employing an approximate likelihood for the efficient and accurate inference of GRN parameters when only some of their products are observed. We illustrate our approach using a two-step activation model: an activation signal leads to the accumulation of an unobserved regulatory protein, which triggers the expression of observed fluorescent proteins. With prior information about observed fluorescent protein synthesis, our method successfully infers the dynamics of the unobserved regulatory protein. We can estimate the delay and kinetic parameters characterizing target regulation including transcription, translation, and target searching of an unobserved protein from experimental measurements of the products of its target gene. Our method is scalable and can be used to analyze non-Markovian models with hidden components. Availability and implementationOur code is implemented in R and is freely available with a simple example data at https://github.com/Mathbiomed/SimMCMC. 
                        more » 
                        « less   
                    
                            
                            Bayesian inference of distributed time delay in transcriptional and translational regulation
                        
                    
    
            Abstract MotivationAdvances in experimental and imaging techniques have allowed for unprecedented insights into the dynamical processes within individual cells. However, many facets of intracellular dynamics remain hidden, or can be measured only indirectly. This makes it challenging to reconstruct the regulatory networks that govern the biochemical processes underlying various cell functions. Current estimation techniques for inferring reaction rates frequently rely on marginalization over unobserved processes and states. Even in simple systems this approach can be computationally challenging, and can lead to large uncertainties and lack of robustness in parameter estimates. Therefore we will require alternative approaches to efficiently uncover the interactions in complex biochemical networks. ResultsWe propose a Bayesian inference framework based on replacing uninteresting or unobserved reactions with time delays. Although the resulting models are non-Markovian, recent results on stochastic systems with random delays allow us to rigorously obtain expressions for the likelihoods of model parameters. In turn, this allows us to extend MCMC methods to efficiently estimate reaction rates, and delay distribution parameters, from single-cell assays. We illustrate the advantages, and potential pitfalls, of the approach using a birth–death model with both synthetic and experimental data, and show that we can robustly infer model parameters using a relatively small number of measurements. We demonstrate how to do so even when only the relative molecule count within the cell is measured, as in the case of fluorescence microscopy. Availability and implementationAccompanying code in R is available at https://github.com/cbskust/DDE_BD. Supplementary informationSupplementary data are available at Bioinformatics online. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 1662290
- PAR ID:
- 10117057
- Publisher / Repository:
- Oxford University Press
- Date Published:
- Journal Name:
- Bioinformatics
- ISSN:
- 1367-4803
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract MotivationModeling single-cell gene expression trends along cell pseudotime is a crucial analysis for exploring biological processes. Most existing methods rely on nonparametric regression models for their flexibility; however, nonparametric models often provide trends too complex to interpret. Other existing methods use interpretable but restrictive models. Since model interpretability and flexibility are both indispensable for understanding biological processes, the single-cell field needs a model that improves the interpretability and largely maintains the flexibility of nonparametric regression models. ResultsHere, we propose the single-cell generalized trend model (scGTM) for capturing a gene’s expression trend, which may be monotone, hill-shaped or valley-shaped, along cell pseudotime. The scGTM has three advantages: (i) it can capture non-monotonic trends that are easy to interpret, (ii) its parameters are biologically interpretable and trend informative, and (iii) it can flexibly accommodate common distributions for modeling gene expression counts. To tackle the complex optimization problems, we use the particle swarm optimization algorithm to find the constrained maximum likelihood estimates for the scGTM parameters. As an application, we analyze several single-cell gene expression datasets using the scGTM and show that scGTM can capture interpretable gene expression trends along cell pseudotime and reveal molecular insights underlying biological processes. Availability and implementationThe Python package scGTM is open-access and available at https://github.com/ElvisCuiHan/scGTM. Supplementary informationSupplementary data are available at Bioinformatics online.more » « less
- 
            Fu, Feng (Ed.)Undetected infections fuel the dissemination of many infectious agents. However, identification of unobserved infectious individuals remains challenging due to limited observations of infections and imperfect knowledge of key transmission parameters. Here, we use an ensemble Bayesian inference method to infer unobserved infections using partial observations. The ensemble inference method can represent uncertainty in model parameters and update model states using all ensemble members collectively. We perform extensive experiments in both model-generated and real-world networks in which individuals have differential but unknown transmission rates. The ensemble method outperforms several alternative approaches for a variety of network structures and observation rates, despite that the model is mis-specified. Additionally, the computational complexity of this algorithm scales almost linearly with the number of nodes in the network and the number of observations, respectively, exhibiting the potential to apply to large-scale networks. The inference method may support decision-making under uncertainty and be adapted for use for other dynamical models in networks.more » « less
- 
            Biophysical models describing complex cellular phenomena typically include systems of nonlinear differential equations with many free parameters. While experimental measurements can fix some parameters, those describing internal cellular processes frequently remain inaccessible. Hence, a proliferation of free parameters risks overfitting the data, limiting the model's predictive power. In this study, we develop systematic methods, applying statistical analysis and dynamical-systems theory, to reduce parameter count in a biophysical model. We demonstrate our techniques on a five-variable computational model designed to describe active, mechanical motility of auditory hair cells. Specifically, we use two statistical measures, the total-effect and PAWN indices, to rank each free parameter by its influence on selected, core properties of the model. With the resulting ranking, we fix most of the less influential parameters, yielding a five-parameter model with refined predictive power. We validate the theoretical model with experimental recordings of active hair-bundle motility, specifically by using Akaike and Bayesian information criteria after obtaining maximum-likelihood fits. As a result, we determine the system's most influential parameters, which illuminate the key biophysical elements of the cell's overall features. Even though we demonstrate with a concrete example, our techniques provide a general framework, applicable to other biophysical systems. Published by the American Physical Society2024more » « less
- 
            In many cell types, mitochondria undergo extensive fusion and fission to form dynamic, responsive network structures that contribute to a number of homeostatic, metabolic, and signaling functions. The relationship between the dynamic interactions of individual mitochondrial units and the cell-scale network architecture remains an open area of study. In this work, we use coarse-grained simulations and approximate analytic models to establish how the network morphology is governed by local mechanical and kinetic parameters. The transition between fragmented structures and extensive networks is controlled by local fusion-to-fission ratios, network density, and geometric constraints. Similar fusion rate constants are found to account for the very different structures formed by mammalian networks (poised at the percolation transition) and well-connected budding yeast networks. Over a broad parameter range, the simulated network structures can be described by effective mean-field association constants that exhibit a nonlinear dependence on the microscopic nonequilibrium fusion, fission, and transport rates. Intermediate fusion rate constants are shown to result in the highest rates of network remodeling, with mammalian mitochondrial networks situated in a regime of high turnover. This spatially resolved modeling and simulation framework helps elucidate the emergence of cellular scale network structures, and allows for the quantitative extraction of microscopic kinetic parameters from past and future experimental data. Published by the American Physical Society2024more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
