skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A Bayesian non-parametric mixed-effects model of microbial growth curves
Substantive changes in gene expression, metabolism, and the proteome are manifested in overall changes in microbial population growth. Quantifying how microbes grow is therefore fundamental to areas such as genetics, bioengineering, and food safety. Traditional parametric growth curve models capture the population growth behavior through a set of summarizing parameters. However, estimation of these parameters from data is confounded by random effects such as experimental variability, batch effects or differences in experimental material. A systematic statistical method to identify and correct for such confounding effects in population growth data is not currently available. Further, our previous work has demonstrated that parametric models are insufficient to explain and predict microbial response under non-standard growth conditions. Here we develop a hierarchical Bayesian non-parametric model of population growth that identifies the latent growth behavior and response to perturbation, while simultaneously correcting for random effects in the data. This model enables more accurate estimates of the biological effect of interest, while better accounting for the uncertainty due to technical variation. Additionally, modeling hierarchical variation provides estimates of the relative impact of various confounding effects on measured population growth.  more » « less
Award ID(s):
1651117
PAR ID:
10234578
Author(s) / Creator(s):
; ; ; ; ;
Editor(s):
Papin, Jason A.
Date Published:
Journal Name:
PLOS Computational Biology
Volume:
16
Issue:
10
ISSN:
1553-7358
Page Range / eLocation ID:
e1008366
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. NA (Ed.)
    Variation in a sample of molecular sequence data informs about the past evolutionary history of the sample’s population. Traditionally, Bayesian modelling coupled with the standard coalescent is used to infer the sample’s bifurcating genealogy and demographic and evolutionary parameters such as effective population size and mutation rates. However, there are many situations where binary coalescent models do not accurately reflect the true underlying ancestral processes. Here, we propose a Bayesian non-parametric method for inferring effective population size trajectories from a multifurcating genealogy under the Lambda-coalescent. In particular, we jointly estimate the effective population size and the model parameter for the Beta-coalescent model, a special type of Lambda-coalescent. Finally, we test our methods on simulations and apply them to study various viral dynamics as well as Japanese sardine population size changes over time. The code and vignettes can be found in the phylodyn package. This article is part of the theme issue ‘“A mathematical theory of evolution”: phylogenetic models dating back 100 years’. 
    more » « less
  2. Parametric methods, such as autoregressive models or latent growth modeling, are usually inflexible to model the dependence and nonlinear effects among the changes of latent traits whenever the time gap is irregular and the recorded time points are individually varying. Often in practice, the growth trend of latent traits is subject to certain monotone and smooth conditions. To incorporate such conditions and to alleviate the strong parametric assumption on regressing latent trajectories, a flexible nonparametric prior has been introduced to model the dynamic changes of latent traits for item response theory models over the study period. Suitable Bayesian computation schemes are developed for such analysis of the longitudinal and dichotomous item responses. Simulation studies and a real data example from educational testing have been used to illustrate our proposed methods. 
    more » « less
  3. Abstract Data integration is a powerful tool for facilitating a comprehensive and generalizable understanding of microbial communities and their association with outcomes of interest. However, integrating data sets from different studies remains a challenging problem because of severe batch effects, unobserved confounding variables, and high heterogeneity across data sets. We propose a new data integration method called MetaDICT, which initially estimates the batch effects by weighting methods in causal inference literature and then refines the estimation via a novel shared dictionary learning. Compared with existing methods, MetaDICT can better avoid the overcorrection of batch effects and preserve biological variation when there exist unobserved confounding variables or data sets are highly heterogeneous across studies. Furthermore, MetaDICT can generate comparable embedding at both taxa and sample levels that can be used to unravel the hidden structure of the integrated data and improve the integrative analysis. Applications to synthetic and real microbiome data sets demonstrate the robustness and effectiveness of MetaDICT in integrative analysis. Using MetaDICT, we characterize microbial interaction, identify generalizable microbial signatures, and enhance the accuracy of disease prediction in an integrative analysis of colorectal cancer metagenomics studies. 
    more » « less
  4. Abstract A central challenge in global change research is the projection of the future behavior of a system based upon past observations. Tree‐ring data have been used increasingly over the last decade to project tree growth and forest ecosystem vulnerability under future climate conditions. But how can the response of tree growth to past climate variation predict the future, when the future does not look like the past? Space‐for‐time substitution (SFTS) is one way to overcome the problem of extrapolation: the response at a given location in a warmer future is assumed to follow the response at a warmer location today. Here we evaluated an SFTS approach to projecting future growth of Douglas‐fir (Pseudotsuga menziesii), a species that occupies an exceptionally large environmental space in North America. We fit a hierarchical mixed‐effects model to capture ring‐width variability in response to spatial and temporal variation in climate. We found opposing gradients for productivity and climate sensitivity with highest growth rates and weakest response to interannual climate variation in the mesic coastal part of Douglas‐fir's range; narrower rings and stronger climate sensitivity occurred across the semi‐arid interior. Ring‐width response to spatial versus temporal temperature variation was opposite in sign, suggesting that spatial variation in productivity, caused by local adaptation and other slow processes, cannot be used to anticipate changes in productivity caused by rapid climate change. We thus substituted only climate sensitivities when projecting future tree growth. Growth declines were projected across much of Douglas‐fir's distribution, with largest relative decreases in the semiarid U.S. Interior West and smallest in the mesic Pacific Northwest. We further highlight the strengths of mixed‐effects modeling for reviving a conceptual cornerstone of dendroecology, Cook's 1987 aggregate growth model, and the great potential to use tree‐ring networks and results as a calibration target for next‐generation vegetation models. 
    more » « less
  5. Social interactions with conspecifics are key to the fitness of most animals and, through the transmission opportunities they provide, are also key to the fitness of their parasites. As a result, research to date has largely focused on the role of host social behavior in imposing selection on parasites, particularly their virulence and transmission phenotypes. However, host social behavior also influences the distribution of parasites among hosts, with implications for their evolution through non-random mating, gene flow, and genetic drift, and thus ability to respond to that selection. Here, we review the paucity of empirical studies on parasites, and draw from empirical studies of free-living organisms and population genetic theory to propose several mechanisms by which host social behavior potentially drives parasite evolution through these less-well studied mechanisms. We focus on the guppy host and Gyrodactylus (Monogenea) ectoparasitic flatworm system and follow a spatially hierarchical outline to highlight that social behavior varies between individuals, and between host populations across the landscape, generating a mosaic of ecological and evolutionary outcomes for their infecting parasites. We argue that the guppy-Gyrodactylus system presents a unique opportunity to address this fundamental knowledge gap in our understanding of the connection between host social behavior and parasite evolution. Individual differences in host social behavior generates fine-scale changes in the spatial distribution of parasite genotypes, shape the size, and diversity of their infecting parasite populations and may generate non-random mating on, and non-random transmission between hosts. While at population and metapopulation level, variation in host social behavior interacts with landscape structure to affect parasite gene flow, effective population size, and genetic drift to alter the coevolutionary potential of local adaptation. 
    more » « less