Title: The landscape of transcriptional and translational changes over 22 years of bacterial adaptation
Organisms can adapt to an environment by taking multiple mutational paths. This redundancy at the genetic level, where many mutations have similar phenotypic and fitness effects, can make untangling the molecular mechanisms of complex adaptations difficult. Here we use the E. coli long-term evolution experiment (LTEE) as a model to address this challenge. To understand how different genomic changes could lead to parallel fitness gains, we characterize the landscape of transcriptional and translational changes across 12 replicate populations evolving in parallel for 50,000 generations. By quantifying absolute changes in mRNA abundances, we show not only that all evolved lines have more mRNAs but also that this increase in mRNA abundance scales with cell size. We also find that despite few shared mutations at the genetic level, clones from replicate populations in the LTEE are remarkably similar in their gene expression patterns at both the transcriptional and translational levels. Furthermore, we show that the majority of the expression changes are due to changes at the transcriptional level with very few translational changes. Finally, we show how mutations in transcriptional regulators lead to consistent and parallel changes in the expression levels of downstream genes. These results deepen our understanding of the molecular mechanisms underlying complex adaptations and provide insights into the repeatability of evolution.
Award ID(s):
1936046
NSF-PAR ID:
10377814
Journal Name:
eLife
Volume:
11
ISSN:
2050-084X
Sponsoring Org:
National Science Foundation
More Like this
  1. INTRODUCTION Diverse phenotypes, including large brains relative to body size, group living, and vocal learning ability, have evolved multiple times throughout mammalian history. These shared phenotypes may have arisen repeatedly by means of common mechanisms discernible through genome comparisons. RATIONALE Protein-coding sequence differences have failed to fully explain the evolution of multiple mammalian phenotypes. This suggests that these phenotypes have evolved at least in part through changes in gene expression, meaning that their differences across species may be caused by differences in genome sequence at enhancer regions that control gene expression in specific tissues and cell types. Yet the enhancers involved in phenotype evolution are largely unknown. Sequence conservation–based approaches for identifying such enhancers are limited because enhancer activity can be conserved even when the individual nucleotides within the sequence are poorly conserved. This is due to an overwhelming number of cases where nucleotides turn over at a high rate, but a similar combination of transcription factor binding sites and other sequence features can be maintained across millions of years of evolution, allowing the function of the enhancer to be conserved in a particular cell type or tissue. Experimentally measuring the function of orthologous enhancers across dozens of species is currently infeasible, but new machine learning methods make it possible to make reliable sequence-based predictions of enhancer function across species in specific tissues and cell types. RESULTS To overcome the limits of studying individual nucleotides, we developed the Tissue-Aware Conservation Inference Toolkit (TACIT). Rather than measuring the extent to which individual nucleotides are conserved across a region, TACIT uses machine learning to test whether the function of a given part of the genome is likely to be conserved. 
More specifically, convolutional neural networks learn the tissue- or cell type–specific regulatory code connecting genome sequence to enhancer activity using candidate enhancers identified from only a few species. This approach allows us to accurately associate differences between species in tissue or cell type–specific enhancer activity with genome sequence differences at enhancer orthologs. We then connect these predictions of enhancer function to phenotypes across hundreds of mammals in a way that accounts for species’ phylogenetic relatedness. We applied TACIT to identify candidate enhancers from motor cortex and parvalbumin neuron open chromatin data that are associated with brain size relative to body size, solitary living, and vocal learning across 222 mammals. Our results include the identification of multiple candidate enhancers associated with brain size relative to body size, several of which are located in linear or three-dimensional proximity to genes whose protein-coding mutations have been implicated in microcephaly or macrocephaly in humans. We also identified candidate enhancers associated with the evolution of solitary living near a gene implicated in separation anxiety and other enhancers associated with the evolution of vocal learning ability. We obtained distinct results for bulk motor cortex and parvalbumin neurons, demonstrating the value in applying TACIT to both bulk tissue and specific minority cell type populations. To facilitate future analyses of our results and applications of TACIT, we released predicted enhancer activity of >400,000 candidate enhancers in each of 222 mammals and their associations with the phenotypes we investigated. CONCLUSION TACIT leverages predicted enhancer activity conservation rather than nucleotide-level conservation to connect genetic sequence differences between species to phenotypes across large numbers of mammals. 
TACIT can be applied to any phenotype with enhancer activity data available from at least a few species in a relevant tissue or cell type and a whole-genome alignment available across dozens of species with substantial phenotypic variation. Although we developed TACIT for transcriptional enhancers, it could also be applied to genomic regions involved in other components of gene regulation, such as promoters and splicing enhancers and silencers. As the number of sequenced genomes grows, machine learning approaches such as TACIT have the potential to help make sense of how conservation of, or changes in, subtle genome patterns can help explain phenotype evolution. Tissue-Aware Conservation Inference Toolkit (TACIT) associates genetic differences between species with phenotypes. TACIT works by generating open chromatin data from a few species in a tissue related to a phenotype, using the sequences underlying open and closed chromatin regions to train a machine learning model for predicting tissue-specific open chromatin and associating open chromatin predictions across dozens of mammals with the phenotype. [Species silhouettes are from PhyloPic] 
  2. Gao, Beile (Ed.)
    ABSTRACT Escherichia coli can survive for long periods in batch culture in the laboratory, where they experience a stressful and heterogeneous environment. During this incubation, E. coli acquires mutations that are selected in response to this environment, ultimately leading to evolved populations that are better adapted to these complex conditions; studying such populations can lead to a better understanding of evolutionary mechanisms. Mutations in regulatory genes often play a role in adapting to heterogeneous environments. To identify such mutations, we examined transcriptional differences during log phase growth in unaged cells compared to those that had been aged for 10 days and regrown. We identified expression changes in genes involved in motility and chemotaxis after adaptation to long-term cultures. We hypothesized that aged populations would also have phenotypic changes in motility and that motility may play a role in survival and adaptation to long-term cultures. While aged populations did show an increase in motility, this increase was not essential for survival in long-term cultures. We identified mutations in the regulatory gene sspA and other genes that may contribute to the observed differences in motility. Taken together, these data provide an overall picture of the role of mutations in regulatory genes for adaptation while underscoring that not all changes that occur during evolution in stressful environments are adaptive. IMPORTANCE Understanding how bacteria adapt in long-term cultures both aids the development of better treatment options for bacterial infections and gives insight into the mechanisms involved in bacterial evolution. In the past, it has been difficult to study these organisms in their natural environments. By using experimental evolution in heterogeneous and stressful laboratory conditions, we can more closely mimic natural environments and examine evolutionary mechanisms. 
One way to observe these mechanisms is to look at transcriptomic and genomic data from cells adapted to these complex conditions. Here, we found that although aged cells increase motility, this increase is not essential for survival in these conditions. These data emphasize that not all changes that occur due to evolutionary processes are adaptive, but these observations could still lead to hypotheses about the causative mutations. The information gained here allows us to make inferences about general mechanisms underlying phenotypic changes due to evolution. 
  3. Begun, D (Ed.)
    Abstract Changes in gene regulation at multiple levels may comprise an important share of the molecular changes underlying adaptive evolution in nature. However, few studies have assayed within- and between-population variation in gene regulatory traits at a transcriptomic scale, and therefore inferences about the characteristics of adaptive regulatory changes have been elusive. Here, we assess quantitative trait differentiation in gene expression levels and alternative splicing (intron usage) between three closely related pairs of natural populations of Drosophila melanogaster from contrasting thermal environments that reflect three separate instances of cold tolerance evolution. The cold-adapted populations were known to show population genetic evidence for parallel evolution at the SNP level, and here we find evidence for parallel expression evolution between them, with stronger parallelism at larval and adult stages than for pupae. We also implement a flexible method to estimate cis- vs trans-encoded contributions to expression or splicing differences at the adult stage. The apparent contributions of cis- vs trans-regulation to adaptive evolution vary substantially among population pairs. While two of three population pairs show a greater enrichment of cis-regulatory differences among adaptation candidates, trans-regulatory differences are more likely to be implicated in parallel expression changes between population pairs. Genes with significant cis-effects are enriched for signals of elevated genetic differentiation between cold- and warm-adapted populations, suggesting that they are potential targets of local adaptation. These findings expand our knowledge of adaptive gene regulatory evolution and our ability to make inferences about this important and widespread process. 
  4. Zhang, George (Ed.)
    Abstract All organisms encode enzymes that replicate, maintain, pack, recombine, and repair their genetic material. For this reason, mutation rates and biases also evolve by mutation, variation, and natural selection. By examining metagenomic time series of the Lenski long-term evolution experiment (LTEE) with Escherichia coli (Good BH, McDonald MJ, Barrick JE, Lenski RE, Desai MM. 2017. The dynamics of molecular evolution over 60,000 generations. Nature 551(7678):45–50.), we find that local mutation rate variation has evolved during the LTEE. Each LTEE population has evolved idiosyncratic differences in its rates of point mutations, indels, and mobile element insertions, due to the fixation of various hypermutator and antimutator alleles. One LTEE population, called Ara+3, shows a strong, symmetric wave pattern in its density of point mutations, radiating from the origin of replication. This pattern is largely missing from the other LTEE populations, most of which evolved missense, indel, or structural mutations in topA, fis, and dusB—loci that all affect DNA topology. The distribution of mutations in those genes over time suggests epistasis and historical contingency in the evolution of DNA topology, which may have in turn affected local mutation rates. Overall, the replicate populations of the LTEE have largely diverged in their mutation rates and biases, even though they have adapted to identical abiotic conditions. 
  5. INTRODUCTION Eukaryotes contain a highly conserved signaling pathway that becomes rapidly activated when adenosine triphosphate (ATP) levels decrease, as happens during conditions of nutrient shortage or mitochondrial dysfunction. The adenosine monophosphate (AMP)–activated protein kinase (AMPK) is activated within minutes of energetic stress and phosphorylates a limited number of substrates to biochemically rewire metabolism from an anabolic state to a catabolic state to restore metabolic homeostasis. AMPK also promotes prolonged metabolic adaptation through transcriptional changes, decreasing biosynthetic genes while increasing expression of genes promoting lysosomal and mitochondrial biogenesis. The transcription factor EB (TFEB) is a well-appreciated effector of AMPK-dependent signals, but many of the molecular details of how AMPK controls these processes remain unknown. RATIONALE The requirement of AMPK and its specific downstream targets that control aspects of the transcriptional adaptation of metabolism remain largely undefined. We performed time courses examining gene expression changes after various mitochondrial stresses in wild-type (WT) or AMPK knockout cells. We hypothesized that a previously described interacting protein of AMPK, folliculin-interacting protein 1 (FNIP1), may be involved in how AMPK promotes increases in gene expression after metabolic stress. FNIP1 forms a complex with the protein folliculin (FLCN), together acting as a guanosine triphosphatase (GTPase)–activating protein (GAP) for RagC. The FNIP1-FLCN complex has emerged as an amino acid sensor to the mechanistic target of rapamycin complex 1 (mTORC1), involved in how amino acids control TFEB activation. We therefore examined whether AMPK may regulate FNIP1 to dominantly control TFEB independently of amino acids. RESULTS AMPK was found to govern expression of a core set of genes after various mitochondrial stresses. 
Hallmark features of this response were activation of TFEB and increases in the transcription of genes specifying lysosomal and mitochondrial biogenesis. AMPK directly phosphorylated five conserved serine residues in FNIP1, suppressing the function of the FLCN-FNIP1 GAP complex, which resulted in dissociation of RagC and mTOR from the lysosome, promoting nuclear translocation of TFEB even in the presence of amino acids. FNIP1 phosphorylation was required for AMPK to activate TFEB and for subsequent increases in peroxisome proliferation–activated receptor gamma coactivator 1-alpha (PGC1α) and estrogen-related receptor alpha (ERRα) mRNAs. Cells in which the five serines in FNIP1 were mutated to alanine were unable to increase lysosomal and mitochondrial gene expression programs after treatment with mitochondrial poisons or AMPK activators despite the presence and normal regulation of all other substrates of AMPK. By contrast, neither AMPK nor its control of FNIP1 was needed for activation of TFEB after amino acid withdrawal, illustrating the specificity to energy-limited conditions. CONCLUSION Our data establish FNIP1 as the long-sought substrate of AMPK that controls TFEB translocation to the nucleus, defining AMPK phosphorylation of FNIP1 as a singular event required for increased lysosomal and mitochondrial gene expression programs after metabolic stresses. This study also illuminates the larger biological question of how mitochondrial damage triggers a temporal response of repair and replacement of damaged mitochondria: Within early hours, AMPK-FNIP1–activated TFEB induces a wave of lysosome and autophagy genes to promote degradation of damaged mitochondria, and a few hours later, TFEB–up-regulated PGC1α and ERRα promote expression of a second wave of genes specifying mitochondrial biogenesis. These insights open therapeutic avenues for several common diseases associated with mitochondrial dysfunction, ranging from neurodegeneration to type 2 diabetes to cancer. 
Mitochondrial damage activates AMPK to phosphorylate FNIP1, stimulating TFEB translocation to the nucleus and sequential waves of lysosomal and mitochondrial biogenesis. After mitochondrial damage, activated AMPK phosphorylates FNIP1 (1), causing inhibition of FLCN-FNIP1 GAP activity (2). This leads to accumulation of RagC in its GTP-bound form, causing dissociation of RagC, mTORC1, and TFEB from the lysosome (3). TFEB is therefore not phosphorylated and translocates to the nucleus, inducing transcription of lysosomal or autophagy genes, with parallel increases in NT-PGC1α mRNA (4), which, in concert with ERRα (5), subsequently induces mitochondrial biogenesis (6). CCCP, carbonyl cyanide m-chlorophenylhydrazone; CLEAR, coordinated lysosomal expression and regulation; GDP, guanosine diphosphate; P, phosphorylation. [Figure created using BioRender] 