skip to main content


Title: Molecular Signatures of Adaptation and Selection in Forest Trees
Uncovering the genes and molecular basis of phenotypic variation and adaptation is a major goal in conservation and evolutionary genetics; it also sets the basis for future operational breeding in commercial species, like forest trees. These taxa are characterized by their large size, growth habit and longevity, which hampers the use of reverse-genetic approaches (i.e. from gene function to phenotype) to pinpoint adaptive molecular variants. In this chapter, we summarize the basis of the forward-genetic approaches (i.e. from phenotype to gene function) currently used in forest trees. For each strategy, we provide a brief overview of the statistical approaches employed to identify candidate genes, and then highlight the main findings of landmark studies that provide evidence for adaptation in forest trees. Adaptive and commercial traits are generally well inherited in trees, although they are mostly affected by the variation of multiple genes, each one accounting for a small part of the phenotypic variance of each character. However, some individual and important genes involved in growth, phenology, drought resistance and cold hardiness have been identified; many of them showing evidence of selection across multiple taxa (sometimes including angiosperms and gymnosperms). Future challenges for detecting the signatures and understanding the molecular basis of adaptation in trees include more adequate and precise phenotype assessment in natural populations, and the inclusion of gene interactions and epigenetic variations in current models. The implications of these findings in conservation and breeding of forest trees are finally discussed.  more » « less
Award ID(s):
1461868
NSF-PAR ID:
10210920
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Advances in botanical research
Volume:
74
ISSN:
0065-2296
Page Range / eLocation ID:
265-306
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Context

    Processes that shape genomic and ecological divergence can reveal important evolutionary dynamics to inform the conservation of threatened species.Fontaineais a genus of rainforest shrubs and small trees including critically endangered and threatened species restricted to narrow, but complex geographic and ecological regions. Several species ofFontaineaare subject to spatially explicit conditions and experience limited intra-specific gene flow, likely generating genetic differentiation and local adaptation.

    Objectives

    Here, we explored the genetic and ecological mechanisms underlying patterns of diversification in two, closely related threatenedFontaineaspecies. Our aim was to compare spatial patterns of genetic variation between the vulnerableFontainea australis(Southern Fontainea) and critically endangeredF. oraria(Coastal Fontainea), endemic to the heterogeneous subtropical region of central, eastern Australia, where large-scale clearing has severely reduced rainforest habitat to a fraction (< 1%) of its pre-European settlement extent.

    Methods

    We used a set of 10,000 reduced-representation markers to infer genetic relationships and the drivers of spatial genetic variation across the two species. In addition, we employed a combination of univariate and multivariate genome-environment association analysis using a set of topo-climatic variables to explore potential patterns of local adaptation as a factor impacting genomic divergence.

    Results

    Our study revealed that Coastal Fontainea have a close genetic relationship with Southern Fontainea. We showed that isolation by distance has played a key role in their genetic variation, indicating that vicariance can explain the spatial genetic distribution of the two species. Genotype-environment analyses showed a strong association with temperature and topographic features, suggesting adaptation to localised thermal environments. We used a multivariate redundancy analysis to identify a range of putatively adapted loci associated with local environmental conditions.

    Conclusions

    Divergent selection at the local-habitat scale as a result of dispersal limitations and environmental heterogeneity (including physical barriers) are likely contributors to adaptive divergence between the twoFontaineaspecies. Our findings have presented evidence to indicate that Southern and Coastal Fontainea were comprised of distinct genetic groups and ecotypes, that together may form a single species continuum, with further phenotype research suggested to confirm the current species boundaries. Proactive conservation actions, including assisted migration to enhance the resilience of populations lacking stress-tolerant single nucleotide polymorphisms (SNPs) may be required to secure the long-term future of both taxa. This is especially vital for the critically endangered Coastal Fontainea given projections of habitat decline for the species under future climate scenarios.

     
    more » « less
  2. INTRODUCTION Diverse phenotypes, including large brains relative to body size, group living, and vocal learning ability, have evolved multiple times throughout mammalian history. These shared phenotypes may have arisen repeatedly by means of common mechanisms discernible through genome comparisons. RATIONALE Protein-coding sequence differences have failed to fully explain the evolution of multiple mammalian phenotypes. This suggests that these phenotypes have evolved at least in part through changes in gene expression, meaning that their differences across species may be caused by differences in genome sequence at enhancer regions that control gene expression in specific tissues and cell types. Yet the enhancers involved in phenotype evolution are largely unknown. Sequence conservation–based approaches for identifying such enhancers are limited because enhancer activity can be conserved even when the individual nucleotides within the sequence are poorly conserved. This is due to an overwhelming number of cases where nucleotides turn over at a high rate, but a similar combination of transcription factor binding sites and other sequence features can be maintained across millions of years of evolution, allowing the function of the enhancer to be conserved in a particular cell type or tissue. Experimentally measuring the function of orthologous enhancers across dozens of species is currently infeasible, but new machine learning methods make it possible to make reliable sequence-based predictions of enhancer function across species in specific tissues and cell types. RESULTS To overcome the limits of studying individual nucleotides, we developed the Tissue-Aware Conservation Inference Toolkit (TACIT). Rather than measuring the extent to which individual nucleotides are conserved across a region, TACIT uses machine learning to test whether the function of a given part of the genome is likely to be conserved. More specifically, convolutional neural networks learn the tissue- or cell type–specific regulatory code connecting genome sequence to enhancer activity using candidate enhancers identified from only a few species. This approach allows us to accurately associate differences between species in tissue or cell type–specific enhancer activity with genome sequence differences at enhancer orthologs. We then connect these predictions of enhancer function to phenotypes across hundreds of mammals in a way that accounts for species’ phylogenetic relatedness. We applied TACIT to identify candidate enhancers from motor cortex and parvalbumin neuron open chromatin data that are associated with brain size relative to body size, solitary living, and vocal learning across 222 mammals. Our results include the identification of multiple candidate enhancers associated with brain size relative to body size, several of which are located in linear or three-dimensional proximity to genes whose protein-coding mutations have been implicated in microcephaly or macrocephaly in humans. We also identified candidate enhancers associated with the evolution of solitary living near a gene implicated in separation anxiety and other enhancers associated with the evolution of vocal learning ability. We obtained distinct results for bulk motor cortex and parvalbumin neurons, demonstrating the value in applying TACIT to both bulk tissue and specific minority cell type populations. To facilitate future analyses of our results and applications of TACIT, we released predicted enhancer activity of >400,000 candidate enhancers in each of 222 mammals and their associations with the phenotypes we investigated. CONCLUSION TACIT leverages predicted enhancer activity conservation rather than nucleotide-level conservation to connect genetic sequence differences between species to phenotypes across large numbers of mammals. TACIT can be applied to any phenotype with enhancer activity data available from at least a few species in a relevant tissue or cell type and a whole-genome alignment available across dozens of species with substantial phenotypic variation. Although we developed TACIT for transcriptional enhancers, it could also be applied to genomic regions involved in other components of gene regulation, such as promoters and splicing enhancers and silencers. As the number of sequenced genomes grows, machine learning approaches such as TACIT have the potential to help make sense of how conservation of, or changes in, subtle genome patterns can help explain phenotype evolution. Tissue-Aware Conservation Inference Toolkit (TACIT) associates genetic differences between species with phenotypes. TACIT works by generating open chromatin data from a few species in a tissue related to a phenotype, using the sequences underlying open and closed chromatin regions to train a machine learning model for predicting tissue-specific open chromatin and associating open chromatin predictions across dozens of mammals with the phenotype. [Species silhouettes are from PhyloPic] 
    more » « less
  3. Abstract

    Microgeographic adaptation provides a particularly interesting context for understanding the genetic basis of phenotypic divergence and may also present unique empirical challenges. In particular, plant adaptation to extreme soil mosaics may generate barriers to gene flow or shifts in mating system that confound simple genomic scans for adaptive loci. Here, we combine three approaches – quantitative trait locus (QTL) mapping of candidate intervals in controlled crosses, population resequencing (PoolSeq) and analyses of wild recombinant individuals – to investigate one trait associated withMimulus guttatus(yellow monkeyflower) adaptation to geothermal soils in Yellowstone National Park. We mapped a majorQTLcausing dense leaf trichomes in thermally adapted plants to a <50‐kb region of linkage Group 14 (Tr14) previously implicated in trichome divergence between independentM. guttatuspopulations. A PoolSeq scan of Tr14 region revealed a cluster of six genes, coincident with the inferredQTLpeak, with high allele frequency differences sufficient to explain observed phenotypic differentiation. One of these, the R2R3MYBtranscription factor Migut.N02661, is a plausible functional candidate and was also strongly associated (r2 = 0.27) with trichome phenotype in analyses of wild‐collected admixed individuals. Although functional analyses will be necessary to definitively link molecular variants in Tr14 with trichome divergence, our analyses are a major step in that direction. They point to a simple, and parallel, genetic basis for one axis ofMimulus guttatusadaptation to an extreme habitat, suggest a broadly conserved genetic basis for trichome variation across flowering plants and pave the way for further investigations of this challenging case of microgeographic incipient speciation.

     
    more » « less
  4. Genes that affect adaptive traits have been identified, but our knowledge of the genetic basis of adaptation in a more general sense (across multiple traits) remains limited. We combined population-genomic analyses of evolve-and-resequence experiments, genome-wide association mapping of performance traits, and analyses of gene expression to fill this knowledge gap and shed light on the genomics of adaptation to a marginal host (lentil) by the seed beetle Callosobruchus maculatus. Using population-genomic approaches, we detected modest parallelism in allele frequency change across replicate lines during adaptation to lentil. Mapping populations derived from each lentil-adapted line revealed a polygenic basis for two host-specific performance traits (weight and development time), which had low to modest heritabilities. We found less evidence of parallelism in genotype-phenotype associations across these lines than in allele frequency changes during the experiments. Differential gene expression caused by differences in recent evolutionary history exceeded that caused by immediate rearing host. Together, the three genomic datasets suggest that genes affecting traits other than weight and development time are likely to be the main causes of parallel evolution and that detoxification genes (especially cytochrome P450s and beta-glucosidase) could be especially important for colonization of lentil by C. maculatus. 
    more » « less
  5. Summary

    Global change forecasts in ecosystems require knowledge of within‐species diversity, particularly of dominant species within communities. We assessed site‐level diversity and capacity for adaptation inBouteloua gracilis, the dominant species in the Central US shortgrass steppe biome.

    We quantified genetic diversity from 17 sites across regional scales, north to south from New Mexico to South Dakota, and local scales in northern Colorado. We also quantified phenotype and plasticity within and among sites and determined the extent to which phenotypic diversity inB. graciliswas correlated with climate.

    Genome sequencing indicated pronounced population structure at the regional scale, and local differences indicated that gene flow and/or dispersal may also be limited. Within a common environment, we found evidence of genetic divergence in biomass‐related phenotypes, plasticity, and phenotypic variance, indicating functional divergence and different adaptive potential. Phenotypes were differentiated according to climate, chiefly median Palmer Hydrological Drought Index and other aridity metrics.

    Our results indicate conclusive differences in genetic variation, phenotype, and plasticity in this species and suggest a mechanism explaining variation in shortgrass steppe community responses to global change. This analysis ofB. gracilisintraspecific diversity across spatial scales will improve conservation and management of the shortgrass steppe ecosystem in the future.

     
    more » « less