

Title: Genomic analyses provide insights into peach local adaptation and responses to climate change
The environment has constantly shaped plant genomes, but the genetic bases underlying how plants adapt to environmental influences remain largely unknown. We constructed a high-density genomic variation map of 263 geographically representative peach landraces and wild relatives. A combination of whole-genome selection scans and genome-wide environmental association studies (GWEAS) was performed to reveal the genomic bases of peach adaptation to diverse climates. A total of 2092 selective sweeps that underlie local adaptation to both mild and extreme climates were identified, including 339 sweeps conferring genomic patterns of adaptation to high altitudes. Using GWEAS, a total of 2755 genomic loci strongly associated with 51 specific environmental variables were detected. The molecular mechanisms underlying adaptive evolution in response to high drought and strong UVB, and of cold hardiness, sugar content, flesh color, and bloom date, were revealed. Finally, based on 30 years of observation, a candidate gene associated with bloom date advance, representing peach responses to global warming, was identified. Collectively, our study provides insights into the molecular bases of how environments have shaped peach genomes by natural selection and adds candidate genes for future studies on evolutionary genetics, adaptation to climate change, and breeding.
Award ID(s):
1855585
NSF-PAR ID:
10321876
Date Published:
Journal Name:
Genome Research
Volume:
31
Issue:
4
ISSN:
1088-9051
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Studies of species that experience environmental heterogeneity across their distributions have become an important tool for understanding mechanisms of adaptation and predicting responses to climate change. We examine population structure, demographic history and environmentally associated genomic variation in Bombus vosnesenskii, a common bumble bee in the western USA, using whole genome resequencing of populations distributed across a broad range of latitudes and elevations. We find that B. vosnesenskii exhibits minimal population structure and weak isolation by distance, confirming results from previous studies using other molecular marker types. Similarly, demographic analyses with Sequentially Markovian Coalescent models suggest that minimal population structure may have persisted since the last interglacial period, with genomes from different parts of the species range showing similar historical effective population size trajectories and relatively small fluctuations through time. Redundancy analysis revealed a small amount of genomic variation explained by bioclimatic variables. Environmental association analysis with latent factor mixed modelling (LFMM2) identified few outlier loci that were sparsely distributed throughout the genome and although a few putative signatures of selective sweeps were identified, none encompassed particularly large numbers of loci. Some outlier loci were in genes with known regulatory relationships, suggesting the possibility of weak selection, although compared with other species examined with similar approaches, evidence for extensive local adaptation signatures in the genome was relatively weak. Overall, results indicate B. vosnesenskii is an example of a generalist with a high degree of flexibility in its environmental requirements that may ultimately benefit the species under periods of climate change.

     
  2. INTRODUCTION: Thousands of genetic variants have been associated with human diseases and traits through genome-wide association studies (GWASs). Translating these discoveries into improved therapeutics requires discerning which variants among hundreds of candidates are causally related to disease risk. To date, only a handful of causal variants have been confirmed. Here, we leverage 100 million years of mammalian evolution to address this major challenge. RATIONALE: We compared genomes from hundreds of mammals and identified bases with unusually few variants (evolutionarily constrained). Constraint is a measure of functional importance that is agnostic to cell type or developmental stage. It can be applied to investigate any heritable disease or trait and is complementary to resources using cell type– and time point–specific functional assays like Encyclopedia of DNA Elements (ENCODE) and Genotype-Tissue Expression (GTEx). RESULTS: Using constraint calculated across placental mammals, 3.3% of bases in the human genome are significantly constrained, including 57.6% of coding bases. Most constrained bases (80.7%) are noncoding. Common variants (allele frequency ≥ 5%) and low-frequency variants (0.5% ≤ allele frequency < 5%) are depleted for constrained bases (1.85 versus 3.26% expected by chance, P < 2.2 × 10⁻³⁰⁸). Pathogenic ClinVar variants are more constrained than benign variants (P < 2.2 × 10⁻¹⁶). The most constrained common variants are more enriched for disease single-nucleotide polymorphism (SNP)–heritability in 63 independent GWASs. The enrichment of SNP-heritability in constrained regions is greater (7.8-fold) than previously reported in mammals and is even higher in primates (11.1-fold). It exceeds the enrichment of SNP-heritability in nonsynonymous coding variants (7.2-fold) and fine-mapped expression quantitative trait loci (eQTL)–SNPs (4.8-fold). The enrichment peaks near constrained bases, with a log-linear decrease of SNP-heritability enrichment as a function of the distance to a constrained base. Zoonomia constraint scores improve functionally informed fine-mapping. Variants at sites constrained in mammals and primates have greater posterior inclusion probabilities and higher per-SNP contributions. In addition, using both constraint and functional annotations improves polygenic risk score accuracy across a range of traits. Finally, incorporating constraint information into the analysis of noncoding somatic variants in medulloblastomas identifies new candidate driver genes. CONCLUSION: Genome-wide measures of evolutionary constraint can help discern which variants are functionally important. This information may accelerate the translation of genomic discoveries into the biological, clinical, and therapeutic knowledge that is required to understand and treat human disease.

    Figure: Using evolutionary constraint in genomic studies of human diseases. (A) Constraint was calculated across 240 mammal species, including 43 primates (teal line). (B) Pathogenic ClinVar variants (N = 73,885) are more constrained across mammals than benign variants (N = 231,642; P < 2.2 × 10⁻¹⁶). (C) More-constrained bases are more enriched for trait-associated variants (63 GWASs). (D) Enrichment of heritability is higher in constrained regions than in functional annotations (left), even in a joint model with 106 annotations (right). (E) Fine-mapping (PolyFun) using a model that includes constraint scores identifies an experimentally validated association at rs1421085. Error bars represent 95% confidence intervals. BMI, body mass index; LF, low frequency; PIP, posterior inclusion probability.
  3. Abstract

    Understanding the molecular basis of repeated evolution improves our ability to predict evolution across the tree of life. Only since the last decade has high‐throughput sequencing enabled comparative genome scans to thoroughly examine the repeatability of genetic changes driving repeated phenotypic evolution. The Asian corn borer (ACB), Ostrinia furnacalis (Guenée), and the European corn borer (ECB), Ostrinia nubilalis (Hübner), are two closely related moths displaying repeatable phenological adaptation to a wide range of climates on two separate continents, largely manifesting as changes in the timing of diapause induction and termination across latitude. Candidate genes underlying diapause variation in North American ECB have been previously identified. Here, we sampled seven ACB populations across 23 degrees of latitude in China to elucidate the genetic basis of diapause variation and evolutionary mechanisms driving parallel clinal responses in the two species. Using pooled whole‐genome sequencing (Pool‐seq) data, population genomic analyses revealed hundreds of single nucleotide polymorphisms (SNP) whose allele frequencies covaried with mean diapause phenotypes along the cline. Genes involved in circadian rhythm were over‐represented among candidate genes with strong signatures of spatially varying selection. Only one of two circadian clock genes associated with diapause evolution in ECB showed evidence of reuse in ACB (period [per]), but per alleles were not shared between species nor with their outgroup, implicating independent mutational paths. Nonetheless, evidence of adaptive introgression was discovered at putative diapause loci located elsewhere in the genome, suggesting that de novo mutations and introgression might both underlie the repeated phenological evolution.

     
  4. Abstract

    Background: Genome structural variations (SVs) have been associated with key traits in a wide range of agronomically important species; however, SV profiles of peach and their functional impacts remain largely unexplored. Results: Here, we present an integrated map of 202,273 SVs from 336 peach genomes. A substantial number of SVs have been selected during peach domestication and improvement, which together affect 2268 genes. Genome-wide association studies of 26 agronomic traits using these SVs identify a number of candidate causal variants. A 9-bp insertion in Prupe.4G186800, which encodes a NAC transcription factor, is shown to be associated with early fruit maturity, and a 487-bp deletion in the promoter of PpMYB10.1 is associated with flesh color around the stone. In addition, a 1.67 Mb inversion is highly associated with fruit shape, and a gene adjacent to the inversion breakpoint, PpOFP1, regulates flat shape formation. Conclusions: The integrated peach SV map and the identified candidate genes and variants represent valuable resources for future genomic research and breeding in peach.
  5. Abstract

    Genetic diversity becomes structured among populations over time due to genetic drift and divergent selection. Although population structure is often treated as a uniform underlying factor, recent resequencing studies of wild populations have demonstrated that diversity in many regions of the genome may be structured quite dissimilarly to the genome‐wide pattern. Here, we explored the adaptive and nonadaptive causes of such genomic heterogeneity using population‐level, whole genome resequencing data obtained from annual Mimulus guttatus individuals collected across a rugged environment landscape. We found substantial variation in how genetic differentiation is structured both within and between chromosomes, although, in contrast to other studies, known inversion polymorphisms appear to serve only minor roles in this heterogeneity. In addition, much of the genome can be clustered into eight among‐population genetic differentiation patterns, but only two of these clusters are particularly consistent with patterns of isolation by distance. By performing genotype‐environment association analysis, we also identified genomic intervals where local adaptation to specific climate factors has accentuated genetic differentiation among populations, and candidate genes in these windows indicate climate adaptation may proceed through changes affecting specialized metabolism, drought resistance, and development. Finally, by integrating our findings with previous studies, we show that multiple aspects of plant reproductive biology may be common targets of balancing selection and that variants historically involved in climate adaptation among populations have probably also fuelled rapid adaptation to microgeographic environmental variation within sites.

     