skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Few Fixed Variants between Trophic Specialist Pupfish Species Reveal Candidate Cis -Regulatory Alleles Underlying Rapid Craniofacial Divergence
Abstract Investigating closely related species that rapidly evolved divergent feeding morphology is a powerful approach to identify genetic variation underlying variation in complex traits. This can also lead to the discovery of novel candidate genes influencing natural and clinical variation in human craniofacial phenotypes. We combined whole-genome resequencing of 258 individuals with 50 transcriptomes to identify candidate cis-acting genetic variation underlying rapidly evolving craniofacial phenotypes within an adaptive radiation of Cyprinodon pupfishes. This radiation consists of a dietary generalist species and two derived trophic niche specialists—a molluscivore and a scale-eating species. Despite extensive morphological divergence, these species only diverged 10 kya and produce fertile hybrids in the laboratory. Out of 9.3 million genome-wide SNPs and 80,012 structural variants, we found very few alleles fixed between species—only 157 SNPs and 87 deletions. Comparing gene expression across 38 purebred F1 offspring sampled at three early developmental stages, we identified 17 fixed variants within 10 kb of 12 genes that were highly differentially expressed between species. By measuring allele-specific expression in F1 hybrids from multiple crosses, we found that the majority of expression divergence between species was explained by trans-regulatory mechanisms. We also found strong evidence for two cis-regulatory alleles affecting expression divergence of two genes with putative effects on skeletal development (dync2li1 and pycr3). These results suggest that SNPs and structural variants contribute to the evolution of novel traits and highlight the utility of the San Salvador Island pupfish system as an evolutionary model for craniofacial development.  more » « less
Award ID(s):
1938571
PAR ID:
10276284
Author(s) / Creator(s):
;
Editor(s):
Wittkopp, Patricia
Date Published:
Journal Name:
Molecular Biology and Evolution
Volume:
38
Issue:
2
ISSN:
1537-1719
Page Range / eLocation ID:
405 to 423
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT Genome-wide association studies (GWAS) can identify genetic variants responsible for naturally occurring and quantitative phenotypic variation. Association studies therefore provide a powerful complement to approaches that rely on de novo mutations for characterizing gene function. Although bacteria should be amenable to GWAS, few GWAS have been conducted on bacteria, and the extent to which nonindependence among genomic variants (e.g., linkage disequilibrium [LD]) and the genetic architecture of phenotypic traits will affect GWAS performance is unclear. We apply association analyses to identify candidate genes underlying variation in 20 biochemical, growth, and symbiotic phenotypes among 153 strains of Ensifer meliloti . For 11 traits, we find genotype-phenotype associations that are stronger than expected by chance, with the candidates in relatively small linkage groups, indicating that LD does not preclude resolving association candidates to relatively small genomic regions. The significant candidates show an enrichment for nucleotide polymorphisms (SNPs) over gene presence-absence variation (PAV), and for five traits, candidates are enriched in large linkage groups, a possible signature of epistasis. Many of the variants most strongly associated with symbiosis phenotypes were in genes previously identified as being involved in nitrogen fixation or nodulation. For other traits, apparently strong associations were not stronger than the range of associations detected in permuted data. In sum, our data show that GWAS in bacteria may be a powerful tool for characterizing genetic architecture and identifying genes responsible for phenotypic variation. However, careful evaluation of candidates is necessary to avoid false signals of association. IMPORTANCE Genome-wide association analyses are a powerful approach for identifying gene function. These analyses are becoming commonplace in studies of humans, domesticated animals, and crop plants but have rarely been conducted in bacteria. We applied association analyses to 20 traits measured in Ensifer meliloti , an agriculturally and ecologically important bacterium because it fixes nitrogen when in symbiosis with leguminous plants. We identified candidate alleles and gene presence-absence variants underlying variation in symbiosis traits, antibiotic resistance, and use of various carbon sources; some of these candidates are in genes previously known to affect these traits whereas others were in genes that have not been well characterized. Our results point to the potential power of association analyses in bacteria, but also to the need to carefully evaluate the potential for false associations. 
    more » « less
  2. Major advances in crop yields are needed in the coming decades. However, plant breeding is currently limited by incremental improvements in quantitative traits that often rely on laborious selection of rare naturally occurring mutations in gene-regulatory regions. Here, we demonstrate that CRISPR/Cas9 genome editing of promoters generates diverse cis-regulatory alleles that provide beneficial quantitative variation for breeding. We devised a simple genetic scheme, which exploits trans-generational heritability of Cas9 activity in heterozygous loss-of-function mutant backgrounds, to rapidly evaluate the phenotypic impact of numerous promoter variants for genes regulating three major productivity traits in tomato: fruit size, inflorescence branching, and plant architecture. Our approach allows immediate selection and fixation of novel alleles in transgene-free plants and fine manipulation of yield components. Beyond a platform to enhance variation for diverse agricultural traits, our findings provide a foundation for dissecting complex relationships between gene-regulatory changes and control of quantitative traits. 
    more » « less
  3. Abstract The study of local adaptation in the presence of ongoing gene flow is the study of natural selection in action, revealing the functional genetic diversity most relevant to contemporary pressures. In addition to individual genes, genome-wide architecture can itself evolve to enable adaptation. Distributed across a steep thermal gradient along the east coast of North America, Atlantic silversides (Menidia menidia) exhibit an extraordinary degree of local adaptation in a suite of traits, and the capacity for rapid adaptation from standing genetic variation, but we know little about the patterns of genomic variation across the species range that enable this remarkable adaptability. Here, we use low-coverage, whole-transcriptome sequencing of Atlantic silversides sampled along an environmental cline to show marked signatures of divergent selection across a gradient of neutral differentiation. Atlantic silversides sampled across 1371 km of the southern section of its distribution have very low genome-wide differentiation (median FST = 0.006 across 1.9 million variants), consistent with historical connectivity and observations of recent migrants. Yet almost 14,000 single nucleotide polymorphisms (SNPs) are nearly fixed (FST > 0.95) for alternate alleles. Highly differentiated SNPs cluster into four tight linkage disequilibrium (LD) blocks that span hundreds of genes and several megabases. Variants in these LD blocks are disproportionately nonsynonymous and concentrated in genes enriched for multiple functions related to known adaptations in silversides, including variation in lipid storage, metabolic rate, and spawning behavior. Elevated levels of absolute divergence and demographic modeling suggest selection maintaining divergence across these blocks under gene flow. These findings represent an extreme case of heterogeneity in levels of differentiation across the genome, and highlight how gene flow shapes genomic architecture in continuous populations. Locally adapted alleles may be common features of populations distributed along environmental gradients, and will likely be key to conserving variation to enable future responses to environmental change. 
    more » « less
  4. Abstract Sex determination, the developmental process by which sexually dimorphic phenotypes are established, evolves fast. Evolutionary turnover in a sex determination pathway may occur via selection on alleles that are genetically linked to a new master sex determining locus on a newly formed proto‐sex chromosome. Species with polygenic sex determination, in which master regulatory genes are found on multiple different proto‐sex chromosomes, are informative models to study the evolution of sex determination and sex chromosomes. House flies are such a model system, with male determining loci possible on all six chromosomes and a female‐determiner on one of the chromosomes as well. The two most common male‐determining proto‐Y chromosomes form latitudinal clines on multiple continents, suggesting that temperature variation is an important selection pressure responsible for maintaining polygenic sex determination in this species. Temperature‐dependent fitness effects could be manifested through temperature‐dependent gene expression differences across proto‐Y chromosome genotypes. These gene expression differences may be the result ofcisregulatory variants that affect the expression of genes on the proto‐sex chromosomes, ortranseffects of the proto‐Y chromosomes on genes elswhere in the genome. We used RNA‐seq to identify genes whose expression depends on proto‐Y chromosome genotype and temperature in adult male house flies. We found no evidence for ecologically meaningful temperature‐dependent expression differences of sex determining genes between male genotypes, but we were probably not sampling an appropriate developmental time‐point to identify such effects. In contrast, we identified many other genes whose expression depends on the interaction between proto‐Y chromosome genotype and temperature, including genes that encode proteins involved in reproduction, metabolism, lifespan, stress response, and immunity. Notably, genes with genotype‐by‐temperature interactions on expression were not enriched on the proto‐sex chromosomes. Moreover, there was no evidence that temperature‐dependent expression is driven by chromosome‐widecis‐regulatory divergence between the proto‐Y and proto‐X alleles. Therefore, if temperature‐dependent gene expression is responsible for differences in phenotypes and fitness of proto‐Y genotypes across house fly populations, these effects are driven by a small number of temperature‐dependent alleles on the proto‐Y chromosomes that may havetranseffects on the expression of genes on other chromosomes. 
    more » « less
  5. Abstract We measured the floral bud transcriptome of 151 fully sequenced lines of Mimulus guttatus from one natural population. Thousands of single nucleotide polymorphisms (SNPs) are implicated as transcription regulators, but there is a striking difference in the allele frequency spectrum of cis-acting and trans-acting mutations. Cis-SNPs have intermediate frequencies (consistent with balancing selection) while trans-SNPs exhibit a rare-alleles model (consistent with purifying selection). This pattern only becomes clear when transcript variation is normalized on a gene-to-gene basis. If a global normalization is applied, as is typically in RNAseq experiments, asymmetric transcript distributions combined with “rarity disequilibrium” produce a superabundance of false positives for trans-acting SNPs. To explore the cause of purifying selection on trans-acting mutations, we identified gene expression modules as sets of coexpressed genes. The extent to which trans-acting mutations influence modules is a strong predictor of allele frequency. Mutations altering expression of genes with high “connectedness” (those that are highly predictive of the representative module expression value) have the lowest allele frequency. The expression modules can also predict whole-plant traits such as flower size. We find that a substantial portion of the genetic (co)variance among traits can be described as an emergent property of genetic effects on expression modules. 
    more » « less