skip to main content

Title: Global-level population genomics reveals differential effects of geography and phylogeny on horizontal gene transfer in soil bacteria
Although microorganisms are known to dominate Earth’s biospheres and drive biogeochemical cycling, little is known about the geographic distributions of microbial populations or the environmental factors that pattern those distributions. We used a global-level hierarchical sampling scheme to comprehensively characterize the evolutionary relationships and distributional limitations of the nitrogen-fixing bacterial symbionts of the crop chickpea, generating 1,027 draft whole-genome sequences at the level of bacterial populations, including 14 high-quality PacBio genomes from a phylogenetically representative subset. We find that diverse Mesorhizobium taxa perform symbiosis with chickpea and have largely overlapping global distributions. However, sampled locations cluster based on the phylogenetic diversity of Mesorhizobium populations, and diversity clusters correspond to edaphic and environmental factors, primarily soil type and latitude. Despite long-standing evolutionary divergence and geographic isolation, the diverse taxa observed to nodulate chickpea share a set of integrative conjugative elements (ICEs) that encode the major functions of the symbiosis. This symbiosis ICE takes 2 forms in the bacterial chromosome—tripartite and monopartite—with tripartite ICEs confined to a broadly distributed superspecies clade. The pairwise evolutionary relatedness of these elements is controlled as much by geographic distance as by the evolutionary relatedness of the background genome. In contrast, diversity in the broader gene content of Mesorhizobium genomes follows a tight linear relationship with core genome phylogenetic distance, with little detectable effect of geography. These results illustrate how geography and demography can operate differentially on the evolution of bacterial genomes and offer useful insights for the development of improved technologies for sustainable agriculture.  more » « less
Award ID(s):
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more » ; ; ; ; ; ; « less
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Page Range / eLocation ID:
15200 to 15209
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Symbiosis often occurs between partners with distinct life history characteristics and dispersal mechanisms. Many bacterial symbionts have genomes comprising multiple replicons with distinct rates of evolution and horizontal transmission. Such differences might drive differences in population structure between hosts and symbionts and among the elements of the divided genomes of bacterial symbionts. These differences might, in turn, shape the evolution of symbiotic interactions and bacterial evolution. Here we use whole genome resequencing of a hierarchically structured sample of 191 strains ofSinorhizobium meliloticollected from 21 locations in southern Europe to characterize population structures of this bacterial symbiont, which forms a root nodule symbiosis with the host plantMedicago truncatula.S. melilotigenomes showed high local (within‐site) variation and little isolation by distance. This was particularly true for the two symbiosis elements, pSymA and pSymB, which have population structures that are similar to each other, but distinct from both the bacterial chromosome and the host plant. Given limited recombination on the chromosome, compared to the symbiosis elements, distinct population structures may result from differences in effective gene flow. Alternatively, positive or purifying selection, with little recombination, may explain distinct geographical patterns at the chromosome. Discordant population structure between hosts and symbionts indicates that geographically and genetically distinct host populations in different parts of the range might interact with genetically similar symbionts, potentially minimizing local specialization.

    more » « less
  2. Koomey, Michael (Ed.)
    ABSTRACT Elizabethkingia anophelis is an emerging global multidrug-resistant opportunistic pathogen. We assessed the diversity among 13 complete genomes and 23 draft genomes of E. anophelis strains derived from various environmental settings and human infections from different geographic regions around the world from 1950s to the present. Putative integrative and conjugative elements (ICEs) were identified in 31/36 (86.1%) strains in the study. A total of 52 putative ICEs (including eight degenerated elements lacking integrases) were identified and categorized into three types based on the architecture of the conjugation module and the phylogeny of the relaxase, coupling protein, TraG, and TraJ protein sequences. The type II and III ICEs were found to integrate adjacent to tRNA genes, while type I ICEs integrate into intergenic regions or into a gene. The ICEs carry various cargo genes, including transcription regulator genes and genes conferring antibiotic resistance. The adaptive immune CRISPR-Cas system was found in nine strains, including five strains in which CRISPR-Cas machinery and ICEs coexist at different locations on the same chromosome. One ICE-derived spacer was present in the CRISPR locus in one strain. ICE distribution in the strains showed no geographic or temporal patterns. The ICEs in E. anophelis differ in architecture and sequence from CTnDOT, a well-studied ICE prevalent in Bacteroides spp. The categorization of ICEs will facilitate further investigations of the impact of ICE on virulence, genome epidemiology, and adaptive genomics of E. anophelis . IMPORTANCE Elizabethkingia anophelis is an opportunistic human pathogen, and the genetic diversity between strains from around the world becomes apparent as more genomes are sequenced. Genome comparison identified three types of putative ICEs in 31 of 36 strains. The diversity of ICEs suggests that they had different origins. One of the ICEs was discovered previously from a large E. anophelis outbreak in Wisconsin in the United States; this ICE has integrated into the mutY gene of the outbreak strain, creating a mutator phenotype. Similar to ICEs found in many bacterial species, ICEs in E. anophelis carry various cargo genes that enable recipients to resist antibiotics and adapt to various ecological niches. The adaptive immune CRISPR-Cas system is present in nine of 36 strains. An ICE-derived spacer was found in the CRISPR locus in a strain that has no ICE, suggesting a past encounter and effective defense against ICE. 
    more » « less
  3. Dinoflagellates of the family Symbiodiniaceae are predominantly essential symbionts of corals and other marine organisms. Recent research reveals extensive genome sequence divergence among Symbiodiniaceae taxa and high phylogenetic diversity hidden behind subtly different cell morphologies. Using an alignment-free phylogenetic approach based on sub-sequences of fixed length k (i.e. k -mers), we assessed the phylogenetic signal among whole-genome sequences from 16 Symbiodiniaceae taxa (including the genera of Symbiodinium , Breviolum , Cladocopium , Durusdinium and Fugacium ) and two strains of Polarella glacialis as outgroup. Based on phylogenetic trees inferred from k -mers in distinct genomic regions (i.e. repeat-masked genome sequences, protein-coding sequences, introns and repeats) and in protein sequences, the phylogenetic signal associated with protein-coding DNA and the encoded amino acids is largely consistent with the Symbiodiniaceae phylogeny based on established markers, such as large subunit rRNA. The other genome sequences (introns and repeats) exhibit distinct phylogenetic signals, supporting the expected differential evolutionary pressure acting on these regions. Our analysis of conserved core k -mers revealed the prevalence of conserved k -mers (>95% core 23-mers among all 18 genomes) in annotated repeats and non-genic regions of the genomes. We observed 180 distinct repeat types that are significantly enriched in genomes of the symbiotic versus free-living Symbiodinium taxa, suggesting an enhanced activity of transposable elements linked to the symbiotic lifestyle. We provide evidence that representation of alignment-free phylogenies as dynamic networks enhances the ability to generate new hypotheses about genome evolution in Symbiodiniaceae. These results demonstrate the potential of alignment-free phylogenetic methods as a scalable approach for inferring comprehensive, unbiased whole-genome phylogenies of dinoflagellates and more broadly of microbial eukaryotes. 
    more » « less
  4. Abstract

    The geographic distribution of phenotypic variation among closely related populations is a valuable source of information about the evolutionary processes that generate and maintain biodiversity. Leapfrog distributions, in which phenotypically similar populations are disjunctly distributed and separated by one or more phenotypically distinct populations, represent geographic replicates for the existence of a phenotype, and are therefore especially informative. These geographic patterns have mostly been studied from phylogenetic perspectives to understand how common ancestry and divergent evolution drive their formation. Other processes, such as gene flow between populations, have not received as much attention. Here, we investigate the roles of divergence and gene flow between populations in the origin and maintenance of a leapfrog distribution inPhyllobatespoison frogs. We found evidence for high levels of gene flow between neighbouring populations but not over long distances, indicating that gene flow between populations exhibiting the central phenotype may have a homogenizing effect that maintains their similarity, and that introgression between ‘leapfroging’ taxa has not played a prominent role as a driver of phenotypic diversity inPhyllobates. Although phylogenetic analyses suggest that the leapfrog distribution was formed through independent evolution of the peripheral (i.e. leapfrogging) populations, the elevated levels of gene flow between geographically close populations poise alternative scenarios, such as the history of phenotypic change becoming decoupled from genome‐averaged patterns of divergence, which we cannot rule out. These results highlight the importance of incorporating gene flow between populations into the study of geographic variation in phenotypes, both as a driver of phenotypic diversity and as a confounding factor of phylogeographic inferences.

    more » « less
  5. Premise

    Divergence depends on the strength of selection and frequency of gene flow between taxa, while reproductive isolation relies on mating barriers and geographic distance. Less is known about how these processes interact at early stages of speciation. Here, we compared population‐level differentiation in floral phenotype and genetic sequence variation among recently divergedCastillejato explore patterns of diversification under different scenarios of reproductive isolation.


    Using target enrichment enabled by the Angiosperms353 probe set, we assessed genetic distance among 50 populations of fourCastillejaspecies. We investigated whether patterns of genetic divergence are explained by floral trait variation or geographic distance in two focal groups: the widespreadC. sessilifloraand the more restrictedC. purpureaspecies complex.


    We document thatC. sessilifloraand theC. purpureacomplex are characterized by high diversity in floral color across varying geographic scales. Despite phenotypic divergence, groups were not well supported in phylogenetic analyses, and little genetic differentiation was found across targeted Angiosperms353 loci. Nonetheless, a principal coordinate analysis of single nucleotide polymorphisms revealed differentiation withinC. sessilifloraacross floral morphs and geography and less differentiation among species of theC. purpureacomplex.


    Patterns of genetic distance inC. sessiliflorasuggest species cohesion maintained over long distances despite variation in floral traits. In theC. purpureacomplex, divergence in floral color across narrow geographic clines may be driven by recent selection on floral color. These contrasting patterns of floral and genetic differentiation reveal that divergence can arise via multiple eco‐evolutionary paths.

    more » « less