skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Average nucleotide identity-based Staphylococcus aureus strain grouping allows identification of strain-specific genes in the pangenome
Staphylococcus aureus causes both hospital- and community-acquired infections in humans worldwide. Due to the high incidence of infection, S. aureus is also one of the most sampled and sequenced pathogens today, providing an outstanding resource to understand variation at the bacterial subspecies level. We processed and downsampled 83,383 public S. aureus Illumina whole-genome shotgun sequences and 1,263 complete genomes to produce 7,954 representative substrains. Pairwise comparison of average nucleotide identity revealed a natural boundary of 99.5% that could be used to define 145 distinct strains within the species. We found that intermediate frequency genes in the pangenome (present in 10%–95% of genomes) could be divided into those closely linked to strain background (“strain-concentrated”) and those highly variable within strains (“strain-diffuse”). Non-core genes had different patterns of chromosome location. Notably, strain-diffuse genes were associated with prophages; strain-concentrated genes were associated with the vSaβ genome island and rare genes (<10% frequency) concentrated near the origin of replication. Antibiotic resistance genes were enriched in the strain-diffuse class, while virulence genes were distributed between strain-diffuse, strain-concentrated, core, and rare classes. This study shows how different patterns of gene movement help create strains as distinct subspecies entities and provide insight into the diverse histories of important S. aureus functions.  more » « less
Award ID(s):
2146260
PAR ID:
10584878
Author(s) / Creator(s):
; ; ; ; ;
Editor(s):
Hayer, Juliette
Publisher / Repository:
American Society for Microbiology
Date Published:
Journal Name:
mSystems
Volume:
9
Issue:
7
ISSN:
2379-5077
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Staphylococcus aureus infections are of growing concern given the increased incidence of antibiotic resistant strains. Egypt, like several other countries, has seen alarming increases in methicillin-resistant S. aureus (MRSA) infections. This species can rapidly acquire genes associated with resistance, as well as virulence factors, through mobile genetic elements, including phages. Recently, we sequenced 56 S. aureus genomes from Alexandria Main University Hospital in Alexandria, Egypt, complementing 17 S. aureus genomes publicly available from other sites in Egypt. In the current study, we found that the majority (73.6%) of these strains contain intact prophages, including Biseptimaviruses, Phietaviruses, and Triaviruses. Further investigation of these prophages revealed evidence of horizontal exchange of the integrase for two of the prophages. These Egyptian S. aureus prophages are predicted to encode numerous virulence factors, including genes associated with immune evasion and toxins, including the Panton–Valentine leukocidin (PVL)-associated genes lukF-PV/lukS-PV. Thus, prophages are likely to be a major contributor to the virulence of S. aureus strains in circulation in Egypt. 
    more » « less
  2. Engel, P (Ed.)
    Abstract Lactobacillaceae are an important family of lactic acid bacteria that play key roles in the gut microbiome of many animal species. In the honey bee (Apis mellifera) gut microbiome, many species of Lactobacillaceae are found, and there is functionally important strain-level variation in the bacteria. In this study, we completed whole-genome sequencing of 3 unique Lactobacillaceae isolates collected from hives in Virginia, USA. Using 107 genomes of known bee-associated Lactobacillaceae and Limosilactobacillus reuteri as an outgroup, the phylogenetics of the 3 isolates was assessed, and these isolates were identified as novel strains of Apilactobacillus kunkeei, Lactobacillus kullabergensis, and Bombilactobacillus mellis. Genome rearrangements, conserved orthologous genes (COG) categories and potential prophage regions were identified across the 3 novel strains. The new A. kunkeei strain was enriched in genes related to replication, recombination and repair, the L. kullabergensis strain was enriched for carbohydrate transport, and the B. mellis strain was enriched in transcription or transcriptional regulation and in some genes with unknown functions. Prophage regions were identified in the A. kunkeei and L. kullabergensis isolates. These new bee-associated strains add to our growing knowledge of the honey bee gut microbiome, and to Lactobacillaceae genomics more broadly. 
    more » « less
  3. Koomey, Michael (Ed.)
    ABSTRACT Elizabethkingia anophelis is an emerging global multidrug-resistant opportunistic pathogen. We assessed the diversity among 13 complete genomes and 23 draft genomes of E. anophelis strains derived from various environmental settings and human infections from different geographic regions around the world from 1950s to the present. Putative integrative and conjugative elements (ICEs) were identified in 31/36 (86.1%) strains in the study. A total of 52 putative ICEs (including eight degenerated elements lacking integrases) were identified and categorized into three types based on the architecture of the conjugation module and the phylogeny of the relaxase, coupling protein, TraG, and TraJ protein sequences. The type II and III ICEs were found to integrate adjacent to tRNA genes, while type I ICEs integrate into intergenic regions or into a gene. The ICEs carry various cargo genes, including transcription regulator genes and genes conferring antibiotic resistance. The adaptive immune CRISPR-Cas system was found in nine strains, including five strains in which CRISPR-Cas machinery and ICEs coexist at different locations on the same chromosome. One ICE-derived spacer was present in the CRISPR locus in one strain. ICE distribution in the strains showed no geographic or temporal patterns. The ICEs in E. anophelis differ in architecture and sequence from CTnDOT, a well-studied ICE prevalent in Bacteroides spp. The categorization of ICEs will facilitate further investigations of the impact of ICE on virulence, genome epidemiology, and adaptive genomics of E. anophelis . IMPORTANCE Elizabethkingia anophelis is an opportunistic human pathogen, and the genetic diversity between strains from around the world becomes apparent as more genomes are sequenced. Genome comparison identified three types of putative ICEs in 31 of 36 strains. The diversity of ICEs suggests that they had different origins. One of the ICEs was discovered previously from a large E. anophelis outbreak in Wisconsin in the United States; this ICE has integrated into the mutY gene of the outbreak strain, creating a mutator phenotype. Similar to ICEs found in many bacterial species, ICEs in E. anophelis carry various cargo genes that enable recipients to resist antibiotics and adapt to various ecological niches. The adaptive immune CRISPR-Cas system is present in nine of 36 strains. An ICE-derived spacer was found in the CRISPR locus in a strain that has no ICE, suggesting a past encounter and effective defense against ICE. 
    more » « less
  4. ABSTRACT We report here the draft genome sequences of Staphylococcus bacteriophages JBug18, Pike, Pontiff, and Pabna, which infect and lyse S. epidermidis and S. aureus strains. All bacteriophages belong to the morphological family Podoviridae and constitute attractive candidates for use as whole-phage therapeutics due to their compact genomes and lytic lifestyles. 
    more » « less
  5. Maresca, Julia A (Ed.)
    ABSTRACT We report draft genomes of two bacterial strains in the genera Hyphobacterium and Reichenbachiella , which are associated with the diatom Cyclotella cryptica strain CCMP332. Genomes from these strains were 2,691,501 bp and 3,325,829 bp, respectively, and will be useful for understanding interactions between diatoms and bacteria. 
    more » « less