Title: A Genus Definition for Bacteria and Archaea Based on a Standard Genome Relatedness Index
ABSTRACT Genus assignment is fundamental in the characterization of microbes, yet there is currently no unambiguous way to demarcate genera solely using standard genomic relatedness indices. Here, we propose an approach to demarcate genera that relies on the combined use of the average nucleotide identity, genome alignment fraction, and the distinction between type and non-type species. More than 3,500 genomes representing type strains of species from >850 genera of either bacterial or archaeal lineages were tested. Over 140 genera were analyzed in detail within the taxonomic context of order/family. Significant genomic differences between members of a genus and type species of other genera in the same order/family were conserved in 94% of the cases. Nearly 90% (92% if polyphyletic genera are excluded) of the type strains were classified in agreement with current taxonomy. The 448 type strains that need reclassification directly impact 33% of the genera analyzed in detail. The results provide a first line of evidence that the combination of genomic indices provides added resolution to effectively demarcate genera within the taxonomic framework that is currently based on the 16S rRNA gene. We also identify the emergence of natural breakpoints at the genome level that can further help in the circumscription of taxa, increasing the proportion of directly impacted genera to at least 43% and pointing to inaccuracies in the use of the 16S rRNA gene as a taxonomic marker, despite its precision. Altogether, these results suggest that genomic coherence is an emergent property of genera in Bacteria and Archaea.
IMPORTANCE In recent decades, the taxonomy of Bacteria and Archaea, and therefore genus designation, has been largely based on the use of a single ribosomal gene, the 16S rRNA gene, as a taxonomic marker. We propose an approach to delineate genera that excludes the direct use of the 16S rRNA gene and focuses on a standard genome relatedness index, the average nucleotide identity. Our findings are of importance to the microbiology community because the emergent properties of Bacteria and Archaea that are identified in this study will help assign genera with higher taxonomic resolution.
Award ID(s):
1849227 1826734
NSF-PAR ID:
10140621
Journal Name:
mBio
Volume:
11
Issue:
1
ISSN:
2150-7511
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    A polyphasic taxonomic approach, incorporating analysis of phenotypic features, cellular fatty acid profiles, 16S rRNA gene sequences, and determination of average nucleotide identity (ANI) plus digital DNA–DNA hybridization (dDDH), was applied to characterize an anaerobic bacterial strain, designated KD22T, isolated from human feces. 16S rRNA gene-based phylogenetic analysis showed that strain KD22T was most closely related to species of the genus Gabonibacter. At the 16S rRNA gene level, the closest relative of strain KD22T was Gabonibacter massiliensis GM7T, with a similarity of 97.58%. Cells of strain KD22T were Gram-negative coccobacilli, positive for indole and negative for catalase, nitrate reduction, oxidase, and urease activities. The fatty acid analysis demonstrated the presence of a high concentration of iso-C15:0 (51.65%). The complete genome sequence of strain KD22T was 3,368,578 bp long, with a DNA G+C content of 42 mol%. The dDDH and ANI values between KD22T and type strains of phylogenetically related species were 67.40% and 95.43%, respectively. These phylogenetic, phenotypic, and genomic results supported the affiliation of strain KD22T as a novel bacterial species within the genus Gabonibacter. The proposed name is Gabonibacter chumensis and the type strain is KD22T (= CSUR Q8104T = DSM 115208T).

  2. Background Cellulolytic, hemicellulolytic, and amylolytic (CHA) enzyme-producing halophiles are understudied. The recently defined taxon Iocasia fonsfrigidae consists of one well-described anaerobic bacterial strain: NS-1T. Prior to characterization of strain NS-1T, an isolate designated Halocella sp. SP3-1 was isolated and its genome was published. Based on physiological and genetic comparisons, it was suggested that Halocella sp. SP3-1 may be another isolate of I. fonsfrigidae. Although the two strains are geographic variants of the same species, data indicate that strain SP3-1 exhibits genetic, genomic, and physiological characteristics that distinguish it from strain NS-1T. In this study, we examine the halophilic and alkaliphilic nature of strain SP3-1 and the genetic substrates underlying phenotypic differences between strains SP3-1 and NS-1T, with a focus on sugar metabolism and CHA enzyme expression. Methods Standard methods in anaerobic cell culture were used to grow strain SP3-1 as well as other comparator species. Morphological characterization was done via electron microscopy and Schaeffer-Fulton staining. Data for sequence comparisons (e.g., 16S rRNA) were retrieved via BLAST and EzBioCloud. Alignments and phylogenetic trees were generated via CLUSTAL_X and neighbor-joining functions in MEGA (version 11). Genomes were assembled/annotated via the Prokka annotation pipeline. Clusters of Orthologous Groups (COGs) were defined by eggNOG 4.5. DNA-DNA hybridization calculations were performed by the ANI Calculator web service. Results Cells of strain SP3-1 are rods. SP3-1 cells grow at NaCl concentrations of 5-30% (w/v). Optimal growth occurs at 37 °C, pH 8.0, and 20% NaCl (w/v). Although phylogenetic analysis based on the 16S rRNA gene indicates that strain SP3-1 belongs to the genus Iocasia, with 99.58% average nucleotide sequence identity to Iocasia fonsfrigidae NS-1T, strain SP3-1 is uniquely an extreme haloalkaliphile. Moreover, strain SP3-1 ferments D-glucose to acetate, butyrate, carbon dioxide, hydrogen, ethanol, and butanol and will grow on L-arabinose, D-fructose, D-galactose, D-glucose, D-mannose, D-raffinose, D-xylose, cellobiose, lactose, maltose, sucrose, starch, xylan, and phosphoric acid swollen cellulose (PASC). D-rhamnose, alginate, and lignin do not serve as suitable culture substrates for strain SP3-1. Thus, the carbon utilization profile of strain SP3-1 differs from that of I. fonsfrigidae strain NS-1T. Differences between these two strains are also noted in their lipid composition. Genomic data reveal key differences between the genetic profiles of strains SP3-1 and NS-1T that likely account for differences in morphology, sugar metabolism, and CHA-enzyme potential. Important to this study, I. fonsfrigidae SP3-1 produces and extracellularly secretes CHA enzymes at different levels and composition than type strain NS-1T. The high salt tolerance and pH range of SP3-1 make it an ideal candidate for salt- and pH-tolerant enzyme discovery.
  3. Three novel carbon monoxide-oxidizing Halobacteria were isolated from Bonneville Salt Flats (Utah, USA) salt crusts and nearby saline soils. Phylogenetic analysis of 16S rRNA gene sequences revealed that strains PCN9T, WSA2T and WSH3T belong to the genera Halobacterium, Halobaculum and Halovenus, respectively. Strains PCN9T, WSA2T and WSH3T grew optimally at 40 °C (PCN9T) or 50 °C (WSA2T, WSH3T). NaCl optima were 3 M (PCN9T, WSA2T) or 4 M (WSH3T). Carbon monoxide was oxidized by all isolates, each of which contained a molybdenum-dependent CO dehydrogenase. G+C contents for the three respective isolates were 66.75, 67.62, and 63.97 mol%, as derived from genome analyses. The closest phylogenetic relatives of PCN9T, WSA2T and WSH3T were Halobacterium noricense A1T, Halobaculum roseum D90T and Halovenus aranensis EB27T, with 98.71, 98.19 and 95.95% 16S rRNA gene sequence similarities, respectively. Genome comparisons of PCN9T with Halobacterium noricense A1T yielded an average nucleotide identity (ANI) of 82.0% and a digital DNA–DNA hybridization (dDDH) value of 25.7%; comparisons of WSA2T with Halobaculum roseum D90T yielded ANI and dDDH values of 86.34 and 31.1%, respectively. The ANI value for a comparison of WSH3T with Halovenus aranensis EB27T was 75.2%. Physiological, biochemical, genetic and genomic characteristics of PCN9T, WSA2T and WSH3T differentiated them from their closest phylogenetic neighbours and indicated that they represent novel species, for which the names Halobacterium bonnevillei, Halobaculum saliterrae and Halovenus carboxidivorans are proposed, respectively. The type strains are PCN9T (=JCM 32472=LMG 31022=ATCC TSD-126), WSA2T (=JCM 32473=ATCC TSD-127) and WSH3T (=JCM 32474=ATCC TSD-128).
  4. Abstract Background Antibiotic-producing Streptomyces bacteria are ubiquitous in nature, yet most studies of their diversity have focused on free-living strains inhabiting diverse soil environments and those in symbiotic relationships with invertebrates. Results We studied the draft genomes of 73 Streptomyces isolates sampled from the skin (wing and tail membranes) and fur surfaces of bats collected in Arizona and New Mexico. We uncovered large genomic variation and biosynthetic potential, even among closely related strains. The isolates, which were initially identified as three distinct species based on sequence variation in the 16S rRNA locus, could be distinguished as 41 different species based on genome-wide average nucleotide identity. Of the 32 biosynthetic gene cluster (BGC) classes detected, non-ribosomal peptide synthetases, siderophores, and terpenes were present in all genomes. On average, Streptomyces genomes carried 14 distinct classes of BGCs (range = 9–20). Results also revealed large inter- and intra-species variation in gene content (single nucleotide polymorphisms, accessory genes and singletons) and BGCs, further contributing to the overall genetic diversity present in bat-associated Streptomyces. Finally, we show that genome-wide recombination has partly contributed to the large genomic variation among strains of the same species. Conclusions Our study provides an initial genomic assessment of bat-associated Streptomyces that will be critical to prioritizing those strains with the greatest ability to produce novel antibiotics. It also highlights the need to recognize within-species variation as an important factor in genetic manipulation studies, diversity estimates and drug discovery efforts in Streptomyces.
  5. Hird, Sarah M. (Ed.)
    The gut microbiome provides vital functions for mammalian hosts, yet research on its variability and function across adult life spans and multiple generations is limited in large mammalian carnivores. Here, we used 16S rRNA gene and metagenomic high-throughput sequencing to profile the bacterial taxonomic composition, genomic diversity, and metabolic function of fecal samples collected from 12 wild spotted hyenas (Crocuta crocuta) residing in the Masai Mara National Reserve, Kenya, over a 23-year period spanning three generations. The metagenomic data came from four of these hyenas and spanned two 2-year periods. With these data, we determined the extent to which host factors predicted variation in the gut microbiome and identified the core microbes present in the guts of hyenas. We also investigated novel genomic diversity in the mammalian gut by reporting the first metagenome-assembled genomes (MAGs) for hyenas. We found that gut microbiome taxonomic composition varied temporally, but despite this, a core set of 14 bacterial genera was identified. The strongest predictors of the microbiome were host identity and age, suggesting that hyenas possess individualized microbiomes and that these may change with age during adulthood. The gut microbiome functional profiles of the four adult hyenas were also individual specific and were associated with prey abundance, indicating that the functions of the gut microbiome vary with host diet. We recovered 149 high-quality MAGs from the hyenas’ guts; some MAGs were classified as taxa previously reported for other carnivores, but many were novel and lacked species-level matches to genomes in existing reference databases.
    IMPORTANCE There is a gap in knowledge regarding the genomic diversity and variation of the gut microbiome across a host’s life span and across multiple generations of hosts in wild mammals. Using two types of sequencing approaches, we found that although gut microbiomes were individualized and temporally variable among hyenas, they correlated similarly to large-scale changes in the ecological conditions experienced by their hosts. We also recovered 149 high-quality MAGs from the hyena gut, greatly expanding the microbial genome repertoire known for hyenas, carnivores, and wild mammals in general. Some MAGs came from genera abundant in the gastrointestinal tracts of canid species and other carnivores, but over 80% of MAGs were novel and from species not previously represented in genome databases. Collectively, our novel body of work illustrates the importance of surveying the gut microbiome of nonmodel wild hosts, using multiple sequencing methods and computational approaches and at distinct scales of analysis.