

Title: Comprehensive comparative genomics reveals over 50 phyla of free-living and pathogenic bacteria are associated with diverse members of the amoebozoa
Abstract

The Amoebozoa, a group of predominantly amoeboid unicellular protists, has been shown to play an important ecological role in controlling environmental bacteria. Amoebozoans not only graze on bacteria but also serve as a safe niche for bacterial replication and harbor endosymbiotic bacteria, including dangerous human pathogens. Despite their importance, only a few lineages of Amoebozoa have been studied in this regard. In this research, we conducted a comprehensive genomic and transcriptomic study with expansive taxon sampling, including representatives from the three known clades of the Amoebozoa. We used culture-independent whole-culture and single-cell genomics/transcriptomics to investigate the association of bacteria with diverse amoebozoans. Relative to currently published evidence, we recovered the largest number of bacterial phyla (64) and human pathogen genera (51) associated with the Amoebozoa. Using single-cell genomics/transcriptomics, we identified up to 24 bacterial phyla that are potentially endosymbiotic. The recovered bacteria include the majority of multidrug-resistant pathogens designated as major public health threats. Our study demonstrates that amoebozoans are associated with many more phylogenetically diverse bacterial phyla than previously recognized. It also shows that all amoebozoans are capable of harboring far more dangerous human pathogens than presently documented, making them of primary public health concern.

 
Award ID(s):
1831958
NSF-PAR ID:
10360626
Author(s) / Creator(s):
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Reports
Volume:
11
Issue:
1
ISSN:
2045-2322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    With the advent of metagenomics, the importance of microorganisms and how their interactions are relevant to ecosystem resilience, sustainability, and human health has become evident. Cataloging and preserving biodiversity is paramount not only for the Earth’s natural systems but also for discovering solutions to challenges that we face as a growing civilization. Metagenomics pertains to the in silico study of all microorganisms within an ecological community in situ; however, many software suites recover only prokaryotes and have limited to no support for viruses and eukaryotes.

    Results

    In this study, we introduce the Viral Eukaryotic Bacterial Archaeal (VEBA) open-source software suite developed to recover genomes from all domains. To our knowledge, VEBA is the first end-to-end metagenomics suite that can directly recover, quality assess, and classify prokaryotic, eukaryotic, and viral genomes from metagenomes. VEBA implements a novel iterative binning procedure and hybrid sample-specific/multi-sample framework that yields more genomes than any existing methodology alone. VEBA includes a consensus microeukaryotic database containing proteins from existing databases to optimize microeukaryotic gene modeling and taxonomic classification. VEBA also provides a unique clustering-based dereplication strategy allowing for sample-specific genomes and genes to be directly compared across non-overlapping biological samples. Finally, VEBA is the only pipeline that automates the detection of candidate phyla radiation bacteria and implements the appropriate genome quality assessments. VEBA’s capabilities are demonstrated by reanalyzing 3 existing public datasets, which yielded a total of 948 MAGs (458 prokaryotic, 8 eukaryotic, and 482 viral), including several uncharacterized organisms and organisms with no public genome representatives.

    Conclusions

    The VEBA software suite allows for the in silico recovery of microorganisms from all domains of life by integrating cutting-edge algorithms in novel ways. VEBA fully integrates both end-to-end and task-specific metagenomic analysis in a modular architecture that minimizes dependencies and maximizes productivity. VEBA's contributions to the metagenomics community include seamless end-to-end metagenomics analysis as well as the flexibility to perform specific analytical tasks. VEBA automates several metagenomics steps and shows that new information can be recovered from existing datasets.

     
  2. BACKGROUND Diverse organisms, from archaea and bacteria to plants and humans, use receptor systems to recognize both pathogens and dangerous self-derived or environmentally derived stimuli. These intricate, well-coordinated immune systems, composed of innate and adaptive components, ensure host survival. In the late 20th century, researchers identified the Toll/interleukin-1/resistance gene (TIR) domain as an evolutionarily conserved component of animal and plant innate immune systems. Today, TIR-domain proteins are known to be broadly distributed across the tree of life. The TIR domain was first recognized as an adaptor for the assembly of macromolecular signaling complexes in mammalian innate immune pathways. Work on axon degeneration in animals, as well as on plant, archaeal, and bacterial immune systems, has uncovered additional enzymatic activities for TIR domains. ADVANCES Mammalian axons initiate a self-destruct program upon injury and during disease that is mediated by the sterile alpha and TIR motif containing 1 (SARM1) protein. The SARM1 TIR domain enzymatically consumes the essential metabolic cofactor nicotinamide adenine dinucleotide (NAD+) to promote axonal death. Identification of the SARM1 NAD+-consuming enzyme (NADase) revealed that TIR domains can function as enzymes. Given the evolutionary conservation of TIR domains, studies investigated whether the SARM1 TIR NADase was also conserved. Indeed, bacteria, archaea, and plant TIR domains possess NADase activity. In prokaryotes, TIR NADase activity is found in an ancient antiphage immune system. In plants, identification of TIR NADase activity and linkage of TIR enzymatic products to downstream signaling components addressed the question of how nucleotide-binding, leucine-rich repeat (NLR) receptors trigger hypersensitive cell death during an immune response.
Studies in plants show that their TIR domains can cleave nucleic acids and possess 2′,3′ cyclic adenosine monophosphate (2′,3′-cAMP) and 2′,3′ cyclic guanosine monophosphate (2′,3′-cGMP) synthetase activity that aids cell death programs in plant innate immunity. Thus, TIR domains constitute an ancient family of enzymes that are activated in immune and cell death pathways. OUTLOOK The discovery of TIR-domain enzyme activities carries implications for innate immunity and neurodegeneration. The identification of the SARM1 NADase defined a drug target for a wide number of neurodegenerative diseases that is being exploited in both preclinical and clinical studies. Hyperactive mutations in the SARM1 NADase have been discovered in amyotrophic lateral sclerosis (ALS) patients. Future work will seek to clarify the contribution of the SARM1 axon degeneration pathway to ALS pathogenesis. NAD+ biology influences cellular processes from metabolism to DNA repair to aging. How TIR enzymes influence the NAD+ metabolome and its associated pathways in bacteria, archaea, plants, and animals will be an exciting area for upcoming investigation. The discovery of the diversity of TIR enzymatic products is revealing signaling pathways across kingdoms. Discovery of TIR enzymatic function in plants and animals may yet inspire studies of enzymatic functions for Toll-like receptors in animals. We anticipate that cross-kingdom studies of TIR-domain function will guide interventions that will span the tree of life, from treating human neurodegenerative disorders and bacterial infections to preventing plant diseases. Conserved TIR-domain enzymatic activity. TIR-domain proteins from prokaryotes and eukaryotes cleave NAD+ into nicotinamide (Nam), ADP-ribose (ADPR), cyclic ADP-ribose (cADPR), isomers of cyclic ADP-ribose (2′ or 3′cADPR), and related molecules [e.g., phosphoribosyl adenosine monophosphate (pRib-AMP)].
Plant TIR domains also possess a nuclease activity, can degrade DNA and RNA, and can function as a 2′,3′-cAMP or 2′,3′-cGMP synthetase. TIR enzymatic activity drives cell death and immune pathways across kingdoms. TIR activity can kill cells directly through NAD+ depletion or indirectly using enzymatic products as signal molecules. The representative TIR domain structure shown here is Protein Data Bank ID 6O0Q. EDS1, enhanced disease susceptibility 1; ThsA, Thoeris A.
  3. Amoebozoa include lineages of diverse ecology, behavior, and morphology. They are assumed to encompass members with the largest genome sizes of all living things, yet genomic studies in the group are limited. Trichosphaerium, a polymorphic, multinucleate, marine amoeba with a complicated life cycle, has puzzled experts for over a century. In an effort to explore the genomic diversity and investigate extraordinary behavior observed among the Amoebozoa, we used integrated omics approaches to study this enigmatic marine amoeba. Omics data, including single-cell transcriptomics and cytological data, demonstrate that Trichosphaerium sp. possesses the complete meiosis toolkit genes. These genes are expressed in life stages of the amoeba including medium and large cells. The life cycle of Trichosphaerium sp. involves asexual processes via binary fission and multiple fragmentation of giant cells, as well as sexual-like processes involving genes implicated in sexual reproduction and polyploidization. These findings are in stark contrast to a life cycle previously reported for this amoeba. Despite the extreme morphological plasticity observed in Trichosphaerium, our genomic data showed that populations maintain a species-level intragenomic variation. A draft genome of Trichosphaerium indicates elevated lateral gene transfer (LGT) from bacteria and giant viruses. Gene trafficking in Trichosphaerium is the highest within Amoebozoa and among the highest in microbial eukaryotes. 
  4. Abstract Bacteriophages from the Inoviridae family (inoviruses) are characterized by their unique morphology, genome content and infection cycle. One of the most striking features of inoviruses is their ability to establish a chronic infection whereby the viral genome resides within the cell in either an exclusively episomal state or integrated into the host chromosome and virions are continuously released without killing the host. To date, a relatively small number of inovirus isolates have been extensively studied, either for biotechnological applications, such as phage display, or because of their effect on the toxicity of known bacterial pathogens including Vibrio cholerae and Neisseria meningitidis. Here, we show that the current 56 members of the Inoviridae family represent a minute fraction of a highly diverse group of inoviruses. Using a machine learning approach leveraging a combination of marker gene and genome features, we identified 10,295 inovirus-like sequences from microbial genomes and metagenomes. Collectively, our results call for reclassification of the current Inoviridae family into a viral order including six distinct proposed families associated with nearly all bacterial phyla across virtually every ecosystem. Putative inoviruses were also detected in several archaeal genomes, suggesting that, collectively, members of this supergroup infect hosts across the domains Bacteria and Archaea. Finally, we identified an expansive diversity of inovirus-encoded toxin–antitoxin and gene expression modulation systems, alongside evidence of both synergistic (CRISPR evasion) and antagonistic (superinfection exclusion) interactions with co-infecting viruses, which we experimentally validated in a Pseudomonas model. Capturing this previously obscured component of the global virosphere may spark new avenues for microbial manipulation approaches and innovative biotechnological applications.
  5. ABSTRACT Little is known about the public health risks associated with natural creek sediments that are affected by runoff and fecal pollution from agricultural and livestock practices. For instance, the persistence of foodborne pathogens such as Shiga toxin-producing Escherichia coli (STEC) originating from these practices remains poorly quantified. Towards closing these knowledge gaps, the water-sediment interface of two creeks in the Salinas River Valley of California was sampled over a 9-month period using metagenomics and traditional culture-based tests for STEC. Our results revealed that these sediment communities are extremely diverse and have functional and taxonomic diversity comparable to that observed in soils. With our sequencing effort (∼4 Gbp per library), we were unable to detect any pathogenic E. coli in the metagenomes of 11 samples that had tested positive using culture-based methods, apparently due to relatively low abundance. Furthermore, there were no significant differences in the abundance of human- or cow-specific gut microbiome sequences in the downstream impacted sites compared to that in upstream more pristine (control) sites, indicating natural dilution of anthropogenic inputs. Notably, the high number of metagenomic reads carrying antibiotic resistance genes (ARGs) found in all samples was significantly higher than ARG reads in other available freshwater and soil metagenomes, suggesting that these communities may be natural reservoirs of ARGs. The work presented here should serve as a guide for sampling volumes, amount of sequencing to apply, and what bioinformatics analyses to perform when using metagenomics for public health risk studies of environmental samples such as sediments. IMPORTANCE Current agricultural and livestock practices contribute to fecal contamination in the environment and the spread of food- and waterborne disease and antibiotic resistance genes (ARGs). 
Traditionally, the level of pollution and risk to public health are assessed by culture-based tests for the intestinal bacterium Escherichia coli. However, the accuracy of these traditional methods (e.g., low accuracy in quantification, and false-positive signal when PCR based) and their suitability for sediments remain unclear. We collected sediments for a time series metagenomics study from one of the most highly productive agricultural regions in the United States in order to assess how agricultural runoff affects the native microbial communities and if the presence of Shiga toxin-producing Escherichia coli (STEC) in sediment samples can be detected directly by sequencing. Our study provided important information on the potential for using metagenomics as a tool for assessment of public health risk in natural environments.