Abstract Pan-genome analyses of metagenome-assembled genomes (MAGs) may suffer from the known issues with MAGs: fragmentation, incompleteness and contamination. Here, we conducted a critical assessment of pan-genomics of MAGs, by comparing pan-genome analysis results of complete bacterial genomes and simulated MAGs. We found that incompleteness led to significant core gene (CG) loss. The CG loss remained when using different pan-genome analysis tools (Roary, BPGA, Anvi’o) and when using a mixture of MAGs and complete genomes. Contamination had little effect on core genome size (except for Roary due to in its gene clustering issue) but had major influence on accessory genomes. Importantly, the CG loss was partially alleviated by lowering the CG threshold and using gene prediction algorithms that consider fragmented genes, but to a less degree when incompleteness was higher than 5%. The CG loss also led to incorrect pan-genome functional predictions and inaccurate phylogenetic trees. Our main findings were supported by a study of real MAG-isolate genome data. We conclude that lowering CG threshold and predicting genes in metagenome mode (as Anvi’o does with Prodigal) are necessary in pan-genome analysis of MAGs. Development of new pan-genome analysis tools specifically for MAGs are needed in future studies. 
                        more » 
                        « less   
                    
                            
                            Zetaproteobacteria Pan-Genome Reveals Candidate Gene Cluster for Twisted Stalk Biosynthesis and Export
                        
                    
    
            Twisted stalks are morphologically unique bacterial extracellular organo-metallic structures containing Fe(III) oxyhydroxides that are produced by microaerophilic Fe(II)-oxidizers belonging to the Betaproteobacteria and Zetaproteobacteria. Understanding the underlying genetic and physiological mechanisms of stalk formation is of great interest based on their potential as novel biogenic nanomaterials and their relevance as putative biomarkers for microbial Fe(II) oxidation on ancient Earth. Despite the recognition of these special biominerals for over 150 years, the genetic foundation for the stalk phenotype has remained unresolved. Here we present a candidate gene cluster for the biosynthesis and secretion of the stalk organic matrix that we identified with a trait-based analyses of a pan-genome comprising 16 Zetaproteobacteria isolate genomes. The “ s talk f ormation in Z etaproteobacteria” (sfz) cluster comprises six genes ( sfz1-sfz6 ), of which sfz1 and sfz2 were predicted with functions in exopolysaccharide synthesis, regulation, and export, sfz4 and sfz6 with functions in cell wall synthesis manipulation and carbohydrate hydrolysis, and sfz3 and sfz5 with unknown functions. The stalk-forming Betaproteobacteria Ferriphaselus R-1 and OYT-1, as well as dread-forming Zetaproteobacteria Mariprofundus aestuarium CP-5 and Mariprofundus ferrinatatus CP-8 contain distant sfz gene homologs, whereas stalk-less Zetaproteobacteria and Betaproteobacteria lack the entire gene cluster. Our pan-genome analysis further revealed a significant enrichment of clusters of orthologous groups (COGs) across all Zetaproteobacteria isolate genomes that are associated with the regulation of a switch between sessile and motile growth controlled by the intracellular signaling molecule c-di-GMP. Potential interactions between stalk-former unique transcription factor genes, sfz genes, and c-di-GMP point toward a c-di-GMP regulated surface attachment function of stalks during sessile growth. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 1826734
- PAR ID:
- 10325474
- Date Published:
- Journal Name:
- Frontiers in Microbiology
- Volume:
- 12
- ISSN:
- 1664-302X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Many bacterial species in nature possess the ability to transition into a sessile lifestyle and aggregate into cohesive colonies, known as biofilms. Within a biofilm, bacterial cells are encapsulated within an extracellular polymeric substance (EPS) comprised of polysaccharides, proteins, nucleic acids, lipids, and other small molecules. The transition from planktonic growth to the biofilm lifecycle provides numerous benefits to bacteria, such as facilitating adherence to abiotic surfaces, evasion of a host immune system, and resistance to common antibiotics. As a result, biofilm-forming bacteria contribute to 65% of infections in humans, and substantially increase the energy and time required for treatment and recovery. Several biofilm specific exopolysaccharides, including cellulose, alginate, Pel polysaccharide, and poly- N -acetylglucosamine (PNAG), have been shown to play an important role in bacterial biofilm formation and their production is strongly correlated with pathogenicity and virulence. In many bacteria the biosynthetic machineries required for assembly of these exopolysaccharides are regulated by common signaling molecules, with the second messenger cyclic di-guanosine monophosphate (c - di-GMP) playing an especially important role in the post-translational activation of exopolysaccharide biosynthesis. Research on treatments of antibiotic-resistant and biofilm-forming bacteria through direct targeting of c-di-GMP signaling has shown promise, including peptide-based treatments that sequester intracellular c-di-GMP. In this review, we will examine the direct role c-di-GMP plays in the biosynthesis and export of biofilm exopolysaccharides with a focus on the mechanism of post-translational activation of these pathways, as well as describe novel approaches to inhibit biofilm formation through direct targeting of c-di-GMP.more » « less
- 
            Yount, Jacob (Ed.)ABSTRACT Building iron-sulfur (Fe-S) clusters and assembling Fe-S proteins are essential actions for life on Earth. The three processes that sustain life, photosynthesis, nitrogen fixation, and respiration, require Fe-S proteins. Genes coding for Fe-S proteins can be found in nearly every sequenced genome. Fe-S proteins have a wide variety of functions, and therefore, defective assembly of Fe-S proteins results in cell death or global metabolic defects. Compared to alternative essential cellular processes, there is less known about Fe-S cluster synthesis and Fe-S protein maturation. Moreover, new factors involved in Fe-S protein assembly continue to be discovered. These facts highlight the growing need to develop a deeper biological understanding of Fe-S cluster synthesis, holo-protein maturation, and Fe-S cluster repair. Here, we outline bacterial strategies used to assemble Fe-S proteins and the genetic regulation of these processes. We focus on recent and relevant findings and discuss future directions, including the proposal of using Fe-S protein assembly as an antipathogen target.more » « less
- 
            Cyclic dimeric guanosine monophosphate (c-di-GMP) serves as a second messenger that modulates bacterial cellular processes, including biofilm formation. While proteins containing both c-di-GMP synthesizing (GGDEF) and c-di-GMP hydrolyzing (EAL) domains are widely predicted in bacterial genomes, it is poorly understood how domains with opposing enzymatic activity are regulated within a single polypeptide. Herein, we report the characterization of a globin-coupled sensor protein (GCS) fromPaenibacillus dendritiformis(DcpG) with bifunctional c-di-GMP enzymatic activity. DcpG contains a regulatory sensor globin domain linked to diguanylate cyclase (GGDEF) and phosphodiesterase (EAL) domains that are differentially regulated by gas binding to the heme; GGDEF domain activity is activated by the Fe(II)-NO state of the globin domain, while EAL domain activity is activated by the Fe(II)-O2state. The in vitro activity of DcpG is mimicked in vivo by the biofilm formation ofP. dendritiformisin response to gaseous environment, with nitric oxide conditions leading to the greatest amount of biofilm formation. The ability of DcpG to differentially control GGDEF and EAL domain activity in response to ligand binding is likely due to the unusual properties of the globin domain, including rapid ligand dissociation rates and high midpoint potentials. Using structural information from small-angle X-ray scattering and negative stain electron microscopy studies, we developed a structural model of DcpG, providing information about the regulatory mechanism. These studies provide information about full-length GCS protein architecture and insight into the mechanism by which a single regulatory domain can selectively control output domains with opposing enzymatic activities.more » « less
- 
            Stress and starvation causes bacterial cells to activate the stringent response. This results in down-regulation of energy-requiring processes related to growth, as well as an upregulation of genes associated with survival and stress responses. Guanosine tetra- and pentaphosphates (collectively referred to as (p)ppGpp) are critical for this process. In Gram-positive bacteria, a main function of (p)ppGpp is to limit cellular levels of GTP, one consequence of which is reduced transcription of genes that require GTP as the initiating nucleotide, such as rRNA genes. In Streptomycetes, the stringent response is also linked to complex morphological differentiation and to production of secondary metabolites, including antibiotics. These processes are also influenced by the second messenger c-di-GMP. Since GTP is a substrate for both (p)ppGpp and c-di-GMP, a finely tuned regulation of cellular GTP levels is required to ensure adequate synthesis of these guanosine derivatives. Here, we discuss mechanisms that operate to control guanosine metabolism and how they impinge on the production of antibiotics in Streptomyces species.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    