Title: Bioinformatics Investigations of Universal Stress Proteins from Mercury-Methylating Desulfovibrionaceae
The presence of methylmercury in aquatic environments and marine food sources is of global concern. The chemical reaction for the addition of a methyl group to inorganic mercury occurs in diverse bacterial taxonomic groups including the Gram-negative, sulfate-reducing Desulfovibrionaceae family that inhabit extreme aquatic environments. The availability of whole-genome sequence datasets for members of the Desulfovibrionaceae presents opportunities to understand the microbial mechanisms that contribute to methylmercury production in extreme aquatic environments. We have applied bioinformatics resources and developed visual analytics resources to categorize a collection of 719 putative universal stress protein (USP) sequences predicted from 93 genomes of Desulfovibrionaceae. We have focused our bioinformatics investigations on protein sequence analytics by developing interactive visualizations to categorize Desulfovibrionaceae universal stress proteins by protein domain composition and functionally important amino acids. We identified 651 Desulfovibrionaceae universal stress protein sequences, of which 488 sequences had only one USP domain and 163 had two USP domains. The 488 single USP domain sequences were further categorized into 340 sequences with ATP-binding motif and 148 sequences without ATP-binding motif. The 163 double USP domain sequences were categorized into (1) both USP domains with ATP-binding motif (3 sequences); (2) both USP domains without ATP-binding motif (138 sequences); and (3) one USP domain with ATP-binding motif (21 sequences). We developed visual analytics resources to facilitate the investigation of these categories of datasets in the presence or absence of the mercury-methylating gene pair (hgcAB). 
Future research could utilize these functional categories to investigate the participation of universal stress proteins in the bacterial cellular uptake of inorganic mercury and methylmercury production, especially in anaerobic aquatic environments.
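The categorization scheme in the abstract (one or two USP domains, each with or without an ATP-binding motif) can be illustrated with a minimal Python sketch. The record types, field names, and category labels below are hypothetical and do not come from the article or its visual analytics resources. The sketch assumes each protein's USP domains have already been annotated with whether an ATP-binding motif was detected.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class UspDomain:
    # Hypothetical flag: True if an ATP-binding motif was detected in this domain.
    has_atp_motif: bool


@dataclass
class UspProtein:
    protein_id: str          # hypothetical identifier
    domains: List[UspDomain]  # USP domains predicted for this sequence


def categorize(protein: UspProtein) -> str:
    """Assign a protein to one of the five categories named in the abstract."""
    n_domains = len(protein.domains)
    n_with_motif = sum(d.has_atp_motif for d in protein.domains)
    if n_domains == 1:
        return "single_with_atp" if n_with_motif == 1 else "single_without_atp"
    if n_domains == 2:
        if n_with_motif == 2:
            return "double_both_atp"
        if n_with_motif == 0:
            return "double_no_atp"
        return "double_one_atp"
    # Anything else (zero or more than two USP domains) falls outside the scheme.
    return "other"


def tally(proteins: Iterable[UspProtein]) -> Counter:
    """Count how many proteins fall into each category."""
    return Counter(categorize(p) for p in proteins)
```

A tally produced this way could then be split by presence or absence of hgcAB in the source genome, which would require an additional per-genome annotation not modeled here.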
Award ID(s): 2029363; 1901377
PAR ID: 10295331
Journal Name: Microorganisms
Volume: 9
Issue: 8
ISSN: 2076-2607
Page Range / eLocation ID: 1780
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Choanoflagellates are single-celled eukaryotes with complex signaling pathways. They are considered the closest non-metazoan ancestors to mammals and other metazoans and form multicellular-like states called rosettes. The choanoflagellate Monosiga brevicollis contains over 150 PDZ domains, an important peptide-binding domain in all three domains of life (Archaea, Bacteria, and Eukarya). Therefore, an understanding of PDZ domain signaling pathways in choanoflagellates may provide insight into the origins of multicellularity. PDZ domains recognize the C-terminus of target proteins and regulate signaling and trafficking pathways, as well as cellular adhesion. Here, we developed a computational software suite, Domain Analysis and Motif Matcher (DAMM), that analyzes peptide-binding cleft sequence identity as compared with human PDZ domains and that can be used in combination with literature searches of known human PDZ-interacting sequences to predict target specificity in choanoflagellate PDZ domains. We used this program, protein biochemistry, fluorescence polarization, and structural analyses to characterize the specificity of A9UPE9_MONBE, a M. brevicollis PDZ domain-containing protein with no homology to any metazoan protein, finding that its PDZ domain is most similar to those of the DLG family. We then identified two endogenous sequences that bind A9UPE9 PDZ with <100 μM affinity, a value commonly considered the threshold for cellular PDZ–peptide interactions. Taken together, this approach can be used to predict cellular targets of previously uncharacterized PDZ domains in choanoflagellates and other organisms. Our data contribute to investigations into choanoflagellate signaling and how it informs metazoan evolution. 
  2. The CRISPR-associated protein 9 (Cas9) has been engineered as a precise gene editing tool to make double-strand breaks. CRISPR-associated protein 9 binds the folded guide RNA (gRNA) that serves as a binding scaffold to guide it to the target DNA duplex via a RecA-like strand-displacement mechanism but without ATP binding or hydrolysis. The target search begins with the protospacer adjacent motif or PAM-interacting domain, recognizing it at the major groove of the duplex and melting its downstream duplex where an RNA-DNA heteroduplex is formed at nanomolar affinity. The rate-limiting step is the formation of an R-loop structure where the HNH domain inserts between the target heteroduplex and the displaced non-target DNA strand. Once the R-loop structure is formed, the non-target strand is rapidly cleaved by RuvC and ejected from the active site. This event is immediately followed by cleavage of the target DNA strand by the HNH domain and product release. Within CRISPR-associated protein 9, the HNH domain is inserted into the RuvC domain near the RuvC active site via two linker loops that provide allosteric communication between the two active sites. Due to the high flexibility of these loops and active sites, biophysical techniques have been instrumental in characterizing the dynamics and mechanism of the CRISPR-associated protein 9 nucleases, aiding structural studies in the visualization of the complete active sites and relevant linker structures. Here, we review biochemical, structural, and biophysical studies on the underlying mechanism with emphasis on how CRISPR-associated protein 9 selects the target DNA duplex and rejects non-target sequences. 
  4. Receptor tyrosine kinases (RTKs) mediate the actions of growth factors in metazoans. In decapod crustaceans, RTKs are implicated in various physiological processes, such as molting and growth, limb regeneration, reproduction and sexual differentiation, and innate immunity. RTKs are organized into two main types: insulin receptors (InsRs) and growth factor receptors, which include epidermal growth factor receptor (EGFR), fibroblast growth factor receptor (FGFR), vascular endothelial growth factor receptor (VEGFR), and platelet-derived growth factor receptor (PDGFR). The identities of crustacean RTK genes are incomplete. A phylogenetic analysis of the CrusTome transcriptome database, which included all major crustacean taxa, showed that RTK sequences segregated into receptor clades representing InsR (72 sequences), EGFR (228 sequences), FGFR (129 sequences), and PDGFR/VEGFR (PVR; 235 sequences). These four receptor families were distinguished by the domain organization of the extracellular N-terminal region and motif sequences in the protein kinase catalytic domain in the C-terminus or the ligand-binding domain in the N-terminus. EGFR1 formed a single monophyletic group, while the other RTK sequences were divided into subclades, designated InsR1-3, FGFR1-3, and PVR1-2. In decapods, isoforms within the RTK subclades were common. InsRs were characterized by leucine-rich repeat, furin-like cysteine-rich, and fibronectin type 3 domains in the N-terminus. EGFRs had leucine-rich repeat, furin-like cysteine-rich, and growth factor IV domains. N-terminal regions of FGFR1 had one to three immunoglobulin-like domains, whereas FGFR2 had a cadherin tandem repeat domain. PVRs had between two and five immunoglobulin-like domains. A classification nomenclature of the four RTK classes, based on phylogenetic analysis and multiple sequence alignments, is proposed.
  5. Kent, Angela D. (Ed.)
    ABSTRACT Methylmercury is a potent bioaccumulating neurotoxin that is produced by specific microorganisms that methylate inorganic mercury. Methylmercury production in diverse anaerobic bacteria and archaea was recently linked to the hgcAB genes. However, the full phylogenetic and metabolic diversity of mercury-methylating microorganisms has not been fully unraveled due to the limited number of cultured experimentally verified methylators and the limitations of primer-based molecular methods. Here, we describe the phylogenetic diversity and metabolic flexibility of putative mercury-methylating microorganisms by hgcAB identification in publicly available isolate genomes and metagenome-assembled genomes (MAGs) as well as novel freshwater MAGs. We demonstrate that putative mercury methylators are much more phylogenetically diverse than previously known and that hgcAB distribution among genomes is most likely due to several independent horizontal gene transfer events. The microorganisms we identified possess diverse metabolic capabilities spanning carbon fixation, sulfate reduction, nitrogen fixation, and metal resistance pathways. We identified 111 putative mercury methylators in a set of previously published permafrost metatranscriptomes and demonstrated that different methylating taxa may contribute to hgcA expression at different depths. Overall, we provide a framework for illuminating the microbial basis of mercury methylation using genome-resolved metagenomics and metatranscriptomics to identify putative methylators based upon hgcAB presence and describe their putative functions in the environment. IMPORTANCE Accurately assessing the production of bioaccumulative neurotoxic methylmercury by characterizing the phylogenetic diversity, metabolic functions, and activity of methylators in the environment is crucial for understanding constraints on the mercury cycle. 
Much of our understanding of methylmercury production is based on cultured anaerobic microorganisms within the Deltaproteobacteria, Firmicutes, and Euryarchaeota. Advances in next-generation sequencing technologies have enabled large-scale cultivation-independent surveys of diverse and poorly characterized microorganisms from numerous ecosystems. We used genome-resolved metagenomics and metatranscriptomics to highlight the vast phylogenetic and metabolic diversity of putative mercury methylators and their depth-discrete activities in thawing permafrost. This work underscores the importance of using genome-resolved metagenomics to survey specific putative methylating populations of a given mercury-impacted ecosystem.