skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: TOWARDS INTERPRETING ZOONOTIC POTENTIAL OF BETACORONAVIRUS SEQUENCES WITH ATTENTION
Current methods for viral discovery target evolutionarily conserved proteins that accurately identify virus families but remain unable to distinguish the zoonotic potential of newly discovered viruses. Here, we apply an attention-enhanced longshort- term memory (LSTM) deep neural net classifier to a highly conserved viral protein target to predict zoonotic potential across betacoronaviruses. The classifier performs with a 94% accuracy. Analysis and visualization of attention at the sequence and structure-level features indicate possible association between important protein-protein interactions governing viral replication in zoonotic betacoronaviruses and zoonotic transmission.  more » « less
Award ID(s):
1717282
PAR ID:
10350220
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
ICLR
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Liu, Shan-Lu (Ed.)
    ABSTRACT Betacoronavirusesencode a conserved accessory gene within the +1 open reading frame (ORF) of nucleocapsid called the internal N gene. This gene is referred to as “I” for mouse hepatitis virus (MHV), ORF9b for severe acute respiratory CoV (SARS-CoV) and SARS-CoV-2, and ORF8b for Middle East respiratory syndrome CoV (MERS-CoV). Previous studies have shown ORF8b and ORF9b have immunoevasive properties, while the only known information for MHV I is its localization within the virion of the hepatotropic/neurotropic A59 strain of MHV. Whether MHV I is an innate immune antagonist or has other functions has not been evaluated. In this report, we show that the I protein of the neurotropic JHM strain of MHV (JHMV) lacks a N terminal domain present in other MHV strains, has immunoevasive properties, and is a component of the virion. Genetic deletion of JHMV I (rJHMVIΔ57-137) resulted in a highly attenuated virus bothin vitroandin vivothat displayed a post RNA replication/transcription defect that ultimately resulted in fewer infectious virions packaged compared with wild-type virus. This phenotype was only seen for rJHMVIΔ57-137, suggesting the structural changes predicted for A59 I altered its function, as genetic deletion of A59 I did not change viral replication or pathogenicity. Together, these data show that JHMV I both acts as a mild innate immune antagonist and aids in viral assembly and infectious virus production, and suggest that the internal N proteins from different betacoronaviruses have both common and virus strain-specific properties.IMPORTANCECoV accessory genes are largely studied in overexpression assays and have been identified as innate immune antagonists. However, functions identified after overexpression are often not confirmed in the infected animal host. Furthermore, some accessory proteins are components of the CoV virion, but their role in viral replication and release remains unclear. Here, we utilized reverse genetics to abrogate expression of a conserved CoV accessory gene, the internal N (“I”) gene, of the neurotropic JHMV strain of MHV and found that loss of the I gene resulted in a post replication defect that reduced virion assembly and ultimately infectious virus production, while also increasing some inflammatory molecule expression. Thus, the JHMV I protein has roles in virion assembly that were previously underappreciated and in immunoevasion. 
    more » « less
  2. Back and forth transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) between humans and animals will establish wild reservoirs of virus that endanger long-term efforts to control COVID-19 in people and to protect vulnerable animal populations. Better targeting surveillance and laboratory experiments to validate zoonotic potential requires predicting high-risk host species. A major bottleneck to this effort is the few species with available sequences for angiotensin-converting enzyme 2 receptor, a key receptor required for viral cell entry. We overcome this bottleneck by combining species' ecological and biological traits with three-dimensional modelling of host-virus protein–protein interactions using machine learning. This approach enables predictions about the zoonotic capacity of SARS-CoV-2 for greater than 5000 mammals—an order of magnitude more species than previously possible. Our predictions are strongly corroborated by in vivo studies. The predicted zoonotic capacity and proximity to humans suggest enhanced transmission risk from several common mammals, and priority areas of geographic overlap between these species and global COVID-19 hotspots. With molecular data available for only a small fraction of potential animal hosts, linking data across biological scales offers a conceptual advance that may expand our predictive modelling capacity for zoonotic viruses with similarly unknown host ranges. 
    more » « less
  3. Abstract The ongoing COVID-19 pandemic highlights the necessity for a more fundamental understanding of the coronavirus life cycle. The causative agent of the disease, SARS-CoV-2, is being studied extensively from a structural standpoint in order to gain insight into key molecular mechanisms required for its survival. Contained within the untranslated regions of the SARS-CoV-2 genome are various conserved stem-loop elements that are believed to function in RNA replication, viral protein translation, and discontinuous transcription. While the majority of these regions are variable in sequence, a 41-nucleotide s2m element within the genome 3′ untranslated region is highly conserved among coronaviruses and three other viral families. In this study, we demonstrate that the SARS-CoV-2 s2m element dimerizes by forming an intermediate homodimeric kissing complex structure that is subsequently converted to a thermodynamically stable duplex conformation. This process is aided by the viral nucleocapsid protein, potentially indicating a role in mediating genome dimerization. Furthermore, we demonstrate that the s2m element interacts with multiple copies of host cellular microRNA (miRNA) 1307-3p. Taken together, our results highlight the potential significance of the dimer structures formed by the s2m element in key biological processes and implicate the motif as a possible therapeutic drug target for COVID-19 and other coronavirus-related diseases. 
    more » « less
  4. null (Ed.)
    The bovine immune system is known for its unusual traits relating to immunoglobulin and antiviral responses. Peptidylarginine deiminases (PADs) are phylogenetically conserved enzymes that cause post-translational deimination, contributing to protein moonlighting in health and disease. PADs also regulate extracellular vesicle (EV) release, forming a critical part of cellular communication. As PAD-mediated mechanisms in bovine immunology and physiology remain to be investigated, this study profiled deimination signatures in serum and serum-EVs in Bos taurus. Bos EVs were poly-dispersed in a 70–500 nm size range and showed differences in deiminated protein cargo, compared with whole sera. Key immune, metabolic and gene regulatory proteins were identified to be post-translationally deiminated with some overlapping hits in sera and EVs (e.g., immunoglobulins), while some were unique to either serum or serum-EVs (e.g., histones). Protein–protein interaction network analysis of deiminated proteins revealed KEGG pathways common for serum and serum-EVs, including complement and coagulation cascades, viral infection (enveloped viruses), viral myocarditis, bacterial and parasitic infections, autoimmune disease, immunodeficiency intestinal IgA production, B-cell receptor signalling, natural killer cell mediated cytotoxicity, platelet activation and hematopoiesis, alongside metabolic pathways including ferroptosis, vitamin digestion and absorption, cholesterol metabolism and mineral absorption. KEGG pathways specific to EVs related to HIF-1 signalling, oestrogen signalling and biosynthesis of amino acids. KEGG pathways specific for serum only, related to Epstein–Barr virus infection, transcription mis-regulation in cancer, bladder cancer, Rap1 signalling pathway, calcium signalling pathway and ECM-receptor interaction. This indicates differences in physiological and pathological pathways for deiminated proteins in serum-EVs, compared with serum. Our findings may shed light on pathways underlying a number of pathological and anti-pathogenic (viral, bacterial, parasitic) pathways, with putative translatable value to human pathologies, zoonotic diseases and development of therapies for infections, including anti-viral therapies. 
    more » « less
  5. The s2m, a highly conserved 41-nt hairpin structure in the SARS-CoV-2 genome, serves as an attractive therapeutic target that may have important roles in the virus life cycle or interactions with the host. However, the conserved s2m in Delta SARS-CoV-2, a previously dominant variant characterized by high infectivity and disease severity, has received relatively less attention than that of the original SARS-CoV-2 virus. The focus of this work is to identify and define the s2m changes between Delta and SARS-CoV-2 and the subsequent impact of those changes upon the s2m dimerization and interactions with the host microRNA miR-1307-3p. Bioinformatics analysis of the GISAID database targeting the s2m element reveals a >99% correlation of a single nucleotide mutation at the 15th position (G15U) in Delta SARS-CoV-2. Based on1H NMR spectroscopy assignments comparing the imino proton resonance region of s2m and the s2m G15U at 19°C, we show that the U15–A29 base pair closes, resulting in a stabilization of the upper stem without overall secondary structure deviation. Increased stability of the upper stem did not affect the chaperone activity of the viral N protein, as it was still able to convert the kissing dimers formed by s2m G15U into a stable duplex conformation, consistent with the s2m reference. However, we show that the s2m G15U mutation drastically impacts the binding of host miR-1307-3p. These findings demonstrate that the observed G15U mutation alters the secondary structure of s2m with subsequent impact on viral binding of host miR-1307-3p, with potential consequences on immune responses. 
    more » « less