skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Predicting the zoonotic capacity of mammals to transmit SARS-CoV-2
Back and forth transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) between humans and animals will establish wild reservoirs of virus that endanger long-term efforts to control COVID-19 in people and to protect vulnerable animal populations. Better targeting surveillance and laboratory experiments to validate zoonotic potential requires predicting high-risk host species. A major bottleneck to this effort is the few species with available sequences for angiotensin-converting enzyme 2 receptor, a key receptor required for viral cell entry. We overcome this bottleneck by combining species' ecological and biological traits with three-dimensional modelling of host-virus protein–protein interactions using machine learning. This approach enables predictions about the zoonotic capacity of SARS-CoV-2 for greater than 5000 mammals—an order of magnitude more species than previously possible. Our predictions are strongly corroborated by in vivo studies. The predicted zoonotic capacity and proximity to humans suggest enhanced transmission risk from several common mammals, and priority areas of geographic overlap between these species and global COVID-19 hotspots. With molecular data available for only a small fraction of potential animal hosts, linking data across biological scales offers a conceptual advance that may expand our predictive modelling capacity for zoonotic viruses with similarly unknown host ranges.  more » « less
Award ID(s):
1717282 1947040
PAR ID:
10347548
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Proceedings of the Royal Society B: Biological Sciences
Volume:
288
Issue:
1963
ISSN:
0962-8452
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    The novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of COVID-19. The main receptor of SARS-CoV-2, angiotensin I converting enzyme 2 (ACE2), is now undergoing extensive scrutiny to understand the routes of transmission and sensitivity in different species. Here, we utilized a unique dataset of ACE2 sequences from 410 vertebrate species, including 252 mammals, to study the conservation of ACE2 and its potential to be used as a receptor by SARS-CoV-2. We designed a five-category binding score based on the conservation properties of 25 amino acids important for the binding between ACE2 and the SARS-CoV-2 spike protein. Only mammals fell into the medium to very high categories and only catarrhine primates into the very high category, suggesting that they are at high risk for SARS-CoV-2 infection. We employed a protein structural analysis to qualitatively assess whether amino acid changes at variable residues would be likely to disrupt ACE2/SARS-CoV-2 spike protein binding and found the number of predicted unfavorable changes significantly correlated with the binding score. Extending this analysis to human population data, we found only rare (frequency <0.001) variants in 10/25 binding sites. In addition, we found significant signals of selection and accelerated evolution in the ACE2 coding sequence across all mammals, and specific to the bat lineage. Our results, if confirmed by additional experimental data, may lead to the identification of intermediate host species for SARS-CoV-2, guide the selection of animal models of COVID-19, and assist the conservation of animals both in native habitats and in human care. 
    more » « less
  2. Tully, Damien (Ed.)
    Virus host shifts are generally associated with novel adaptations to exploit the cells of the new host species optimally. Surprisingly, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has apparently required little to no significant adaptation to humans since the start of the Coronavirus Disease 2019 (COVID-19) pandemic and to October 2020. Here we assess the types of natural selection taking place in Sarbecoviruses in horseshoe bats versus the early SARS-CoV-2 evolution in humans. While there is moderate evidence of diversifying positive selection in SARS-CoV-2 in humans, it is limited to the early phase of the pandemic, and purifying selection is much weaker in SARS-CoV-2 than in related bat Sarbecoviruses . In contrast, our analysis detects evidence for significant positive episodic diversifying selection acting at the base of the bat virus lineage SARS-CoV-2 emerged from, accompanied by an adaptive depletion in CpG composition presumed to be linked to the action of antiviral mechanisms in these ancestral bat hosts. The closest bat virus to SARS-CoV-2, RmYN02 (sharing an ancestor about 1976), is a recombinant with a structure that includes differential CpG content in Spike; clear evidence of coinfection and evolution in bats without involvement of other species. While an undiscovered “facilitating” intermediate species cannot be discounted, collectively, our results support the progenitor of SARS-CoV-2 being capable of efficient human–human transmission as a consequence of its adaptive evolutionary history in bats, not humans, which created a relatively generalist virus. 
    more » « less
  3. Pandemics originating from non-human animals highlight the need to understand how natural hosts have evolved in response to emerging human pathogens and which groups may be susceptible to infection and/or potential reservoirs to mitigate public health and conservation concerns. Multiple zoonotic coronaviruses, such as severe acute respiratory syndrome-associated coronavirus (SARS-CoV), SARS-CoV-2 and Middle Eastern respiratory syndrome-associated coronavirus (MERS-CoV), are hypothesized to have evolved in bats. We investigate angiotensin-converting enzyme 2 (ACE2), the host protein bound by SARS-CoV and SARS-CoV-2, and dipeptidyl-peptidase 4 (DPP4 or CD26), the host protein bound by MERS-CoV, in the largest bat datasets to date. Both the ACE2 and DPP4 genes are under strong selection pressure in bats, more so than in other mammals, and in residues that contact viruses. Additionally, mammalian groups vary in their similarity to humans in residues that contact SARS-CoV, SARS-CoV-2 and MERS-CoV, and increased similarity to humans in binding residues is broadly predictive of susceptibility to SARS-CoV-2. This work augments our understanding of the relationship between coronaviruses and mammals, particularly bats, provides taxonomically diverse data for studies of how host proteins are bound by coronaviruses and can inform surveillance, conservation and public health efforts. 
    more » « less
  4. Abstract SARS-CoV-2 receptor binding domains (RBDs) interact with both the ACE2 receptor and heparan sulfate on the surface of host cells to enhance SARS-CoV-2 infection. We show that suramin, a polysulfated synthetic drug, binds to the ACE2 receptor and heparan sulfate binding sites on the RBDs of wild-type, Delta, and Omicron variants. Specifically, heparan sulfate and suramin had enhanced preferential binding for Omicron RBD, and suramin is most potent against the live SARS-CoV-2 Omicron variant (B.1.1.529) when compared to wild type and Delta (B.1.617.2) variants in vitro. These results suggest that inhibition of live virus infection occurs through dual SARS-CoV-2 targets of S-protein binding and previously reported RNA-dependent RNA polymerase inhibition and offers the possibility for this and other polysulfated molecules to be used as potential therapeutic and prophylactic options against COVID-19. 
    more » « less
  5. Abstract Establishing the host range for novel viruses remains a challenge. Here, we address the challenge of identifying non-human animal coronaviruses that may infect humans by creating an artificial neural network model that learns from spike protein sequences of alpha and beta coronaviruses and their binding annotation to their host receptor. The proposed method produces a human-Binding Potential (h-BiP) score that distinguishes, with high accuracy, the binding potential among coronaviruses. Three viruses, previously unknown to bind human receptors, were identified: Bat coronavirus BtCoV/133/2005 and Pipistrellus abramus bat coronavirus HKU5-related (both MERS related viruses), andRhinolophus affiniscoronavirus isolate LYRa3 (a SARS related virus). We further analyze the binding properties of BtCoV/133/2005 and LYRa3 using molecular dynamics. To test whether this model can be used for surveillance of novel coronaviruses, we re-trained the model on a set that excludes SARS-CoV-2 and all viral sequences released after the SARS-CoV-2 was published. The results predict the binding of SARS-CoV-2 with a human receptor, indicating that machine learning methods are an excellent tool for the prediction of host expansion events. 
    more » « less