skip to main content


Title: Genetic and Structural Analysis of SARS-CoV-2 Spike Protein for Universal Epitope Selection
Abstract Evaluation of immunogenic epitopes for universal vaccine development in the face of ongoing SARS-CoV-2 evolution remains a challenge. Herein, we investigate the genetic and structural conservation of an immunogenically relevant epitope (C662–C671) of spike (S) protein across SARS-CoV-2 variants to determine its potential utility as a broad-spectrum vaccine candidate against coronavirus diseases. Comparative sequence analysis, structural assessment, and molecular dynamics simulations of C662–C671 epitope were performed. Mathematical tools were employed to determine its mutational cost. We found that the amino acid sequence of C662–C671 epitope is entirely conserved across the observed major variants of SARS-CoV-2 in addition to SARS-CoV. Its conformation and accessibility are predicted to be conserved, even in the highly mutated Omicron variant. Costly mutational rate in the context of energy expenditure in genome replication and translation can explain this strict conservation. These observations may herald an approach to developing vaccine candidates for universal protection against emergent variants of coronavirus.  more » « less
Award ID(s):
1915843 1832184 2019745
NSF-PAR ID:
10336928
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ;
Editor(s):
Ozkan, Banu
Date Published:
Journal Name:
Molecular Biology and Evolution
Volume:
39
Issue:
5
ISSN:
0737-4038
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The COVID-19 pandemic caused by SARS-CoV-2 sparked intensive research into the development of effective vaccines, 50 of which have been approved thus far, including the novel mRNA-based vaccines developed by Pfizer and Moderna. Although limiting the severity of the disease, the mRNA-based vaccines presented drawbacks, such as the cold chain requirement. Moreover, antibody levels generated by these vaccines decline significantly after 6 months. These vaccines deliver mRNA encoding the full-length spike (S) glycoprotein of SARS-CoV-2, but must be updated as new strains and variants of concern emerge, creating a demand for adjusted formulations and booster campaigns. To overcome these challenges, we have developed COVID-19 vaccine candidates based on the highly conserved SARS CoV-2, 809-826 B-cell peptide epitope (denoted 826) conjugated to cowpea mosaic virus (CPMV) nanoparticles and bacteriophage Qβ virus-like particles, both platforms have exceptional thermal stability and facilitate epitope delivery with inbuilt adjuvant activity. We evaluated two administration methods: subcutaneous injection and an implantable polymeric scaffold. Mice received a prime–boost regimen of 100 μg per dose (2 weeks apart) or a single dose of 200 μg administered as a liquid formulation, or a polymer implant. Antibody titers were evaluated longitudinally over 50 weeks. The vaccine candidates generally elicited an early Th2-biased immune response, which stimulates the production of SARS-CoV-2 neutralizing antibodies, followed by a switch to a Th1-biased response for most formulations. Exceptionally, vaccine candidate 826-CPMV (administered as prime-boost, soluble injection) elicited a balanced Th1/Th2 immune response, which is necessary to prevent pulmonary immunopathology associated with Th2 bias extremes. While the Qβ-based vaccine elicited overall higher antibody titers, the CPMV-induced antibodies had higher avidity. Regardless of the administration route and formulation, our vaccine candidates maintained high antibody titers for more than 50 weeks, confirming a potent and durable immune response against SARS-CoV-2 even after a single dose. 
    more » « less
  2. The pandemic caused by the SARS-CoV-2 virus, the agent responsible for the COVID-19 disease, has affected millions of people worldwide. There is constant search for new therapies to either prevent or mitigate the disease. Fortunately, we have observed the successful development of multiple vaccines. Most of them are focused on one viral envelope protein, the spike protein. However, such focused approaches may contribute for the rise of new variants, fueled by the constant selection pressure on envelope proteins, and the widespread dispersion of coronaviruses in nature. Therefore, it is important to examine other proteins, preferentially those that are less susceptible to selection pressure, such as the nucleocapsid (N) protein. Even though the N protein is less accessible to humoral response, peptides from its conserved regions can be presented by class I Human Leukocyte Antigen (HLA) molecules, eliciting an immune response mediated by T-cells. Given the increased number of protein sequences deposited in biological databases daily and the N protein conservation among viral strains, computational methods can be leveraged to discover potential new targets for SARS-CoV-2 and SARS-CoV-related viruses. Here we developed SARS-Arena, a user-friendly computational pipeline that can be used by practitioners of different levels of expertise for novel vaccine development. SARS-Arena combines sequence-based methods and structure-based analyses to (i) perform multiple sequence alignment (MSA) of SARS-CoV-related N protein sequences, (ii) recover candidate peptides of different lengths from conserved protein regions, and (iii) model the 3D structure of the conserved peptides in the context of different HLAs. We present two main Jupyter Notebook workflows that can help in the identification of new T-cell targets against SARS-CoV viruses. In fact, in a cross-reactive case study, our workflows identified a conserved N protein peptide (SPRWYFYYL) recognized by CD8 + T-cells in the context of HLA-B7 + . SARS-Arena is available at https://github.com/KavrakiLab/SARS-Arena . 
    more » « less
  3. Abstract

    The COVID‐19 pandemic continues to be a severe threat to human health, especially due to current and emerging SARS‐CoV‐2 variants with potential to escape humoral immunity developed after vaccination or infection. The development of broadly neutralizing antibodies that engage evolutionarily conserved epitopes on coronavirus spike proteins represents a promising strategy to improve therapy and prophylaxis against SARS‐CoV‐2 and variants thereof. Herein, a facile multivalent engineering approach is employed to achieve large synergistic improvements in the neutralizing activity of a SARS‐CoV‐2 cross‐reactive nanobody (VHH‐72) initially generated against SARS‐CoV. This synergy is epitope specific and is not observed for a second high‐affinity nanobody against a non‐conserved epitope in the receptor‐binding domain. Importantly, a hexavalent VHH‐72 nanobody retains binding to spike proteins from multiple highly transmissible SARS‐CoV‐2 variants (B.1.1.7 and B.1.351) and potently neutralizes them. Multivalent VHH‐72 nanobodies also display drug‐like biophysical properties, including high stability, high solubility, and low levels of non‐specific binding. The unique neutralizing and biophysical properties of VHH‐72 multivalent nanobodies make them attractive as therapeutics against SARS‐CoV‐2 variants.

     
    more » « less
  4. null (Ed.)
    The novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of COVID-19. The main receptor of SARS-CoV-2, angiotensin I converting enzyme 2 (ACE2), is now undergoing extensive scrutiny to understand the routes of transmission and sensitivity in different species. Here, we utilized a unique dataset of ACE2 sequences from 410 vertebrate species, including 252 mammals, to study the conservation of ACE2 and its potential to be used as a receptor by SARS-CoV-2. We designed a five-category binding score based on the conservation properties of 25 amino acids important for the binding between ACE2 and the SARS-CoV-2 spike protein. Only mammals fell into the medium to very high categories and only catarrhine primates into the very high category, suggesting that they are at high risk for SARS-CoV-2 infection. We employed a protein structural analysis to qualitatively assess whether amino acid changes at variable residues would be likely to disrupt ACE2/SARS-CoV-2 spike protein binding and found the number of predicted unfavorable changes significantly correlated with the binding score. Extending this analysis to human population data, we found only rare (frequency <0.001) variants in 10/25 binding sites. In addition, we found significant signals of selection and accelerated evolution in the ACE2 coding sequence across all mammals, and specific to the bat lineage. Our results, if confirmed by additional experimental data, may lead to the identification of intermediate host species for SARS-CoV-2, guide the selection of animal models of COVID-19, and assist the conservation of animals both in native habitats and in human care. 
    more » « less
  5. Abstract

    The glycosylation on the spike (S) protein of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes COVID-19, modulates the viral infection by altering conformational dynamics, receptor interaction and host immune responses. Several variants of concern (VOCs) of SARS-CoV-2 have evolved during the pandemic, and crucial mutations on the S protein of the virus have led to increased transmissibility and immune escape. In this study, we compare the site-specific glycosylation and overall glycomic profiles of the wild type Wuhan-Hu-1 strain (WT) S protein and five VOCs of SARS-CoV-2: Alpha, Beta, Gamma, Delta and Omicron. Interestingly, both N- and O-glycosylation sites on the S protein are highly conserved among the spike mutant variants, particularly at the sites on the receptor-binding domain (RBD). The conservation of glycosylation sites is noteworthy, as over 2 million SARS-CoV-2 S protein sequences have been reported with various amino acid mutations. Our detailed profiling of the glycosylation at each of the individual sites of the S protein across the variants revealed intriguing possible association of glycosylation pattern on the variants and their previously reported infectivity. While the sites are conserved, we observed changes in the N- and O-glycosylation profile across the variants. The newly emerged variants, which showed higher resistance to neutralizing antibodies and vaccines, displayed a decrease in the overall abundance of complex-type glycans with both fucosylation and sialylation and an increase in the oligomannose-type glycans across the sites. Among the variants, the glycosylation sites with significant changes in glycan profile were observed at both theN-terminal domain and RBD of S protein, with Omicron showing the highest deviation. The increase in oligomannose-type happens sequentially from Alpha through Delta. Interestingly, Omicron does not contain more oligomannose-type glycans compared to Delta but does contain more compared to the WT and other VOCs. O-glycosylation at the RBD showed lower occupancy in the VOCs in comparison to the WT. Our study on the sites and pattern of glycosylation on the SARS-CoV-2 S proteins across the VOCs may help to understand how the virus evolved to trick the host immune system. Our study also highlights how the SARS-CoV-2 virus has conserved bothN- andO- glycosylation sites on the S protein of the most successful variants even after undergoing extensive mutations, suggesting a correlation between infectivity/ transmissibility and glycosylation.

     
    more » « less