Title: Charge-based interactions through peptide position 4 drive diversity of antigen presentation by human leukocyte antigen class I molecules
Abstract

Human leukocyte antigen class I (HLA-I) molecules bind and present peptides at the cell surface to facilitate the induction of appropriate CD8+ T cell-mediated immune responses to pathogen- and self-derived proteins. The HLA-I peptide-binding cleft contains dominant anchor sites in the B and F pockets that interact primarily with amino acids at peptide position 2 and the C-terminus, respectively. Nonpocket peptide–HLA interactions also contribute to peptide binding and stability, but these secondary interactions are thought to be unique to individual HLA allotypes or to specific peptide antigens. Here, we show that two positively charged residues located near the top of the peptide-binding cleft facilitate interactions with negatively charged residues at position 4 of presented peptides; such residues occur at elevated frequencies across most HLA-I allotypes. Loss of these interactions impaired HLA-I/peptide binding and complex stability in both in vitro and in silico experiments. Furthermore, mutation of these arginine-65 (R65) and/or lysine-66 (K66) residues in HLA-A*02:01 and A*24:02 significantly reduced HLA-I cell surface expression while also reducing the diversity of the presented peptide repertoire by up to 5-fold. The impact of the R65 mutation demonstrates that nonpocket HLA-I/peptide interactions can constitute anchor motifs that exert an unexpectedly broad influence on HLA-I-mediated antigen presentation. These findings provide fundamental insights into peptide antigen binding that could broadly inform epitope discovery in the context of viral vaccine development and cancer immunotherapy.
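
The claim about position 4 rests on a simple quantity: the share of presented peptides that carry a negatively charged residue (aspartate, D, or glutamate, E) at that position. The following Python sketch shows one way such a frequency could be tallied. It is illustrative only and is not the authors' analysis pipeline; the function names and the toy peptide list are assumptions introduced here, and the sequences are placeholders rather than data from the study.

```python
from collections import Counter

# Amino acids with negatively charged side chains at physiological pH.
ACIDIC = {"D", "E"}


def p4_acidic_fraction(peptides):
    """Return the fraction of peptides whose 4th residue is D or E.

    Peptides shorter than 4 residues are ignored.
    """
    eligible = [p for p in peptides if len(p) >= 4]
    if not eligible:
        return 0.0
    hits = sum(1 for p in eligible if p[3] in ACIDIC)
    return hits / len(eligible)


def position_counts(peptides, position):
    """Count residues at a 1-based peptide position."""
    return Counter(p[position - 1] for p in peptides if len(p) >= position)


# Hypothetical toy input: four 9-mers, one of which has D at position 4.
example_peptides = ["QFKDNVILL", "QFKANVILL", "QFKPNVILL", "SPRWYFYYL"]

if __name__ == "__main__":
    print(p4_acidic_fraction(example_peptides))
    print(position_counts(example_peptides, 4))
```

Applied separately to the peptides eluted from each allotype, a tally like this could be compared against a background frequency to judge whether D/E at position 4 is enriched.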

 
NSF-PAR ID:
10369787
Publisher / Repository:
Oxford University Press
Journal Name:
PNAS Nexus
Volume:
1
Issue:
3
ISSN:
2752-6542
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Peptide binding to major histocompatibility complexes (MHCs) is a central component of the immune system, and understanding the mechanism behind stable peptide–MHC binding will aid the development of immunotherapies. While MHC binding is mostly influenced by the identity of the so-called anchor positions of the peptide, secondary interactions from nonanchor positions are known to play a role in complex stability. However, current MHC-binding prediction methods lack an analysis of the major conformational states and might underestimate the impact of secondary interactions. In this work, we present an atomically detailed analysis of peptide–MHC binding that can reveal the contributions of any interaction toward stability. We propose a simulation framework that uses both umbrella sampling and adaptive sampling to generate a Markov state model (MSM) for a coronavirus-derived peptide (QFKDNVILL) bound to one of the most prevalent MHC receptors in humans (HLA-A*24:02). While our model reaffirms the importance of the anchor positions of the peptide in establishing stable interactions, it also reveals the underestimated importance of position 4 (p4), a nonanchor position. We confirmed our results by simulating the impact of specific peptide mutations and validated these predictions through competitive binding assays. By comparing the MSM of the wild-type system with those of the D4A and D4P mutations, our modeling reveals stark differences in unbinding pathways. The analysis presented here can be applied to any peptide–MHC complex of interest with a structural model as input, representing an important step toward comprehensive modeling of the MHC class I pathway.

     
  2. Bias in neural network model training datasets has been observed to decrease prediction accuracy for groups underrepresented in training data. Thus, investigating the composition of training datasets used in machine learning models with healthcare applications is vital to ensure equity. Two such machine learning models are NetMHCpan-4.1 and NetMHCIIpan-4.0, used to predict antigen binding scores to major histocompatibility complex class I and II molecules, respectively. As antigen presentation is a critical step in mounting the adaptive immune response, previous work has used these or similar prediction models in a broad array of applications, from explaining asymptomatic viral infection to cancer neoantigen prediction. However, these models have also been shown to be biased toward hydrophobic peptides, suggesting the network could also contain other sources of bias. Here, we report that the composition of the networks’ training datasets is heavily biased toward European Caucasian individuals and against Asian and Pacific Islander individuals. We test the ability of NetMHCpan-4.1 and NetMHCpan-4.0 to distinguish true binders from randomly generated peptides on alleles not included in the training datasets. Unexpectedly, we fail to find evidence that the disparities in training data lead to a meaningful difference in prediction quality for alleles not present in the training data. We attempt to explain this result by mapping the HLA sequence space to determine the sequence diversity of the training dataset. Furthermore, we link the residues that have the greatest impact on NetMHCpan predictions to structural features for three alleles (HLA-A*34:01, HLA-C*04:03, HLA-DRB1*12:02).

     
  3. The pandemic caused by the SARS-CoV-2 virus, the agent responsible for the COVID-19 disease, has affected millions of people worldwide. There is a constant search for new therapies to either prevent or mitigate the disease. Fortunately, we have observed the successful development of multiple vaccines. Most of them are focused on one viral envelope protein, the spike protein. However, such focused approaches may contribute to the rise of new variants, fueled by the constant selection pressure on envelope proteins and the widespread dispersion of coronaviruses in nature. Therefore, it is important to examine other proteins, preferentially those that are less susceptible to selection pressure, such as the nucleocapsid (N) protein. Even though the N protein is less accessible to the humoral response, peptides from its conserved regions can be presented by class I Human Leukocyte Antigen (HLA) molecules, eliciting an immune response mediated by T-cells. Given the increased number of protein sequences deposited in biological databases daily and the N protein conservation among viral strains, computational methods can be leveraged to discover potential new targets for SARS-CoV-2 and SARS-CoV-related viruses. Here we developed SARS-Arena, a user-friendly computational pipeline that can be used by practitioners of different levels of expertise for novel vaccine development. SARS-Arena combines sequence-based methods and structure-based analyses to (i) perform multiple sequence alignment (MSA) of SARS-CoV-related N protein sequences, (ii) recover candidate peptides of different lengths from conserved protein regions, and (iii) model the 3D structure of the conserved peptides in the context of different HLAs. We present two main Jupyter Notebook workflows that can help in the identification of new T-cell targets against SARS-CoV viruses. In fact, in a cross-reactive case study, our workflows identified a conserved N protein peptide (SPRWYFYYL) recognized by CD8+ T-cells in the context of HLA-B7+. SARS-Arena is available at https://github.com/KavrakiLab/SARS-Arena.
  4. SH2B1 is a multidomain protein that serves as a key adaptor to regulate numerous cellular events, such as insulin, leptin, and growth hormone signaling pathways. Many of these protein‐protein interactions are mediated by the SH2 domain of SH2B1, which recognizes ligands containing a phosphorylated tyrosine (pY), including peptides derived from Janus kinase 2, insulin receptor, and insulin receptor substrate‐1 and −2. Specificity for the SH2 domain of SH2B1 is conferred in these ligands by either a hydrophobic or an acidic side chain at the +3 position C‐terminal to the pY. This specificity for chemically disparate species suggests that SH2B1 relies on distinct thermodynamic or structural mechanisms to bind to peptides. Using binding and structural strategies, we have identified unique thermodynamic signatures for each peptide binding mode, and several SH2B1 residues, including K575 and R578, that play distinct roles in peptide binding. The high‐resolution structure of the SH2 domain of SH2B1 further reveals conformationally plastic protein loops that may contribute to the ability of the protein to recognize dissimilar ligands. Together, numerous hydrophobic and electrostatic interactions, in addition to backbone conformational flexibility, permit the recognition of diverse peptides by SH2B1. An understanding of this expanded peptide recognition will allow for the identification of novel physiologically relevant SH2B1/peptide interactions, which can contribute to the design of obesity and diabetes pharmaceuticals to target the ligand‐binding interface of SH2B1 with high specificity.

     
  5. Human herpes virus 6B (HHV‐6B) is a widespread virus that infects most people early in infancy and establishes a chronic life‐long infection with periodic reactivation. CD4 T cells have been implicated in control of HHV‐6B, but antigenic targets and functional characteristics of the CD4 T‐cell response are poorly understood. We identified 25 naturally processed MHC‐II peptides, derived from six different HHV‐6B proteins, and showed that they were recognized by CD4 T‐cell responses in HLA‐matched donors. The peptides were identified by mass spectrometry after elution from HLA‐DR molecules isolated from HHV‐6B‐infected T cells. The peptides showed strong binding to matched HLA alleles and elicited recall T‐cell responses in vitro. T‐cell lines expanded in vitro were used for functional characterization of the response. Responding cells were mainly CD3+CD4+, produced IFN‐γ, TNF‐α, and low levels of IL‐2, alone or in combination, highlighting the presence of polyfunctional T cells in the overall response. Many of the responding cells mobilized CD107a, stored granzyme B, and mediated specific killing of peptide‐pulsed target cells. These results highlight a potential role for polyfunctional cytotoxic CD4 T cells in the long‐term control of HHV‐6B infection.

     