skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Allele-specific activation, enzyme kinetics, and inhibitor sensitivities of EGFR exon 19 deletion mutations in lung cancer
Oncogenic mutations within the epidermal growth factor receptor (EGFR) are found in 15 to 30% of all non–small-cell lung carcinomas. The term exon 19 deletion (ex19del) is collectively used to refer to more than 20 distinct genomic alterations within exon 19 that comprise the most common EGFR mutation subtype in lung cancer. Despite this heterogeneity, clinical treatment decisions are made irrespective of which EGFR ex19del variant is present within the tumor, and there is a paucity of information regarding how individual ex19del variants influence protein structure and function. Herein, we identified allele-specific functional differences among ex19del variants attributable to recurring sequence and structure motifs. We built all-atom structural models of 60 ex19del variants identified in patients and combined molecular dynamics simulations with biochemical and biophysical experiments to analyze three ex19del mutations (E746_A750, E746_S752 > V, and L747_A750 > P). We demonstrate that sequence variation in ex19del alters oncogenic cell growth, dimerization propensity, enzyme kinetics, and tyrosine kinase inhibitor (TKI) sensitivity. We show that in contrast to E746_A750 and E746_S752 > V, the L747_A750 > P variant forms highly active ligand-independent dimers. Enzyme kinetic analysis and TKI inhibition experiments suggest that E746_S752 > V and L747_A750 > P display reduced TKI sensitivity due to decreased adenosine 5′-triphosphate K m . Through these analyses, we propose an expanded framework for interpreting ex19del variants and considerations for therapeutic intervention.  more » « less
Award ID(s):
1753060 2308307
PAR ID:
10421702
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
119
Issue:
30
ISSN:
0027-8424
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Mechanistic understanding of oncogenic variants facilitates the development and optimization of treatment strategies. We recently identified in-frame, tandem duplication ofEGFRexons 18 - 25, which causes EGFR Kinase Domain Duplication (EGFR-KDD). Here, we characterize the prevalence ofERBBfamily KDDs across multiple human cancers and evaluate the functional biochemistry of EGFR-KDD as it relates to pathogenesis and potential therapeutic intervention. We provide computational and experimental evidence that EGFR-KDD functions by forming asymmetric EGF-independent intra-molecular and EGF-dependent inter-molecular dimers. Time-resolved fluorescence microscopy and co-immunoprecipitation reveals EGFR-KDD can form ligand-dependent inter-molecular homo- and hetero-dimers/multimers. Furthermore, we show that inhibition of EGFR-KDD activity is maximally achieved by blocking both intra- and inter-molecular dimerization. Collectively, our findings define a previously unrecognized model of EGFR dimerization, providing important insights for the understanding of EGFR activation mechanisms and informing personalized treatment of patients with tumors harboring EGFR-KDD. Finally, we establishERBBKDDs as recurrent oncogenic events in multiple cancers. 
    more » « less
  2. Numerous studies have shown genetic variation at the LCORL-NCAPG locus is strongly associated with growth traits in beef cattle. However, a causative molecular variant has yet to be identified. To define all possible candidate variants, 34 Charolais-sired calves were whole-genome sequenced, including 17 homozygous for a long-range haplotype associated with increased growth (QQ) and 17 homozygous for potential ancestral haplotypes for this region (qq). The Q haplotype was refined to an 814 kb region between chr6:37,199,897–38,014,080 and contained 218 variants not found in qq individuals. These variants include an insertion in an intron of NCAPG, a previously documented mutation in NCAPG (rs109570900), two coding sequence mutations in LCORL (rs109696064 and rs384548488), and 15 variants located within ATAC peaks that were predicted to affect transcription factor binding. Notably, rs384548488 is a frameshift variant likely resulting in loss of function for long isoforms of LCORL. To test the association of the coding sequence variants of LCORL with phenotype, 405 cattle from five populations were genotyped. The two variants were in complete linkage disequilibrium. Statistical analysis of the three populations that contained QQ animals revealed significant (p < 0.05) associations with genotype and birth weight, live weight, carcass weight, hip height, and average daily gain. These findings affirm the link between this locus and growth in beef cattle and describe DNA variants that define the haplotype. However, further studies will be required to define the true causative mutation. 
    more » « less
  3. Abstract Background3′-end processing by cleavage and polyadenylation is an important and finely tuned regulatory process during mRNA maturation. Numerous genetic variants are known to cause or contribute to human disorders by disrupting the cis-regulatory code of polyadenylation signals. Yet, due to the complexity of this code, variant interpretation remains challenging. ResultsWe introduce a residual neural network model,APARENT2, that can infer 3′-cleavage and polyadenylation from DNA sequence more accurately than any previous model. This model generalizes to the case of alternative polyadenylation (APA) for a variable number of polyadenylation signals. We demonstrate APARENT2’s performance on several variant datasets, including functional reporter data and human 3′ aQTLs from GTEx. We apply neural network interpretation methods to gain insights into disrupted or protective higher-order features of polyadenylation. We fine-tune APARENT2 on human tissue-resolved transcriptomic data to elucidate tissue-specific variant effects. By combining APARENT2 with models of mRNA stability, we extend aQTL effect size predictions to the entire 3′ untranslated region. Finally, we perform in silico saturation mutagenesis of all human polyadenylation signals and compare the predicted effects of$${>}43$$ > 43 million variants against gnomAD. While loss-of-function variants were generally selected against, we also find specific clinical conditions linked to gain-of-function mutations. For example, we detect an association between gain-of-function mutations in the 3′-end and autism spectrum disorder. To experimentally validate APARENT2’s predictions, we assayed clinically relevant variants in multiple cell lines, including microglia-derived cells. ConclusionsA sequence-to-function model based on deep residual learning enables accurate functional interpretation of genetic variants in polyadenylation signals and, when coupled with large human variation databases, elucidates the link between functional 3′-end mutations and human health. 
    more » « less
  4. While worldwide efforts for improving COVID-19 vaccines are currently considered a top priority, the role of the genetic variants responsible for virus receptor protein stability is less studied. Angiotensin-converting enzyme-2 is the primary target of the SARS-CoV-1/SARS-CoV-2 spike (S) glycoprotein, enabling entry into the human body. Here, we applied computational saturation mutagenesis approaches to determine the folding energy caused by all possible mutations in ACE2 proteins within ACE2 - SARS-CoV-1-S/ACE2 - SARS-CoV-2-S complexes. We observed ACE2 mutations at residue D350 causing the most stabilizing effects on the protein. In addition, we identified ACE2 genetic variations in African Americans (rs73635825, rs766996587, and rs780574871), Latino Americans (rs924799658), and both groups (rs4646116 and rs138390800) affecting stability in the ACE2 - SARS-CoV-2-S complex. The findings in this study may aid in targeting the design of stable neutralizing peptides for treating minority patients. 
    more » « less
  5. Elkins, Christopher A. (Ed.)
    ABSTRACT Monitoring the prevalence of SARS-CoV-2 variants is necessary to make informed public health decisions during the COVID-19 pandemic. PCR assays have received global attention, facilitating a rapid understanding of variant dynamics because they are more accessible and scalable than genome sequencing. However, as PCR assays target only a few mutations, their accuracy could be reduced when these mutations are not exclusive to the target variants. Here we introduce PRIMES, an algorithm that evaluates the sensitivity and specificity of SARS-CoV-2 variant-specific PCR assays across different geographical regions by incorporating sequences deposited in the GISAID database. Using PRIMES, we determined that the accuracy of several PCR assays decreased when applied beyond the geographic scope of the study in which the assays were developed. Subsequently, we used this tool to design Alpha and Delta variant-specific PCR assays for samples from Illinois, USA. In silico analysis using PRIMES determined the sensitivity/specificity to be 0.99/0.99 for the Alpha variant-specific PCR assay and 0.98/1.00 for the Delta variant-specific PCR assay in Illinois, respectively. We applied these two variant-specific PCR assays to six local sewage samples and determined the dominant SARS-CoV-2 variant of either the wild type, the Alpha variant, or the Delta variant. Using next-generation sequencing (NGS) of the spike (S) gene amplicons of the Delta variant-dominant samples, we found six mutations exclusive to the Delta variant (S:T19R, S:Δ156/157, S:L452R, S:T478K, S:P681R, and S:D950N). The consistency between the variant-specific PCR assays and the NGS results supports the applicability of PRIMES. IMPORTANCE Monitoring the introduction and prevalence of variants of concern (VOCs) and variants of interest (VOIs) in a community can help the local authorities make informed public health decisions. PCR assays can be designed to keep track of SARS-CoV-2 variants by measuring unique mutation markers that are exclusive to the target variants. However, the mutation markers may not be exclusive to the target variants because of regional and temporal differences in variant dynamics. We introduce PRIMES, an algorithm that enables the design of reliable PCR assays for variant detection. Because PCR is more accessible, scalable, and robust for sewage samples than sequencing technology, our findings will contribute to improving global SARS-CoV-2 variant surveillance. 
    more » « less