skip to main content


Title: Detection of Neanderthal Adaptively Introgressed Genetic Variants That Modulate Reporter Gene Expression in Human Immune Cells
Abstract Although some variation introgressed from Neanderthals has undergone selective sweeps, little is known about its functional significance. We used a Massively Parallel Reporter Assay (MPRA) to assay 5,353 high-frequency introgressed variants for their ability to modulate the gene expression within 170 bp of endogenous sequence. We identified 2,548 variants in active putative cis-regulatory elements (CREs) and 292 expression-modulating variants (emVars). These emVars are predicted to alter the binding motifs of important immune transcription factors, are enriched for associations with neutrophil and white blood cell count, and are associated with the expression of genes that function in innate immune pathways including inflammatory response and antiviral defense. We combined the MPRA data with other data sets to identify strong candidates to be driver variants of positive selection including an emVar that may contribute to protection against severe COVID-19 response. We endogenously deleted two CREs containing expression-modulation variants linked to immune function, rs11624425 and rs80317430, identifying their primary genic targets as ELMSAN1, and PAN2 and STAT2, respectively, three genes differentially expressed during influenza infection. Overall, we present the first database of experimentally identified expression-modulating Neanderthal-introgressed alleles contributing to potential immune response in modern humans.  more » « less
Award ID(s):
2020205
NSF-PAR ID:
10342838
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ;
Editor(s):
Falush, Daniel
Date Published:
Journal Name:
Molecular Biology and Evolution
Volume:
39
Issue:
1
ISSN:
0737-4038
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. INTRODUCTION Genome-wide association studies (GWASs) have identified thousands of human genetic variants associated with diverse diseases and traits, and most of these variants map to noncoding loci with unknown target genes and function. Current approaches to understand which GWAS loci harbor causal variants and to map these noncoding regulators to target genes suffer from low throughput. With newer multiancestry GWASs from individuals of diverse ancestries, there is a pressing and growing need to scale experimental assays to connect GWAS variants with molecular mechanisms. Here, we combined biobank-scale GWASs, massively parallel CRISPR screens, and single-cell sequencing to discover target genes of noncoding variants for blood trait loci with systematic targeting and inhibition of noncoding GWAS loci with single-cell sequencing (STING-seq). RATIONALE Blood traits are highly polygenic, and GWASs have identified thousands of noncoding loci that map to candidate cis -regulatory elements (CREs). By combining CRE-silencing CRISPR perturbations and single-cell readouts, we targeted hundreds of GWAS loci in a single assay, revealing target genes in cis and in trans . For select CREs that regulate target genes, we performed direct variant insertion. Although silencing the CRE can identify the target gene, direct variant insertion can identify magnitude and direction of effect on gene expression for the GWAS variant. In select cases in which the target gene was a transcription factor or microRNA, we also investigated the gene-regulatory networks altered upon CRE perturbation and how these networks differ across blood cell types. RESULTS We inhibited candidate CREs from fine-mapped blood trait GWAS variants (from ~750,000 individual of diverse ancestries) in human erythroid progenitors. In total, we targeted 543 variants (254 loci) mapping to candidate CREs, generating multimodal single-cell data including transcriptome, direct CRISPR gRNA capture, and cell surface proteins. We identified target genes in cis (within 500 kb) for 134 CREs. In most cases, we found that the target gene was the closest gene and that specific enhancer-associated biochemical hallmarks (H3K27ac and accessible chromatin) are essential for CRE function. Using multiple perturbations at the same locus, we were able to distinguished between causal variants from noncausal variants in linkage disequilibrium. For a subset of validated CREs, we also inserted specific GWAS variants using base-editing STING-seq (beeSTING-seq) and quantified the effect size and direction of GWAS variants on gene expression. Given our transcriptome-wide data, we examined dosage effects in cis and trans in cases in which the cis target is a transcription factor or microRNA. We found that trans target genes are also enriched for GWAS loci, and identified gene clusters within trans gene networks with distinct biological functions and expression patterns in primary human blood cells. CONCLUSION In this work, we investigated noncoding GWAS variants at scale, identifying target genes in single cells. These methods can help to address the variant-to-function challenges that are a barrier for translation of GWAS findings (e.g., drug targets for diseases with a genetic basis) and greatly expand our ability to understand mechanisms underlying GWAS loci. Identifying causal variants and their target genes with STING-seq. Uncovering causal variants and their target genes or function are a major challenge for GWASs. STING-seq combines perturbation of noncoding loci with multimodal single-cell sequencing to profile hundreds of GWAS loci in parallel. This approach can identify target genes in cis and trans , measure dosage effects, and decipher gene-regulatory networks. 
    more » « less
  2. Individuals infected with the SARS-CoV-2 virus present with a wide variety of symptoms ranging from asymptomatic to severe and even lethal outcomes. Past research has revealed a genetic haplotype on chromosome 3 that entered the human population via introgression from Neanderthals as the strongest genetic risk factor for the severe response to COVID-19. However, the specific variants along this introgressed haplotype that contribute to this risk and the biological mechanisms that are involved remain unclear. Here, we assess the variants present on the risk haplotype for their likelihood of driving the genetic predisposition to severe COVID-19 outcomes. We do this by first exploring their impact on the regulation of genes involved in COVID-19 infection using a variety of population genetics and functional genomics tools. We then perform a locus-specific massively parallel reporter assay to individually assess the regulatory potential of each allele on the haplotype in a multipotent immune-related cell line. We ultimately reduce the set of over 600 linked genetic variants to identify four introgressed alleles that are strong functional candidates for driving the association between this locus and severe COVID-19. Using reporter assays in the presence/absence of SARS-CoV-2 , we find evidence that these variants respond to viral infection. These variants likely drive the locus’ impact on severity by modulating the regulation of two critical chemokine receptor genes: CCR1 and CCR5 . These alleles are ideal targets for future functional investigations into the interaction between host genomics and COVID-19 outcomes. 
    more » « less
  3. Abstract Background Pancreatic cancer is a complex disease with a desmoplastic stroma, extreme hypoxia, and inherent resistance to therapy. Understanding the signaling and adaptive response of such an aggressive cancer is key to making advances in therapeutic efficacy. Redox factor-1 (Ref-1), a redox signaling protein, regulates the conversion of several transcription factors (TFs), including HIF-1α, STAT3 and NFκB from an oxidized to reduced state leading to enhancement of their DNA binding. In our previously published work, knockdown of Ref-1 under normoxia resulted in altered gene expression patterns on pathways including EIF2, protein kinase A, and mTOR. In this study, single cell RNA sequencing (scRNA-seq) and proteomics were used to explore the effects of Ref-1 on metabolic pathways under hypoxia. Methods scRNA-seq comparing pancreatic cancer cells expressing less than 20% of the Ref-1 protein was analyzed using left truncated mixture Gaussian model and validated using proteomics and qRT-PCR. The identified Ref-1’s role in mitochondrial function was confirmed using mitochondrial function assays, qRT-PCR, western blotting and NADP assay. Further, the effect of Ref-1 redox function inhibition against pancreatic cancer metabolism was assayed using 3D co-culture in vitro and xenograft studies in vivo. Results Distinct transcriptional variation in central metabolism, cell cycle, apoptosis, immune response, and genes downstream of a series of signaling pathways and transcriptional regulatory factors were identified in Ref-1 knockdown vs Scrambled control from the scRNA-seq data. Mitochondrial DEG subsets downregulated with Ref-1 knockdown were significantly reduced following Ref-1 redox inhibition and more dramatically in combination with Devimistat in vitro. Mitochondrial function assays demonstrated that Ref-1 knockdown and Ref-1 redox signaling inhibition decreased utilization of TCA cycle substrates and slowed the growth of pancreatic cancer co-culture spheroids. In Ref-1 knockdown cells, a higher flux rate of NADP + consuming reactions was observed suggesting the less availability of NADP + and a higher level of oxidative stress in these cells. In vivo xenograft studies demonstrated that tumor reduction was potent with Ref-1 redox inhibitor similar to Devimistat. Conclusion Ref-1 redox signaling inhibition conclusively alters cancer cell metabolism by causing TCA cycle dysfunction while also reducing the pancreatic tumor growth in vitro as well as in vivo. 
    more » « less
  4. SUMMARY

    Transcriptional regulators of the general stress response (GSR) reprogram the expression of selected genes to transduce informational signals into cellular events, ultimately manifested in a plant's ability to cope with environmental challenges. Identification of the core GSR regulatory proteins will uncover the principal modules and their mode of action in the establishment of adaptive responses. To define the GSR regulatory components, we employed a yeast‐one‐hybrid assay to identify the protein(s) binding to the previously established functional GSR motif, termed the rapid stress response element (RSRE). This led to the isolation of octadecanoid‐responsive AP2/ERF‐domain transcription factor 47 (ORA47), a methyl jasmonate inducible protein. Subsequently, ORA47 transcriptional activity was confirmed using the RSRE‐driven luciferase (LUC) activity assay performed in the ORA47 loss‐ and gain‐of‐function lines introgressed into the 4xRSRE::Luc background. In addition, the prime contribution of CALMODULIN‐BINDING TRANSCRIPTIONAL ACTIVATOR3 (CAMTA3) protein in the induction of RSRE was reaffirmed by genetic studies. Moreover, exogenous application of methyl jasmonate led to enhanced levels ofORA47andCAMTA3transcripts, as well as the induction of RSRE::LUC activity. Metabolic analyses illustrated the reciprocal functional inputs of ORA47 and CAMTA3 in increasing JA levels. Lastly, transient assays identified JASMONATE ZIM‐domain1 (JAZ1) as a repressor of RSRE::LUC activity. Collectively, the present study provides fresh insight into the initial features of the mechanism that transduces informational signals into adaptive responses. This mechanism involves the functional interplay between the JA biosynthesis/signaling cascade and the transcriptional reprogramming that potentiates GSR. Furthermore, these findings offer a window into the role of intraorganellar communication in the establishment of adaptive responses.

     
    more » « less
  5. The genetic variants introduced into the ancestors of modern humans from interbreeding with Neanderthals have been suggested to contribute an unexpected extent to complex human traits. However, testing this hypothesis has been challenging due to the idiosyncratic population genetic properties of introgressed variants. We developed rigorous methods to assess the contribution of introgressed Neanderthal variants to heritable trait variation and applied these methods to analyze 235,592 introgressed Neanderthal variants and 96 distinct phenotypes measured in about 300,000 unrelated white British individuals in the UK Biobank. Introgressed Neanderthal variants make a significant contribution to trait variation (explaining 0.12% of trait variation on average). However, the contribution of introgressed variants tends to be significantly depleted relative to modern human variants matched for allele frequency and linkage disequilibrium (about 59% depletion on average), consistent with purifying selection on introgressed variants. Different from previous studies (McArthur et al., 2021), we find no evidence for elevated heritability across the phenotypes examined. We identified 348 independent significant associations of introgressed Neanderthal variants with 64 phenotypes. Previous work (Skov et al., 2020) has suggested that a majority of such associations are likely driven by statistical association with nearby modern human variants that are the true causal variants. Applying a customized fine-mapping led us to identify 112 regions across 47 phenotypes containing 4303 unique genetic variants where introgressed variants are highly likely to have a phenotypic effect. Examination of these variants reveals their substantial impact on genes that are important for the immune system, development, and metabolism. 
    more » « less