skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: DORGE: Discovery of Oncogenes and tumoR suppressor genes using Genetic and Epigenetic features
Data-driven discovery of cancer driver genes, including tumor suppressor genes (TSGs) and oncogenes (OGs), is imperative for cancer prevention, diagnosis, and treatment. Although epigenetic alterations are important for tumor initiation and progression, most known driver genes were identified based on genetic alterations alone. Here, we developed an algorithm, DORGE (Discovery of Oncogenes and tumor suppressoR genes using Genetic and Epigenetic features), to identify TSGs and OGs by integrating comprehensive genetic and epigenetic data. DORGE identified histone modifications as strong predictors for TSGs, and it found missense mutations, super enhancers, and methylation differences as strong predictors for OGs. We extensively validated DORGE-predicted cancer driver genes using independent functional genomics data. We also found that DORGE-predicted dual-functional genes (both TSGs and OGs) are enriched at hubs in protein-protein interaction and drug-gene networks. Overall, our study has deepened the understanding of epigenetic mechanisms in tumorigenesis and revealed previously undetected cancer driver genes.  more » « less
Award ID(s):
1846216
PAR ID:
10219958
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Science Advances
Volume:
6
Issue:
46
ISSN:
2375-2548
Page Range / eLocation ID:
eaba6784
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Background: Colorectal cancer (CRC) is a term that refers to the combination of colon and rectal cancer as they are being treated as a single tumor. In CRC, 72% of tumors are colon cancer, while the other 28% represent rectal cancer. CRC is a multifactorial disease caused by both genetic and epigenetic changes in the colon mucosal cells, affecting the oncogenes, DNA repair genes, and tumor suppressor genes. Currently, two DNA methylation-based biomarkers for CRC have received FDA approval: SEPT9, used in blood-based screening tests, and a combination of NDRG4 and BMP3 for stool-based tests. Although DNA methylation biomarkers have been explored in colorectal cancer (CRC), the identification of robust and clinically valuable biomarkers remains a challenge, particularly for early-stage detection and precancerous lesions. Patients often receive diagnoses at the locally advanced stage, which limits the potential utility of current biomarkers in clinical settings. Methods: The datasets used in this study were retrieved from the GEO database, specifically GSE75548 and GSE75546 for rectal cancer and GSE50760 and GSE101764 for colon cancer, summing up to a total of 130 paired samples. These datasets represent expression profiling by array, methylation profiling by genome tiling array, and expression profiling by high-throughput sequencing and include rectal and colon cancer samples paired with adjacent normal tissue samples. Differential analysis was used to identify differentially methylated CPG sites (DMCs) and identify differentially expressed genes (DEGs). Results: From the integration of DMCs with DEGs in colorectal cancer, we identified 150 candidates for methylation-regulated genes (MRGs) with two genes common across all cohorts (GNG7 and PDX1) highlighted as candidate biomarkers in CRC. The functional enrichment analysis and protein–protein interactions (PPIs) identified relevant pathways involved in CRC, including the Wnt signaling pathway, extracellular matrix (ECM) organization, among other enriched pathways. Conclusions: Our findings show the strength of our in silco computational approach in jointly identifying methylation-regulated biomarkers for colon cancer and highlight several genes and pathways as biomarker candidates for further investigations. 
    more » « less
  2. Abstract The discovery of cancer driver mutations is a fundamental goal in cancer research. While many cancer driver mutations have been discovered in the protein-coding genome, research into potential cancer drivers in the non-coding regions showed limited success so far. Here, we present a novel comprehensive framework Dr.Nod for detection of non-coding cis-regulatory candidate driver mutations that are associated with dysregulated gene expression using tissue-matched enhancer-gene annotations. Applying the framework to data from over 1500 tumours across eight tissues revealed a 4.4-fold enrichment of candidate driver mutations in regulatory regions of known cancer driver genes. An overarching conclusion that emerges is that the non-coding driver mutations contribute to cancer by significantly altering transcription factor binding sites, leading to upregulation of tissue-matched oncogenes and down-regulation of tumour-suppressor genes. Interestingly, more than half of the detected cancer-promoting non-coding regulatory driver mutations are over 20 kb distant from the cancer-associated genes they regulate. Our results show the importance of tissue-matched enhancer-gene maps, functional impact of mutations, and complex background mutagenesis model for the prediction of non-coding regulatory drivers. In conclusion, our study demonstrates that non-coding mutations in enhancers play a previously underappreciated role in cancer and dysregulation of clinically relevant target genes. 
    more » « less
  3. Cancer is a group of diseases characterized by uncontrolled growth and spread of abnormal cells. The underlying cause of cancer relates to the cell cycle, during which DNA is replicated. Cancer cells accumulate DNA mutations that help them acquire cancerous features, such as evading cell death and indefinite growth [1]. If these DNA mutations are in coding regions, they are translated to mutated proteins. The epitopes that contain these mutations are called neoantigens. Neoantigens are highly tumor-specific and can be targeted with immunotherapies [2]. During cell division, tumor suppressor genes play a role in the case of DNA damage or replication errors. The p53 protein is a tumor suppressor gene product that prevents tumor formation by activating processes that block cell division when DNA damage has occurred [3]. Mutant p53 does not effectively bind DNA or activate the production of proteins necessary for the stop signal. This project explored a hypothesis that a set of distinct p53 protein mutations can be selected to serve as potential targets for cancer immunotherapy and vaccines by using immunoinformatics predictive analysis tools. By comparing these potential targets with experimental results, we can predict epitopes that may serve as neoantigen targets for immunotherapy. We identified candidate immunogenic epitopes using the NCI’s TP53 Database (NCI DB - tp53.isb-cgc.org), Cancer Epitope Database and Analysis Resource (CEDAR - cedar.iedb.org), and a powerful new bioinformatics tool (nextgen-tools.iedb.org/) [4] hosted by Immune Epitope Database (IEDB - iedb.org) and CEDAR.  Comparing predicted epitopes to highly mutable regions of p53 in tumor variants from NCI DB revealed areas of overlap that may be priority candidate epitopes for immunotherapy.  Experimental data from CEDAR tested the immunogenicity of normal and mutated protein versions to help avoid harmful cross-reactions. These results help predict cancer epitope amino acid sequences relevant to understanding the immune system's role in cancer progression, prevention, and treatment. These studies also set the stage for important subsequent undergraduate research projects to further characterize predicted cancer neoantigens. 
    more » « less
  4. The majority of lung cancer patients are diagnosed with metastatic disease. This study identified a set of 73 microRNAs (miRNAs) that classified lung cancer tumors from normal lung tissues with an overall accuracy of 96.3% in the training patient cohort (n = 109) and 91.7% in unsupervised classification and 92.3% in supervised classification in the validation set (n = 375). Based on association with patient survival (n = 1016), 10 miRNAs were identified as potential tumor suppressors (hsa-miR-144, hsa-miR-195, hsa-miR-223, hsa-miR-30a, hsa-miR-30b, hsa-miR-30d, hsa-miR-335, hsa-miR-363, hsa-miR-451, and hsa-miR-99a), and 4 were identified as potential oncogenes (hsa-miR-21, hsa-miR-31, hsa-miR-411, and hsa-miR-494) in lung cancer. Experimentally confirmed target genes were identified for the 73 diagnostic miRNAs, from which proliferation genes were selected from CRISPR-Cas9/RNA interference (RNAi) screening assays. Pansensitive and panresistant genes to 21 NCCN-recommended drugs with concordant mRNA and protein expression were identified. DGKE and WDR47 were found with significant associations with responses to both systemic therapies and radiotherapy in lung cancer. Based on our identified miRNA-regulated molecular machinery, an inhibitor of PDK1/Akt BX-912, an anthracycline antibiotic daunorubicin, and a multi-targeted protein kinase inhibitor midostaurin were discovered as potential repositioning drugs for treating lung cancer. These findings have implications for improving lung cancer diagnosis, optimizing treatment selection, and discovering new drug options for better patient outcomes. 
    more » « less
  5. The risk of developing cancer is correlated with body size and lifespan within species, but there is no correlation between cancer and either body size or lifespan between species indicating that large, long-lived species have evolved enhanced cancer protection mechanisms. Previously we showed that several large bodied Afrotherian lineages evolved reduced intrinsic cancer risk, particularly elephants and their extinct relatives ( Proboscideans ), coincident with pervasive duplication of tumor suppressor genes (Vazquez and Lynch, 2021). Unexpectedly, we also found that Xenarthrans (sloths, armadillos, and anteaters) evolved very low intrinsic cancer risk. Here, we show that: (1) several Xenarthran lineages independently evolved large bodies, long lifespans, and reduced intrinsic cancer risk; (2) the reduced cancer risk in the stem lineages of Xenarthra and Pilosa coincided with bursts of tumor suppressor gene duplications; (3) cells from sloths proliferate extremely slowly while Xenarthran cells induce apoptosis at very low doses of DNA damaging agents; and (4) the prevalence of cancer is extremely low Xenarthrans , and cancer is nearly absent from armadillos. These data implicate the duplication of tumor suppressor genes in the evolution of remarkably large body sizes and decreased cancer risk in Xenarthrans and suggest they are a remarkably cancer-resistant group of mammals. 
    more » « less