skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 11:00 PM ET on Friday, November 14 until 2:00 AM ET on Saturday, November 15 due to maintenance. We apologize for the inconvenience.


Title: Accurate integration of single-cell DNA and RNA for analyzing intratumor heterogeneity using MaCroDNA
Abstract Cancers develop and progress as mutations accumulate, and with the advent of single-cell DNA and RNA sequencing, researchers can observe these mutations and their transcriptomic effects and predict proteomic changes with remarkable temporal and spatial precision. However, to connect genomic mutations with their transcriptomic and proteomic consequences, cells with either only DNA data or only RNA data must be mapped to a common domain. For this purpose, we present MaCroDNA, a method that uses maximum weighted bipartite matching of per-gene read counts from single-cell DNA and RNA-seq data. Using ground truth information from colorectal cancer data, we demonstrate the advantage of MaCroDNA over existing methods in accuracy and speed. Exemplifying the utility of single-cell data integration in cancer research, we suggest, based on results derived using MaCroDNA, that genomic mutations of large effect size increasingly contribute to differential expression between cells as Barrett’s esophagus progresses to esophageal cancer, reaffirming the findings of the previous studies.  more » « less
Award ID(s):
2106837
PAR ID:
10479511
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Nature Communications
Volume:
14
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Triple negative breast cancer (TNBC) is an aggressive type of breast cancer with very little treatment options. TNBC is very heterogeneous with large alterations in the genomic, transcriptomic, and proteomic landscapes leading to various subtypes with differing responses to therapeutic treatments. We applied a multi-omics data integration method to evaluate the correlation of important regulatory features in TNBC BRCA1 wild-type MDA-MB-231 and TNBC BRCA1 5382insC mutated HCC1937 cells compared with non-tumorigenic epithelial breast MCF10A cells. The data includes DNA methylation, RNAseq, protein, phosphoproteomics, and histone post-translational modification. Data integration methods identified regulatory features from each omics method that had greater than 80% positive correlation within each TNBC subtype. Key regulatory features at each omics level were identified distinguishing the three cell lines and were involved in important cancer related pathways such as TGFβ signaling, PI3K/AKT/mTOR, and Wnt/beta-catenin signaling. We observed overexpression of PTEN, which antagonizes the PI3K/AKT/mTOR pathway, and MYC, which downregulates the same pathway in the HCC1937 cells relative to the MDA-MB-231 cells. The PI3K/AKT/mTOR and Wnt/beta-catenin pathways are both downregulated in HCC1937 cells relative to MDA-MB-231 cells, which likely explains the divergent sensitivities of these cell lines to inhibitors of downstream signaling pathways. The DNA methylation and RNAseq data is freely available via GEO GSE171958 and the proteomics data is available via the ProteomeXchange PXD025238. 
    more » « less
  2. Abstract Lung cancer sequencing efforts have uncovered mutational signatures that are attributed to exposure to the cigarette smoke carcinogen benzo[a]pyrene. Benzo[a]pyrene metabolizes in cells to benzo[a]pyrene diol epoxide (BPDE) and reacts with guanine nucleotides to form bulky BPDE adducts. These DNA adducts block transcription and replication, compromising cell function and survival, and are repaired in human cells by the nucleotide excision repair pathway. Here, we applied high-resolution genomic assays to measure BPDE-induced damage formation and mutagenesis in human cells. We integrated the new damage and mutagenesis data with previous repair, DNA methylation, RNA expression, DNA replication, and chromatin component measurements in the same cell lines, along with lung cancer mutagenesis data. BPDE damage formation is significantly enhanced by DNA methylation and in accessible chromatin regions, including transcribed and early-replicating regions. Binding of transcription factors is associated primarily with reduced, but also enhanced damage formation, depending on the factor. While DNA methylation does not appear to influence repair efficiency, this repair was significantly elevated in accessible chromatin regions, which accumulated fewer mutations. Thus, when damage and repair drive mutagenesis in opposing directions, the final mutational patterns appear to be dictated by the efficiency of repair rather than the frequency of underlying damages. 
    more » « less
  3. Abstract Human cancers often re-express germline factors, yet their mechanistic role in oncogenesis and cancer progression remains unknown. Here we demonstrate that DEAD-box helicase 4 (DDX4), a germline factor and RNA helicase conserved in all multicellular organisms, contributes to increased cell motility and cisplatin-mediated drug resistance in small cell lung cancer (SCLC) cells. Proteomic analysis suggests that DDX4 expression upregulates proteins related to DNA repair and immune/inflammatory response. Consistent with these trends in cell lines, DDX4 depletion compromised in vivo tumor development while its overexpression enhanced tumor growth even after cisplatin treatment in nude mice. Further, the relatively higher DDX4 expression in SCLC patients correlates with decreased survival and shows increased expression of immune/inflammatory response markers. Taken together, we propose that DDX4 increases SCLC cell survival, by increasing the DNA damage and immune response pathways, especially under challenging conditions such as cisplatin treatment. 
    more » « less
  4. Abstract The transcriptional plasticity of cancer cells promotes intercellular heterogeneity in response to anticancer drugs and facilitates the generation of subpopulation surviving cells. Characterizing single-cell transcriptional heterogeneity after drug treatments can provide mechanistic insights into drug efficacy. Here, we used single-cell RNA-seq to examine transcriptomic profiles of cancer cells treated with paclitaxel, celecoxib and the combination of the two drugs. By normalizing the expression of endogenous genes to spike-in molecules, we found that cellular mRNA abundance shows dynamic regulation after drug treatment. Using a random forest model, we identified gene signatures classifying single cells into three states: transcriptional repression, amplification and control-like. Treatment with paclitaxel or celecoxib alone generally repressed gene transcription across single cells. Interestingly, the drug combination resulted in transcriptional amplification and hyperactivation of mitochondrial oxidative phosphorylation pathway linking to enhanced cell killing efficiency. Finally, we identified a regulatory module enriched with metabolism and inflammation-related genes activated in a subpopulation of paclitaxel-treated cells, the expression of which predicted paclitaxel efficacy across cancer cell lines and in vivo patient samples. Our study highlights the dynamic global transcriptional activity driving single-cell heterogeneity during drug response and emphasizes the importance of adding spike-in molecules to study gene expression regulation using single-cell RNA-seq. 
    more » « less
  5. Abstract BackgroundTumour progression relies on the ability of cancer cells to penetrate and invade neighbouring tissues. E-cadherin loss is associated with increased cell invasion in gastric carcinoma, and germline mutations of the E-cadherin gene are causative of hereditary diffuse gastric cancer. Although E-cadherin dysfunction impacts cell–cell adhesion, cell dissemination also requires an imbalance of adhesion to the extracellular matrix (ECM). MethodsTo identify ECM components and receptors relevant for adhesion of E-cadherin dysfunctional cells, we implemented a novel ECM microarray platform coupled with molecular interaction networks. The functional role of putative candidates was determined by combining micropattern traction microscopy, protein modulation and in vivo approaches, as well as transcriptomic data of 262 gastric carcinoma samples, retrieved from the cancer genome atlas (TCGA). ResultsHere, we show that E-cadherin mutations induce an abnormal interplay of cells with specific components of the ECM, which encompasses increased traction forces and Integrin β1 activation. Integrin β1 synergizes with E-cadherin dysfunction, promoting cell scattering and invasion. The significance of the E-cadherin-Integrin β1 crosstalk was validated inDrosophilamodels and found to be consistent with evidence from human gastric carcinomas, where increased tumour grade and poor survival are associated with low E-cadherin and high Integrin β1 levels. ConclusionsIntegrin β1 is a key mediator of invasion in carcinomas with E-cadherin impairment and should be regarded as a biomarker of poor prognosis in gastric cancer. 
    more » « less