Single-cell sequencing provides a powerful approach for elucidating intratumor heterogeneity by resolving cell-to-cell variability. However, it also poses additional challenges including elevated error rates, allelic dropout and non-uniform coverage. A recently introduced single-cell-specific mutation detection algorithm leverages the evolutionary relationship between cells for denoising the data. However, due to its probabilistic nature, this method does not scale well with the number of cells. Here, we develop a novel combinatorial approach for utilizing the genealogical relationship of cells in detecting mutations from noisy single-cell sequencing data. Our method, called scVILP, jointly detects mutations in individual cells and reconstructs a perfect phylogeny among these cells. We employ a novel Integer Linear Program algorithm for deterministically and efficiently solving the joint inference problem. We show that scVILP achieves similar or better accuracy but significantly better runtime over existing methods on simulated data. We also applied scVILP to an empirical human cancer dataset from a high grade serous ovarian cancer patient.
more »
« less
This content will become publicly available on December 1, 2025
Unveiling cellular morphology: statistical analysis using a Riemannian elastic metric in cancer cell image datasets
Elastic metrics can provide a powerful tool to study the heterogeneity arising from cellular morphology. To assess their potential application (e.g. classifying cancer treated cells), we consider a specific instance of the elastic metric, the Square Root Velocity (SRV) metric and evaluate its performance against the linear metric for two datasets of osteosarcoma (bone cancer) cells including pharmacological treatments, and normal and cancerous breast cells. Our comparative statistical analysis shows superior performance of the SRV at capturing cell shape heterogeneity when comparing distance to the mean shapes, with better separation and interpretation between different cell groups. Secondly, when using multidimensional scaling (MDS) to find a low-dimensional embedding for unrescaled contours, we observe that while the linear metric better preserves original pairwise distances, the SRV yields better classification.
more »
« less
- Award ID(s):
- 2227605
- PAR ID:
- 10562670
- Publisher / Repository:
- Springer Nature
- Date Published:
- Journal Name:
- Information Geometry
- Volume:
- 7
- Issue:
- S2
- ISSN:
- 2511-2481
- Page Range / eLocation ID:
- 845 to 859
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Cellular heterogeneity, an inherent feature of biological systems, plays a critical role in processes such as development, immune response, and disease progression. Human mesenchymal stem cells (hMSCs) exemplify this heterogeneity due to their multi-lineage differentiation potential. However, their inherent variability complicates clinical use, and there is no universally accepted method for detecting and quantifying cell population heterogeneity. Dielectrophoresis (DEP) has emerged as a powerful electrokinetic technique for characterizing and manipulating cells based on their dielectric properties, offering label-free analysis capabilities. Quantitative information from the DEP spectrum, such as transient slope, measure cells’ transition between negative and positive DEP behaviors. In this study, we employed DEP to estimate transient slope of various cell populations, including relatively homogeneous HEK-293 cells, heterogeneous hMSCs, and cancer cells (PC3 and DU145). Our analysis encompassed hMSCs derived from bone marrow, adipose, and umbilical cord tissue, to capture tissue-specific heterogeneity. Transient slope was assessed using two methods, involving linear trendline fitting to different low-frequency regions of the DEP spectrum. We found that transient slope serves as a reliable indicator of cell population heterogeneity, with more heterogeneous populations exhibiting lower transient slopes and higher standard deviations. Validation using cell morphology, size, and stemness further supported the utility of transient slope as a heterogeneity metric. This label-free approach holds promise for advancing cell sorting, biomanufacturing, and personalized medicine.more » « less
-
Abstract The transcriptional plasticity of cancer cells promotes intercellular heterogeneity in response to anticancer drugs and facilitates the generation of subpopulation surviving cells. Characterizing single-cell transcriptional heterogeneity after drug treatments can provide mechanistic insights into drug efficacy. Here, we used single-cell RNA-seq to examine transcriptomic profiles of cancer cells treated with paclitaxel, celecoxib and the combination of the two drugs. By normalizing the expression of endogenous genes to spike-in molecules, we found that cellular mRNA abundance shows dynamic regulation after drug treatment. Using a random forest model, we identified gene signatures classifying single cells into three states: transcriptional repression, amplification and control-like. Treatment with paclitaxel or celecoxib alone generally repressed gene transcription across single cells. Interestingly, the drug combination resulted in transcriptional amplification and hyperactivation of mitochondrial oxidative phosphorylation pathway linking to enhanced cell killing efficiency. Finally, we identified a regulatory module enriched with metabolism and inflammation-related genes activated in a subpopulation of paclitaxel-treated cells, the expression of which predicted paclitaxel efficacy across cancer cell lines and in vivo patient samples. Our study highlights the dynamic global transcriptional activity driving single-cell heterogeneity during drug response and emphasizes the importance of adding spike-in molecules to study gene expression regulation using single-cell RNA-seq.more » « less
-
Inevitably, almost all cancer patients develop resistance to targeted therapy. Intratumour heterogeneity is a major cause of drug resistance. Mathematical models that explain experiments quantitatively are useful in understanding the origin of intratumour heterogeneity, which then could be used to explore scenarios for efficacious therapy. Here, we develop a mathematical model to investigate intratumour heterogeneity in breast cancer by exploiting the observation that HER2+ and HER2− cells could divide symmetrically or asymmetrically. Our predictions for the evolution of cell fractions are in quantitative agreement with single-cell experiments. Remarkably, the colony size of HER2+ cells emerging from a single HER2− cell (or vice versa), which occurs in about four cell doublings, also agrees with experimental results, without tweaking any parameter in the model. The theory explains experimental data on the responses of breast tumours under different treatment protocols. We then used the model to predict that, not only the order of two drugs, but also the treatment period for each drug and the tumour cell plasticity could be manipulated to improve the treatment efficacy. Mathematical models, when integrated with data on patients, make possible exploration of a broad range of parameters readily, which might provide insights in devising effective therapies.more » « less
-
Abstract Breast cancer is a heterogenous disease that can be classified into multiple subtypes including the most aggressive basal-like and triple-negative subtypes. Understanding the heterogeneity within the normal mammary basal epithelial cells holds the key to inform us about basal-like cancer cell differentiation dynamics as well as potential cells of origin. Although it is known that the mammary basal compartment contains small pools of stem cells that fuel normal tissue morphogenesis and regeneration, a comprehensive yet focused analysis of the transcriptional makeup of the basal cells is lacking. We used single-cell RNA-sequencing and multiplexed RNA in-situ hybridization to characterize mammary basal cell heterogeneity. We used bioinformatic and computational pipelines to characterize the molecular features as well as predict differentiation dynamics and cell–cell communications of the newly identified basal cell states. We used genetic cell labeling to map the in vivo fates of cells in one of these states. We identified four major distinct transcriptional states within the mammary basal cells that exhibit gene expression signatures suggestive of different functional activity and metabolic preference. Our in vivo labeling and ex vivo organoid culture data suggest that one of these states, marked by Egr2 expression, represents a dynamic transcriptional state that all basal cells transit through during pubertal mammary morphogenesis. Our study provides a systematic approach to understanding the molecular heterogeneity of mammary basal cells and identifies previously unknown dynamics of basal cell transcriptional states.more » « less
An official website of the United States government
