skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Reconstructing data-driven governing equations for cell phenotypic transitions: integration of data science and systems biology
Abstract Cells with the same genome can exist in different phenotypes and can change between distinct phenotypes when subject to specific stimuli and microenvironments. Some examples include cell differentiation during development, reprogramming for induced pluripotent stem cells and transdifferentiation, cancer metastasis and fibrosis progression. The regulation and dynamics of cell phenotypic conversion is a fundamental problem in biology, and has a long history of being studied within the formalism of dynamical systems. A main challenge for mechanism-driven modeling studies is acquiring sufficient amount of quantitative information for constraining model parameters. Advances in quantitative experimental approaches, especially high throughput single-cell techniques, have accelerated the emergence of a new direction for reconstructing the governing dynamical equations of a cellular system from quantitative single-cell data, beyond the dominant statistical approaches. Here I review a selected number of recent studies using live- and fixed-cell data and provide my perspective on future development.  more » « less
Award ID(s):
2205148
PAR ID:
10409774
Author(s) / Creator(s):
Date Published:
Journal Name:
Physical Biology
Volume:
19
Issue:
6
ISSN:
1478-3967
Page Range / eLocation ID:
061001
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Comprehensive and accurate comparisons of transcriptomic distributions of cells from samples taken from two different biological states, such as healthy versus diseased individuals, are an emerging challenge in single-cell RNA sequencing (scRNA-seq) analysis. Current methods for detecting differentially abundant (DA) subpopulations between samples rely heavily on initial clustering of all cells in both samples. Often, this clustering step is inadequate since the DA subpopulations may not align with a clear cluster structure, and important differences between the two biological states can be missed. Here, we introduce DA-seq, a targeted approach for identifying DA subpopulations not restricted to clusters. DA-seq is a multiscale method that quantifies a local DA measure for each cell, which is computed from its k nearest neighboring cells across a range of k values. Based on this measure, DA-seq delineates contiguous significant DA subpopulations in the transcriptomic space. We apply DA-seq to several scRNA-seq datasets and highlight its improved ability to detect differences between distinct phenotypes in severe versus mildly ill COVID-19 patients, melanomas subjected to immune checkpoint therapy comparing responders to nonresponders, embryonic development at two time points, and young versus aging brain tissue. DA-seq enabled us to detect differences between these phenotypes. Importantly, we find that DA-seq not only recovers the DA cell types as discovered in the original studies but also reveals additional DA subpopulations that were not described before. Analysis of these subpopulations yields biological insights that would otherwise be undetected using conventional computational approaches. 
    more » « less
  2. Abstract The recurrence of cancer following chemotherapy treatment is a major cause of death across solid and hematologic cancers. In B-cell acute lymphoblastic leukemia (B-ALL), relapse after initial chemotherapy treatment leads to poor patient outcomes. Here we test the hypothesis that chemotherapy-treated versus control B-ALL cells can be characterized based on cellular physical phenotypes. To quantify physical phenotypes of chemotherapy-treated leukemia cells, we use cells derived from B-ALL patients that are treated for 7 days with a standard multidrug chemotherapy regimen of vincristine, dexamethasone, and L-asparaginase (VDL). We conduct physical phenotyping of VDL-treated versus control cells by tracking the sequential deformations of single cells as they flow through a series of micron-scale constrictions in a microfluidic device; we call this method Quantitative Cyclical Deformability Cytometry. Using automated image analysis, we extract time-dependent features of deforming cells including cell size and transit time (TT) with single-cell resolution. Our findings show that VDL-treated B-ALL cells have faster TTs and transit velocity than control cells, indicating that VDL-treated cells are more deformable. We then test how effectively physical phenotypes can predict the presence of VDL-treated cells in mixed populations of VDL-treated and control cells using machine learning approaches. We find that TT measurements across a series of sequential constrictions can enhance the classification accuracy of VDL-treated cells in mixed populations using a variety of classifiers. Our findings suggest the predictive power of cell physical phenotyping as a complementary prognostic tool to detect the presence of cells that survive chemotherapy treatment. Ultimately such complementary physical phenotyping approaches could guide treatment strategies and therapeutic interventions. Insight box Cancer cells that survive chemotherapy treatment are major contributors to patient relapse, but the ability to predict recurrence remains a challenge. Here we investigate the physical properties of leukemia cells that survive treatment with chemotherapy drugs by deforming individual cells through a series of micron-scale constrictions in a microfluidic channel. Our findings reveal that leukemia cells that survive chemotherapy treatment are more deformable than control cells. We further show that machine learning algorithms applied to physical phenotyping data can predict the presence of cells that survive chemotherapy treatment in a mixed population. Such an integrated approach using physical phenotyping and machine learning could be valuable to guide patient treatments. 
    more » « less
  3. Wren, Jonathan (Ed.)
    Abstract MotivationThe rapid advance in single-cell RNA sequencing (scRNA-seq) technology over the past decade has provided a rich resource of gene expression profiles of single cells measured on patients, facilitating the study of many biological questions at the single-cell level. One intriguing research is to study the single cells which play critical roles in the phenotypes of patients, which has the potential to identify those cells and genes driving the disease phenotypes. To this end, deep learning models are expected to well encode the single-cell information and achieve precise prediction of patients’ phenotypes using scRNA-seq data. However, we are facing critical challenges in designing deep learning models for classifying patient samples due to (i) the samples collected in the same dataset contain a variable number of cells—some samples might only have hundreds of cells sequenced while others could have thousands of cells, and (ii) the number of samples available is typically small and the expression profile of each cell is noisy and extremely high-dimensional. Moreover, the black-box nature of existing deep learning models makes it difficult for the researchers to interpret the models and extract useful knowledge from them. ResultsWe propose a prototype-based and cell-informed model for patient phenotype classification, termed ProtoCell4P, that can alleviate problems of the sample scarcity and the diverse number of cells by leveraging the cell knowledge with representatives of cells (called prototypes), and precisely classify the patients by adaptively incorporating information from different cells. Moreover, this classification process can be explicitly interpreted by identifying the key cells for decision making and by further summarizing the knowledge of cell types to unravel the biological nature of the classification. Our approach is explainable at the single-cell resolution which can identify the key cells in each patient’s classification. The experimental results demonstrate that our proposed method can effectively deal with patient classifications using single-cell data and outperforms the existing approaches. Furthermore, our approach is able to uncover the association between cell types and biological classes of interest from a data-driven perspective. Availability and implementationhttps://github.com/Teddy-XiongGZ/ProtoCell4P. 
    more » « less
  4. Abstract Advances in single-cell technologies allow scrutinizing of heterogeneous cell states, however, detecting cell-state transitions from snap-shot single-cell transcriptome data remains challenging. To investigate cells with transient properties or mixed identities, we present MuTrans, a method based on multiscale reduction technique to identify the underlying stochastic dynamics that prescribes cell-fate transitions. By iteratively unifying transition dynamics across multiple scales, MuTrans constructs the cell-fate dynamical manifold that depicts progression of cell-state transitions, and distinguishes stable and transition cells. In addition, MuTrans quantifies the likelihood of all possible transition trajectories between cell states using coarse-grained transition path theory. Downstream analysis identifies distinct genes that mark the transient states or drive the transitions. The method is consistent with the well-established Langevin equation and transition rate theory. Applying MuTrans to datasets collected from five different single-cell experimental platforms, we show its capability and scalability to robustly unravel complex cell fate dynamics induced by transition cells in systems such as tumor EMT, iPSC differentiation and blood cell differentiation. Overall, our method bridges data-driven and model-based approaches on cell-fate transitions at single-cell resolution. 
    more » « less
  5. Abstract BackgroundCurrent methods for analyzing single-cell datasets have relied primarily on static gene expression measurements to characterize the molecular state of individual cells. However, capturing temporal changes in cell state is crucial for the interpretation of dynamic phenotypes such as the cell cycle, development, or disease progression. RNA velocity infers the direction and speed of transcriptional changes in individual cells, yet it is unclear how these temporal gene expression modalities may be leveraged for predictive modeling of cellular dynamics. ResultsHere, we present the first task-oriented benchmarking study that investigates integration of temporal sequencing modalities for dynamic cell state prediction. We benchmark ten integration approaches on ten datasets spanning different biological contexts, sequencing technologies, and species. We find that integrated data more accurately infers biological trajectories and achieves increased performance on classifying cells according to perturbation and disease states. Furthermore, we show that simple concatenation of spliced and unspliced molecules performs consistently well on classification tasks and can be used over more memory intensive and computationally expensive methods. ConclusionsThis work illustrates how integrated temporal gene expression modalities may be leveraged for predicting cellular trajectories and sample-associated perturbation and disease phenotypes. Additionally, this study provides users with practical recommendations for task-specific integration of single-cell gene expression modalities. 
    more » « less