skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Single-Cell Multi-Modal GAN (scMMGAN) reveals spatial patterns in single-cell data from triple negative breast cancer
Exciting advances in technologies to measure biological systems are currently at the forefront of research. The ability to gather data along an increasing number of omic dimensions has created a need for tools to analyze all of this information together, rather than siloing each technology into separate analysis pipelines. To advance this goal, we introduce a framework called the Single-Cell Multi-Modal GAN (scMMGAN) that integrates data from multiple modalities into a unified representation in the ambient data space for downstream analysis using a combination of adversarial learning and data geometry techniques. The framework’s key improvement is an additional diffusion geometry loss with a new kernel that constrains the otherwise over-parameterized GAN network. We demonstrate scMMGAN’s ability to produce more meaningful alignments than alternative methods on a wide variety of data modalities, and that its output can be used to draw conclusions from real-world biological experimental data. We highlight data from an experiment studying the development of triple negative breast cancer, where we show how scMMGAN can be used to identify novel gene associations and we demonstrate that cell clusters identified only on the scRNAseq data occur in localized spatial patterns that reveal insights on the spatial transcriptomic images.  more » « less
Award ID(s):
2047856
PAR ID:
10352699
Author(s) / Creator(s):
Date Published:
Journal Name:
Patterns
ISSN:
2666-3899
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Multi-modal single cell RNA assays capture RNA content as well as other data modalities, such as spatial cell position or the electrophysiological properties of cells. Compared to dedicated scRNA-seq assays however, they may unintentionally capture RNA from multiple adjacent cells, exhibit lower RNA sequencing depth compared to scRNA-seq, or lack genome-wide RNA measurements. We present scProjection, a method for mapping individual multi-modal RNA measurements to deeply sequenced scRNA-seq atlases to extract cell type-specific, single cell gene expression profiles. We demonstrate several use cases of scProjection, including the identification of spatial motifs from spatial transcriptome assays, distinguishing RNA contributions from neighboring cells in both spatial and multi-modal single cell assays, and imputing expression measurements of un-measured genes from gene markers. scProjection therefore combines the advantages of both multi-modal and scRNA-seq assays to yield precise multi-modal measurements of single cells. 
    more » « less
  2. Abstract Recently, lineage tracing technology using CRISPR/Cas9 genome editing has enabled simultaneous readouts of gene expressions and lineage barcodes, which allows for the reconstruction of the cell division tree and makes it possible to reconstruct ancestral cell types and trace the origin of each cell type. Meanwhile, trajectory inference methods are widely used to infer cell trajectories and pseudotime in a dynamic process using gene expression data of present-day cells. Here, we present TedSim (single-cell temporal dynamics simulator), which simulates the cell division events from the root cell to present-day cells, simultaneously generating two data modalities for each single cell: the lineage barcode and gene expression data. TedSim is a framework that connects the two problems: lineage tracing and trajectory inference. Using TedSim, we conducted analysis to show that (i) TedSim generates realistic gene expression and barcode data, as well as realistic relationships between these two data modalities; (ii) trajectory inference methods can recover the underlying cell state transition mechanism with balanced cell type compositions; and (iii) integrating gene expression and barcode data can provide more insights into the temporal dynamics in cell differentiation compared to using only one type of data, but better integration methods need to be developed. 
    more » « less
  3. Abstract Neural communication networks form the fundamental basis for brain function. These communication networks are enabled by emitted ligands such as neurotransmitters, which activate receptor complexes to facilitate communication. Thus, neural communication is fundamentally dependent on the transcriptome. Here we develop NeuronChat, a method and package for the inference, visualization and analysis of neural-specific communication networks among pre-defined cell groups using single-cell expression data. We incorporate a manually curated molecular interaction database of neural signaling for both human and mouse, and benchmark NeuronChat on several published datasets to validate its ability in predicting neural connectivity. Then, we apply NeuronChat to three different neural tissue datasets to illustrate its functionalities in identifying interneural communication networks, revealing conserved or context-specific interactions across different biological contexts, and predicting communication pattern changes in diseased brains with autism spectrum disorder. Finally, we demonstrate NeuronChat can utilize spatial transcriptomics data to infer and visualize neural-specific cell-cell communication. 
    more » « less
  4. Abstract Multimodal single-cell sequencing technologies provide unprecedented information on cellular heterogeneity from multiple layers of genomic readouts. However, joint analysis of two modalities without properly handling the noise often leads to overfitting of one modality by the other and worse clustering results than vanilla single-modality analysis. How to efficiently utilize the extra information from single cell multi-omics to delineate cell states and identify meaningful signal remains as a significant computational challenge. In this work, we propose a deep learning framework, named SAILERX, for efficient, robust, and flexible analysis of multi-modal single-cell data. SAILERX consists of a variational autoencoder with invariant representation learning to correct technical noises from sequencing process, and a multimodal data alignment mechanism to integrate information from different modalities. Instead of performing hard alignment by projecting both modalities to a shared latent space, SAILERX encourages the local structures of two modalities measured by pairwise similarities to be similar. This strategy is more robust against overfitting of noises, which facilitates various downstream analysis such as clustering, imputation, and marker gene detection. Furthermore, the invariant representation learning part enables SAILERX to perform integrative analysis on both multi- and single-modal datasets, making it an applicable and scalable tool for more general scenarios. 
    more » « less
  5. null (Ed.)
    Glass nanopipettes have shown promise for applications in single-cell manipulation, analysis, and imaging. In recent years, plasmonic nanopipettes have been developed to enable surface-enhanced Raman spectroscopy (SERS) measurements for single-cell analysis. In this work, we developed a SERS-active nanopipette that can be used to perform long-term and reliable intracellular analysis of single living cells with minimal damage, which is achieved by optimizing the nanopipette geometry and the surface density of the gold nanoparticle (AuNP) layer at the nanopipette tip. To demonstrate its ability in single-cell analysis, we used the nanopipette for intracellular pH sensing. Intracellular pH (pH i ) is vital to cells as it influences cell function and behavior and pathological conditions. The pH sensitivity was realized by simply modifying the AuNP layer with the pH reporter molecule 4-mercaptobenzoic acid. With a response time of less than 5 seconds, the pH sensing range is from 6.0 to 8.0 and the maximum sensitivity is 0.2 pH units. We monitored the pH i change of individual HeLa and fibroblast cells, triggered by the extracellular pH (pH e ) change. The HeLa cancer cells can better resist pH e change and adapt to the weak acidic environment. Plasmonic nanopipettes can be further developed to monitor other intracellular biomarkers. 
    more » « less