skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Topological data analysis of spatial patterning in heterogeneous cell populations: clustering and sorting with varying cell-cell adhesion
Abstract Different cell types aggregate and sort into hierarchical architectures during the formation of animal tissues. The resulting spatial organization depends (in part) on the strength of adhesion of one cell type to itself relative to other cell types. However, automated and unsupervised classification of these multicellular spatial patterns remains challenging, particularly given their structural diversity and biological variability. Recent developments based on topological data analysis are intriguing to reveal similarities in tissue architecture, but these methods remain computationally expensive. In this article, we show that multicellular patterns organized from two interacting cell types can be efficiently represented through persistence images. Our optimized combination of dimensionality reduction via autoencoders, combined with hierarchical clustering, achieved high classification accuracy for simulations with constant cell numbers. We further demonstrate that persistence images can be normalized to improve classification for simulations with varying cell numbers due to proliferation. Finally, we systematically consider the importance of incorporating different topological features as well as information about each cell type to improve classification accuracy. We envision that topological machine learning based on persistence images will enable versatile and robust classification of complex tissue architectures that occur in development and disease.  more » « less
Award ID(s):
2106566 2038039
PAR ID:
10462530
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
npj Systems Biology and Applications
Volume:
9
Issue:
1
ISSN:
2056-7189
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Spatiotemporal patterns in multicellular systems are important to understanding tissue dynamics, for instance, during embryonic development and disease. Here, we use a multiphase field model to study numerically the behavior of a near-confluent monolayer of deformable cells with intercellular friction. Varying friction and cell motility drives a solid–liquid transition, and near the transition boundary, we find the emergence of local nematic order of cell deformation driven by shear-aligning cellular flows. Intercellular friction contributes to the monolayer’s viscosity, which significantly increases the spatial correlation in the flow and, concomitantly, the extent of nematic order. We also show that local hexatic and nematic order are tightly coupled and propose a mechanical-geometric model for the colocalization of + 1 / 2 nematic defects and 5–7 disclination pairs, which are the structural defects in the hexatic phase. Such topological defects coincide with regions of high cell–cell overlap, suggesting that they may mediate cellular extrusion from the monolayer, as found experimentally. Our results delineate a mechanical basis for the recent observation of nematic and hexatic order in multicellular collectives in experiments and simulations and pinpoint a generic pathway to couple topological and physical effects in these systems. 
    more » « less
  2. In digital pathology, the spatial context of cells is important for cell classification, cancer diagnosis and prognosis. To model such complex cell context, however, is challenging. Cells form different mixtures, lineages, clusters and holes. To model such structural patterns in a learnable fashion, we introduce several mathematical tools from spatial statistics and topological data analysis. We incorporate such structural descriptors into a deep generative model as both conditional inputs and a differentiable loss. This way, we are able to generate high quality multi-class cell layouts for the first time. We show that the topology-rich cell layouts can be used for data augmentation and improve the performance of downstream tasks such as cell classification. 
    more » « less
  3. Within multicellular living systems, cells coordinate their positions with spatiotemporal accuracy to form various tissue structures and control development. These arrangements can be regulated by tissue geometry, biochemical cues, as well as mechanical perturbations. However, how cells pack during dynamic three-dimensional multicellular architectures formation remains unclear. Here, examining a growing spherical multicellular system, human lung alveolospheres, we observe an emergence of hexagonal packing order and a structural transition of cells that comprise the spherical epithelium. Surprisingly, the cell packing behavior on the spherical surface of lung alveolospheres resembles hard-disks packing on spheres, where the less deformable cell nuclei act as effective “hard disks” and prevent cells from getting too close. Nucleus-to-cell size ratio increases during lung spheroids growth; as a result, we find more hexagon-concentrated cellular packing with increasing bond orientational order. Furthermore, by osmotically changing the compactness of cells on alveolospheres, we observe a more ordered packing when nucleus-to-cell size ratio increases, and vice versa. These more ordered cell packing characteristics are consistent with reduced cell dynamics, together suggesting that better cellular packing stabilizes local cell neighborhoods and may regulate more complex biological functions such as cellular maturation and tissue morphogenesis. 
    more » « less
  4. The mammalian brain consists of an intricate tapestry of cell types, with diversity crucial for function that arises from both differential gene expression and circuit-specific anatomy. Yet, retrieving high-content gene-expression information while retaining 3D positional anatomy at cellular resolution has been difficult, limiting integrative understanding of brain structure and function. Here we introduce and apply a technology for 3D intact-tissue RNA sequencing, termed STARmap (Spatially-resolved Transcript Amplicon Readout Mapping), which integrates highly-specific signal amplification, novel hydrogel-tissue chemistry, and an error-reduction sequencing process. The capabilities of STARmap were tested by mapping from 160 to 1,020 distinct genes simultaneously in sections of mouse brain at single-cell resolution with unprecedented efficiency, accuracy and reproducibility. These experiments led to the discovery of multiple new neocortical cell types, with gene markers and spatial patterns of organization not previously described, by comparison of the molecularly-defined architectures of sensory versus cognitive neocortex, and by quantification of expression of activity-regulated genes as a function of stimulation condition, spatial position, and cell typology. By adapting STARmap to thick tissue blocks, we observed and confirmed a novel molecularly-defined gradient distribution of excitatory neuron subtypes across cubic millimeter-scale volumes (>30,000 cells), and discovered a short-range 3D pattern of self-clustering shared by many inhibitory neuron subtypes that was accurately identifiable with a 3D STARmap approach. 
    more » « less
  5. Abstract Spatially resolved transcriptomics technologies enable the measurement of transcriptome information while retaining the spatial context at the regional, cellular or sub-cellular level. While previous computational methods have relied on gene expression information alone for clustering single-cell populations, more recent methods have begun to leverage spatial location and histology information to improve cell clustering and cell-type identification. In this study, using seven semi-synthetic datasets with real spatial locations, simulated gene expression and histology images as well as ground truth cell-type labels, we evaluate 15 clustering methods based on clustering accuracy, robustness to data variation and input parameters, computational efficiency, and software usability. Our analysis demonstrates that even though incorporating the additional spatial and histology information leads to increased accuracy in some datasets, it does not consistently improve clustering compared with using only gene expression data. Our results indicate that for the clustering of spatial transcriptomics data, there are still opportunities to enhance the overall accuracy and robustness by improving information extraction and feature selection from spatial and histology data. 
    more » « less