skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Scan-IT: Domain segmentation of spatial transcriptomics images by graph neural network
Complex biological tissues consist of numerous cells in a highly coordinated manner and carry out various biological functions. Therefore, segmenting a tissue into spatial and functional domains is critically important for understanding and controlling the biological functions. The emerging spatial transcriptomic technologies allow simultaneous measurements of thousands of genes with precise spatial information, providing an unprecedented opportunity for dissecting biological tissues. However, how to utilize such noisy, sparse, and high dimensional data for tissue segmentation remains a major challenge. Here, we develop a deep learning-based method, named SCAN-IT by transforming the spatial domain identification problem into an image segmentation problem, with cells mimicking pixels and expression values of genes within a cell representing the color channels. Specifically, SCAN-IT relies on geometric modeling, graph neural networks, and an informatics approach, DeepGraphInfomax. We demonstrate that SCAN-IT can handle datasets from a wide range of spatial transcriptomics techniques, including the ones with high spatial resolution but low gene coverage as well as those with low spatial resolution but high gene coverage. We show that SCAN-IT outperforms state-of-the-art methods using a benchmark dataset with ground truth domain annotations.  more » « less
Award ID(s):
2007595 1949629
PAR ID:
10327626
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
British Machine Vision Conference
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Spatially-resolved RNA profiling has now been widely used to understand cells’ structural organizations and functional roles in tissues, yet it is challenging to reconstruct the whole spatial transcriptomes due to various inherent technical limitations in tissue section preparation and RNA capture and fixation in the application of the spatial RNA profiling technologies. Here, we introduce a graph-guided neural tensor decomposition (GNTD) model for reconstructing whole spatial transcriptomes in tissues. GNTD employs a hierarchical tensor structure and formulation to explicitly model the high-order spatial gene expression data with a hierarchical nonlinear decomposition in a three-layer neural network, enhanced by spatial relations among the capture spots and gene functional relations for accurate reconstruction from highly sparse spatial profiling data. Extensive experiments on 22 Visium spatial transcriptomics datasets and 3 high-resolution Stereo-seq datasets as well as simulation data demonstrate that GNTD consistently improves the imputation accuracy in cross-validations driven by nonlinear tensor decomposition and incorporation of spatial and functional information, and confirm that the imputed spatial transcriptomes provide a more complete gene expression landscape for downstream analyses of cell/spot clustering for tissue segmentation, and spatial gene expression clustering and visualizations. 
    more » « less
  2. Abstract Plant cells communicate information for the regulation of development and responses to external stresses. A key form of this communication is transcriptional regulation, accomplished via complex gene networks operating both locally and systemically. To fully understand how genes are regulated across plant tissues and organs, high resolution, multi-dimensional spatial transcriptional data must be acquired and placed within a cellular and organismal context. Spatial transcriptomics (ST) typically provides a two-dimensional spatial analysis of gene expression of tissue sections that can be stacked to render three-dimensional data. For example, X-ray and light-sheet microscopy provide sub-micron scale volumetric imaging of cellular morphology of tissues, organs, or potentially entire organisms. Linking these technologies could substantially advance transcriptomics in plant biology and other fields. Here, we review advances in ST and 3D microscopy approaches and describe how these technologies could be combined to provide high resolution, spatially organized plant tissue transcript mapping. 
    more » « less
  3. null (Ed.)
    Many genes that respond to infection have functions outside of immunity and have been found to be under natural selection. Pathogens may therefore incidentally alter nonimmune physiology through engagement with immune system genes. This raises a logical question of how genetically promiscuous the immune system is, here defined as how heavily cross-referenced the immune system is into other physiological systems. This work examined immune gene promiscuity across physiological systems in primates by assessing the baseline (unperturbed) expression of key tissue and cell types for differences, and primate genomes for signatures of selection. These efforts revealed “immune” gene expression to be cross-referenced extensively in other physiological systems in primates. When immune and nonimmune tissues diverge in expression, the differentially expressed genes at baseline are enriched for cell biological activities not immediately identifiable as immune function based. Individual comparisons of immune and nonimmune tissues in primates revealed low divergence in gene expression between tissues, with the exception of whole blood. Immune gene promiscuity increases over evolutionary time, with hominoids exhibiting the most cross-referencing of such genes among primates. An assessment of genetic sequences also found positive selection in the coding regions of differentially expressed genes between tissues functionally associated with immunity. This suggests that, with increasing promiscuity, divergent gene expression between the immune system and other physiological systems tends to be adaptive and enriched for immune functions in hominoids. 
    more » « less
  4. Many genes that respond to infection have functions outside of immunity and have been found to be under natural selection. Pathogens may therefore incidentally alter nonimmune physiology through engagement with immune system genes. This raises a logical question of how genetically promiscuous the immune system is, here defijined as how heavily cross-referenced the immune system is into other physiological systems. This work examined immune gene promiscuity across physiological systems in primates by assessing the baseline (unperturbed) expression of key tissue and cell types for diffferences, and primate genomes for signatures of selection. These effforts revealed “immune” gene expression to be cross-referenced extensively in other physiological systems in primates. When immune and nonim-mune tissues diverge in expression, the diffferentially expressed genes at baseline are enriched for cell biological activities not immediately identifijiable as immune function based. Individual comparisons of immune and nonimmune tissues in primates revealed low divergence in gene expression between tissues, with the exception of whole blood. Immune gene promiscuity increases over evolutionary time, with hominoids exhibiting the most cross-referencing of such genes among primates. An assessment of genetic sequences also found positive selection in the coding regions of diffferentially expressed genes between tissues functionally associated with immunity. This suggests that, with increasing promiscuity, divergent gene expression between the immune system and other physiological systems tends to be adaptive and enriched for immune functions in hominoids. 
    more » « less
  5. Abstract BackgroundThe eukaryotic genome is capable of producing multiple isoforms from a gene by alternative polyadenylation (APA) during pre-mRNA processing. APA in the 3′-untranslated region (3′-UTR) of mRNA produces transcripts with shorter or longer 3′-UTR. Often, 3′-UTR serves as a binding platform for microRNAs and RNA-binding proteins, which affect the fate of the mRNA transcript. Thus, 3′-UTR APA is known to modulate translation and provides a mean to regulate gene expression at the post-transcriptional level. Current bioinformatics pipelines have limited capability in profiling 3′-UTR APA events due to incomplete annotations and a low-resolution analyzing power: widely available bioinformatics pipelines do not reference actionable polyadenylation (cleavage) sites but simulate 3′-UTR APA only using RNA-seq read coverage, causing false positive identifications. To overcome these limitations, we developed APA-Scan, a robust program that identifies 3′-UTR APA events and visualizes the RNA-seq short-read coverage with gene annotations. MethodsAPA-Scan utilizes either predicted or experimentally validated actionable polyadenylation signals as a reference for polyadenylation sites and calculates the quantity of long and short 3′-UTR transcripts in the RNA-seq data. APA-Scan works in three major steps: (i) calculate the read coverage of the 3′-UTR regions of genes; (ii) identify the potential APA sites and evaluate the significance of the events among two biological conditions; (iii) graphical representation of user specific event with 3′-UTR annotation and read coverage on the 3′-UTR regions. APA-Scan is implemented in Python3. Source code and a comprehensive user’s manual are freely available athttps://github.com/compbiolabucf/APA-Scan. ResultAPA-Scan was applied to both simulated and real RNA-seq datasets and compared with two widely used baselines DaPars and APAtrap. In simulation APA-Scan significantly improved the accuracy of 3′-UTR APA identification compared to the other baselines. The performance of APA-Scan was also validated by 3′-end-seq data and qPCR on mouse embryonic fibroblast cells. The experiments confirm that APA-Scan can detect unannotated 3′-UTR APA events and improve genome annotation. ConclusionAPA-Scan is a comprehensive computational pipeline to detect transcriptome-wide 3′-UTR APA events. The pipeline integrates both RNA-seq and 3′-end-seq data information and can efficiently identify the significant events with a high-resolution short reads coverage plots. 
    more » « less