skip to main content


Title: Graph-based autoencoder integrates spatial transcriptomics with chromatin images and identifies joint biomarkers for Alzheimer’s disease
Abstract

Tissue development and disease lead to changes in cellular organization, nuclear morphology, and gene expression, which can be jointly measured by spatial transcriptomic technologies. However, methods for jointly analyzing the different spatial data modalities in 3D are still lacking. We present a computational framework to integrate Spatial Transcriptomic data using over-parameterized graph-based Autoencoders with Chromatin Imaging data (STACI) to identify molecular and functional alterations in tissues. STACI incorporates multiple modalities in a single representation for downstream tasks, enables the prediction of spatial transcriptomic data from nuclear images in unseen tissue sections, and provides built-in batch correction of gene expression and tissue morphology through over-parameterization. We apply STACI to analyze the spatio-temporal progression of Alzheimer’s disease and identify the associated nuclear morphometric and coupled gene expression features. Collectively, we demonstrate the importance of characterizing disease progression by integrating multiple data modalities and its potential for the discovery of disease biomarkers.

 
more » « less
Award ID(s):
1651995
NSF-PAR ID:
10383902
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Nature Communications
Volume:
13
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    TheBIN1locus contains the second-most significant genetic risk factor for late-onset Alzheimer’s disease.BIN1undergoes alternate splicing to generate tissue- and cell-type-specific BIN1 isoforms, which regulate membrane dynamics in a range of crucial cellular processes. Whilst the expression of BIN1 in the brain has been characterized in neurons and oligodendrocytes in detail, information regarding microglial BIN1 expression is mainly limited to large-scale transcriptomic and proteomic data. Notably, BIN1 protein expression and its functional roles in microglia, a cell type most relevant to Alzheimer’s disease, have not been examined in depth.

    Methods

    Microglial BIN1 expression was analyzed by immunostaining mouse and human brain, as well as by immunoblot and RT-PCR assays of isolated microglia or human iPSC-derived microglial cells.Bin1expression was ablated by siRNA knockdown in primary microglial cultures in vitro and Cre-lox mediated conditional deletion in adult mouse brain microglia in vivo. Regulation of neuroinflammatory microglial signatures by BIN1 in vitro and in vivo was characterized using NanoString gene panels and flow cytometry methods. The transcriptome data was explored by in silico pathway analysis and validated by complementary molecular approaches.

    Results

    Here, we characterized microglial BIN1 expression in vitro and in vivo and ascertained microglia expressed BIN1 isoforms. By silencingBin1expression in primary microglial cultures, we demonstrate that BIN1 regulates the activation of proinflammatory and disease-associated responses in microglia as measured by gene expression and cytokine production. Our transcriptomic profiling revealed key homeostatic and lipopolysaccharide (LPS)-induced inflammatory response pathways, as well as transcription factors PU.1 and IRF1 that are regulated by BIN1. Microglia-specificBin1conditional knockout in vivo revealed novel roles of BIN1 in regulating the expression of disease-associated genes while counteracting CX3CR1 signaling. The consensus from in vitro and in vivo findings showed that loss ofBin1impaired the ability of microglia to mount type 1 interferon responses to proinflammatory challenge, particularly the upregulation of a critical type 1 immune response gene,Ifitm3.

    Conclusions

    Our convergent findings provide novel insights into microglial BIN1 function and demonstrate an essential role of microglial BIN1 in regulating brain inflammatory response and microglial phenotypic changes. Moreover, for the first time, our study shows a regulatory relationship betweenBin1andIfitm3, two Alzheimer’s disease-related genes in microglia. The requirement for BIN1 to regulateIfitm3upregulation during inflammation has important implications for inflammatory responses during the pathogenesis and progression of many neurodegenerative diseases.

    Graphical Abstract 
    more » « less
  2. Abstract Background

    Current methods for analyzing single-cell datasets have relied primarily on static gene expression measurements to characterize the molecular state of individual cells. However, capturing temporal changes in cell state is crucial for the interpretation of dynamic phenotypes such as the cell cycle, development, or disease progression. RNA velocity infers the direction and speed of transcriptional changes in individual cells, yet it is unclear how these temporal gene expression modalities may be leveraged for predictive modeling of cellular dynamics.

    Results

    Here, we present the first task-oriented benchmarking study that investigates integration of temporal sequencing modalities for dynamic cell state prediction. We benchmark ten integration approaches on ten datasets spanning different biological contexts, sequencing technologies, and species. We find that integrated data more accurately infers biological trajectories and achieves increased performance on classifying cells according to perturbation and disease states. Furthermore, we show that simple concatenation of spliced and unspliced molecules performs consistently well on classification tasks and can be used over more memory intensive and computationally expensive methods.

    Conclusions

    This work illustrates how integrated temporal gene expression modalities may be leveraged for predicting cellular trajectories and sample-associated perturbation and disease phenotypes. Additionally, this study provides users with practical recommendations for task-specific integration of single-cell gene expression modalities.

     
    more » « less
  3. Abstract

    Single-cell technologies characterize complex cell populations across multiple data modalities at unprecedented scale and resolution. Multi-omic data for single cell gene expression, in situ hybridization, or single cell chromatin states are increasingly available across diverse tissue types. When isolating specific cell types from a sample of disassociated cells or performing in situ sequencing in collections of heterogeneous cells, one challenging task is to select a small set of informative markers that robustly enable the identification and discrimination of specific cell types or cell states as precisely as possible. Given single cell RNA-seq data and a set of cellular labels to discriminate, scGeneFit selects gene markers that jointly optimize cell label recovery using label-aware compressive classification methods. This results in a substantially more robust and less redundant set of markers than existing methods, most of which identify markers that separate each cell label from the rest. When applied to a data set given a hierarchy of cell types as labels, the markers found by our method improves the recovery of the cell type hierarchy with fewer markers than existing methods using a computationally efficient and principled optimization.

     
    more » « less
  4. Abstract

    Spatial gene expression in tissue is characterized by regions in which particular genes are enriched or depleted. Frequently, these regions contain nested inside them subregions with distinct expression patterns. Segmentation methods in spatial transcriptomic (ST) data extract disjoint regions maximizing similarity over the greatest number of genes, typically on a particular spatial scale, thus lacking the ability to find region-within-region structure. We present NeST, which extracts spatial structure through coexpression hotspots—regions exhibiting localized spatial coexpression of some set of genes. Coexpression hotspots identify structure on any spatial scale, over any possible subset of genes, and are highly explainable. NeST also performs spatial analysis of cell-cell interactions via ligand-receptor, identifying active areas de novo without restriction of cell type or other groupings, in both two and three dimensions. Through application on ST datasets of varying type and resolution, we demonstrate the ability of NeST to reveal a new level of biological structure.

     
    more » « less
  5. Abstract Background Large-scale genome-wide association studies have successfully identified many genetic variants significantly associated with Alzheimer’s disease (AD), such as rs429358, rs11038106, rs723804, rs13591776, and more. The next key step is to understand the function of these SNPs and the downstream biology through which they exert the effect on the development of AD. However, this remains a challenging task due to the tissue-specific nature of transcriptomic and proteomic data and the limited availability of brain tissue.In this paper, instead of using coupled transcriptomic data, we performed an integrative analysis of existing GWAS findings and expression quantitative trait loci (eQTL) results from AD-related brain regions to estimate the transcriptomic alterations in AD brain. Results We used summary-based mendelian randomization method along with heterogeneity in dependent instruments method and were able to identify 32 genes with potential altered levels in temporal cortex region. Among these, 10 of them were further validated using real gene expression data collected from temporal cortex region, and 19 SNPs from NECTIN and TOMM40 genes were found associated with multiple temporal cortex imaging phenotype. Conclusion Significant pathways from enriched gene networks included neutrophil degranulation, Cell surface interactions at the vascular wall, and Regulation of TP53 activity which are still relatively under explored in Alzheimer’s Disease while also encouraging a necessity to bind further trans-eQTL effects into this integrative analysis. 
    more » « less