Abstract BackgroundComputational cell type deconvolution enables the estimation of cell type abundance from bulk tissues and is important for understanding tissue microenviroment, especially in tumor tissues. With rapid development of deconvolution methods, many benchmarking studies have been published aiming for a comprehensive evaluation for these methods. Benchmarking studies rely on cell-type resolved single-cell RNA-seq data to create simulated pseudobulk datasets by adding individual cells-types in controlled proportions. ResultsIn our work, we show that the standard application of this approach, which uses randomly selected single cells, regardless of the intrinsic difference between them, generates synthetic bulk expression values that lack appropriate biological variance. We demonstrate why and how the current bulk simulation pipeline with random cells is unrealistic and propose a heterogeneous simulation strategy as a solution. The heterogeneously simulated bulk samples match up with the variance observed in real bulk datasets and therefore provide concrete benefits for benchmarking in several ways. We demonstrate that conceptual classes of deconvolution methods differ dramatically in their robustness to heterogeneity with reference-free methods performing particularly poorly. For regression-based methods, the heterogeneous simulation provides an explicit framework to disentangle the contributions of reference construction and regression methods to performance. Finally, we perform an extensive benchmark of diverse methods across eight different datasets and find BayesPrism and a hybrid MuSiC/CIBERSORTx approach to be the top performers. ConclusionsOur heterogeneous bulk simulation method and the entire benchmarking framework is implemented in a user friendly packagehttps://github.com/humengying0907/deconvBenchmarkingandhttps://doi.org/10.5281/zenodo.8206516, enabling further developments in deconvolution methods. 
                        more » 
                        « less   
                    
                            
                            Inference and analysis of cell-cell communication using CellChat
                        
                    
    
            Abstract Understanding global communications among cells requires accurate representation of cell-cell signaling links and effective systems-level analyses of those links. We construct a database of interactions among ligands, receptors and their cofactors that accurately represent known heteromeric molecular complexes. We then develop CellChat, a tool that is able to quantitatively infer and analyze intercellular communication networks from single-cell RNA-sequencing (scRNA-seq) data. CellChat predicts major signaling inputs and outputs for cells and how those cells and signals coordinate for functions using network analysis and pattern recognition approaches. Through manifold learning and quantitative contrasts, CellChat classifies signaling pathways and delineates conserved and context-specific pathways across different datasets. Applying CellChat to mouse and human skin datasets shows its ability to extract complex signaling patterns. Our versatile and easy-to-use toolkit CellChat and a web-based Explorer (http://www.cellchat.org/) will help discover novel intercellular communications and build cell-cell communication atlases in diverse tissues. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 1763272
- PAR ID:
- 10214335
- Publisher / Repository:
- Nature Publishing Group
- Date Published:
- Journal Name:
- Nature Communications
- Volume:
- 12
- Issue:
- 1
- ISSN:
- 2041-1723
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract The endoplasmic reticulum (ER) houses sensors that respond to environmental stress and underly plants' adaptative responses. These sensors transduce signals that lead to changes in nuclear gene expression. The ER to nuclear signaling pathways are primarily attributed to the unfolded protein response (UPR) and are also integrated with a wide range of development, hormone, immune, and stress signaling pathways. Understanding the role of the UPR in signaling network mechanisms that associate with particular phenotypes is crucially important. While UPR‐associated genes are the subject of ongoing investigations in a few model plant systems, most remain poorly annotated, hindering the identification of candidates across plant species. This open‐source curated database provides a centralized resource of peer reviewed knowledge of ER to nuclear signaling pathways for the plant community. We provide a UPRome interactive viewer for users to navigate through the pathways and to access annotated information. The plant ER UPRome website is located athttp://uprome.tamu.edu. We welcome contributions from the researchers studying the ER UPR to incorporate additional genes into the database through the “contact us” page.more » « less
- 
            Abstract Cells make decisions through their communication with other cells and receiving signals from their environment. Using single-cell transcriptomics, computational tools have been developed to infer cell–cell communication through ligands and receptors. However, the existing methods only deal with signals sent by the measured cells in the data, the received signals from the external system are missing in the inference. Here, we present exFINDER, a method that identifies such external signals received by the cells in the single-cell transcriptomics datasets by utilizing the prior knowledge of signaling pathways. In particular, exFINDER can uncover external signals that activate the given target genes, infer the external signal-target signaling network (exSigNet), and perform quantitative analysis on exSigNets. The applications of exFINDER to scRNA-seq datasets from different species demonstrate the accuracy and robustness of identifying external signals, revealing critical transition-related signaling activities, inferring critical external signals and targets, clustering signal-target paths, and evaluating relevant biological events. Overall, exFINDER can be applied to scRNA-seq data to reveal the external signal-associated activities and maybe novel cells that send such signals.more » « less
- 
            Abstract Single-cell RNA sequencing (scRNA-seq) provides details for individual cells; however, crucial spatial information is often lost. We present SpaOTsc, a method relying on structured optimal transport to recover spatial properties of scRNA-seq data by utilizing spatial measurements of a relatively small number of genes. A spatial metric for individual cells in scRNA-seq data is first established based on a map connecting it with the spatial measurements. The cell–cell communications are then obtained by “optimally transporting” signal senders to target signal receivers in space. Using partial information decomposition, we next compute the intercellular gene–gene information flow to estimate the spatial regulations between genes across cells. Four datasets are employed for cross-validation of spatial gene expression prediction and comparison to known cell–cell communications. SpaOTsc has broader applications, both in integrating non-spatial single-cell measurements with spatial data, and directly in spatial single-cell transcriptomics data to reconstruct spatial cellular dynamics in tissues.more » « less
- 
            Abstract Single cell profiling techniques including multi-omics and spatial-omics technologies allow researchers to study cell-cell variation within a cell population. These variations extend to biological networks within cells, in particular, the gene regulatory networks (GRNs). GRNs rewire as the cells evolve, and different cells can have different governing GRNs. However, existing GRN inference methods usually infer a single GRN for a population of cells, without exploring the cell-cell variation in terms of their regulatory mechanisms. Recently, jointly profiled single cell transcriptomics and chromatin accessibility data have been used to infer GRNs. Although methods based on such multi-omics data were shown to improve over the accuracy of methods using only single cell RNA-seq (scRNA-seq) data, they do not take full advantage of the single cell resolution chromatin accessibility data. We propose CeSpGRN (CellSpecificGeneRegulatoryNetwork inference), which infers cell-specific GRNs from scRNA-seq, single cell multi-omics, or single cell spatial-omics data. CeSpGRN uses a Gaussian weighted kernel that allows the GRN of a given cell to be learned from the sequencing profile of itself and its neighboring cells in the developmental process. The kernel is constructed from the similarity of gene expressions or spatial locations between cells. When the chromatin accessibility data is available, CeSpGRN constructs cell-specific prior networks which are used to further improve the inference accuracy. We applied CeSpGRN to various types of real-world datasets and inferred various regulation changes that were shown to be important in cell development. We also quantitatively measured the performance of CeSpGRN on simulated datasets and compared with baseline methods. The results show that CeSpGRN has a superior performance in reconstructing the GRN for each cell, as well as in detecting the regulatory interactions that differ between cells. CeSpGRN is available athttps://github.com/PeterZZQ/CeSpGRN.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
