SIMPLEs: a single-cell RNA sequencing imputation strategy preserving gene modules and cell clusters variation

Hu, Zhirui; Zu, Songpeng; Liu, Jun S

doi:10.1093/nargab/lqaa077

Citation Details

SIMPLEs: a single-cell RNA sequencing imputation strategy preserving gene modules and cell clusters variation

Abstract A main challenge in analyzing single-cell RNA sequencing (scRNA-seq) data is to reduce technical variations yet retain cell heterogeneity. Due to low mRNAs content per cell and molecule losses during the experiment (called ‘dropout’), the gene expression matrix has a substantial amount of zero read counts. Existing imputation methods treat either each cell or each gene as independently and identically distributed, which oversimplifies the gene correlation and cell type structure. We propose a statistical model-based approach, called SIMPLEs (SIngle-cell RNA-seq iMPutation and celL clustErings), which iteratively identifies correlated gene modules and cell clusters and imputes dropouts customized for individual gene module and cell type. Simultaneously, it quantifies the uncertainty of imputation and cell clustering via multiple imputations. In simulations, SIMPLEs performed significantly better than prevailing scRNA-seq imputation methods according to various metrics. By applying SIMPLEs to several real datasets, we discovered gene modules that can further classify subtypes of cells. Our imputations successfully recovered the expression trends of marker genes in stem cell differentiation and can discover putative pathways regulating biological processes. more »

Award ID(s):: 1903139

PAR ID:: 10346903

Author(s) / Creator(s):: Hu, Zhirui; Zu, Songpeng; Liu, Jun S

Date Published:: 2020-12-01

Journal Name:: NAR Genomics and Bioinformatics

Volume:: 2

Issue:: 4

ISSN:: 2631-9268

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1093/nargab/lqaa077

More Like this