Demystifying “drop-outs” in single-cell UMI data

Kim, Tae Hyun; Zhou, Xiang; Chen, Mengjie

doi:10.1186/s13059-020-02096-y

Citation Details

Demystifying “drop-outs” in single-cell UMI data

Abstract Many existing pipelines for scRNA-seq data apply pre-processing steps such as normalization or imputation to account for excessive zeros or “drop-outs. Here, we extensively analyze diverse UMI data sets to show that clustering should be the foremost step of the workflow. We observe that most drop-outs disappear once cell-type heterogeneity is resolved, while imputing or normalizing heterogeneous data can introduce unwanted noise. We propose a novel framework HIPPO (Heterogeneity-Inspired Pre-Processing tOol) that leverages zero proportions to explain cellular heterogeneity and integrates feature selection with iterative clustering. HIPPO leads to downstream analysis with greater flexibility and interpretability compared to alternatives. more »

Award ID(s):: 1712933

PAR ID:: 10480493

Author(s) / Creator(s):: Kim, Tae Hyun; Zhou, Xiang; Chen, Mengjie

Publisher / Repository:: Genome Biology

Date Published:: 2020-12-01

Journal Name:: Genome Biology

Volume:: 21

Issue:: 1

ISSN:: 1474-760X

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1186/s13059-020-02096-y

More Like this