Fast and precise single-cell data analysis using a hierarchical autoencoder

Tran, Duc (ORCID:0000000329188601); Nguyen, Hung; Tran, Bang; La Vecchia, Carlo (ORCID:000000031441897X); Luu, Hung N. (ORCID:0000000221721849); Nguyen, Tin (ORCID:0000000180019470)

doi:10.1038/s41467-021-21312-2

Citation Details

Fast and precise single-cell data analysis using a hierarchical autoencoder

Abstract A primary challenge in single-cell RNA sequencing (scRNA-seq) studies comes from the massive amount of data and the excess noise level. To address this challenge, we introduce an analysis framework, named single-cell Decomposition using Hierarchical Autoencoder (scDHA), that reliably extracts representative information of each cell. The scDHA pipeline consists of two core modules. The first module is a non-negative kernel autoencoder able to remove genes or components that have insignificant contributions to the part-based representation of the data. The second module is a stacked Bayesian autoencoder that projects the data onto a low-dimensional space (compressed). To diminish the tendency to overfit of neural networks, we repeatedly perturb the compressed space to learn a more generalized representation of the data. In an extensive analysis, we demonstrate that scDHA outperforms state-of-the-art techniques in many research sub-fields of scRNA-seq analysis, including cell segregation through unsupervised learning, visualization of transcriptome landscape, cell classification, and pseudo-time inference. more »

Award ID(s):: 2019609 2001385

PAR ID:: 10214234

Author(s) / Creator(s):: Tran, Duc; Nguyen, Hung; Tran, Bang; La Vecchia, Carlo; Luu, Hung N.; Nguyen, Tin

Publisher / Repository:: Nature Publishing Group

Date Published:: 2021-02-15

Journal Name:: Nature Communications

Volume:: 12

Issue:: 1

ISSN:: 2041-1723

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1038/s41467-021-21312-2

More Like this