scReadSim: a single-cell RNA-seq and ATAC-seq read simulator

Yan, Guanao; Song, Dongyuan (ORCID:0000000311141215); Li, Jingyi Jessica (ORCID:0000000292885648)

doi:10.1038/s41467-023-43162-w

Citation Details

scReadSim: a single-cell RNA-seq and ATAC-seq read simulator

Abstract Benchmarking single-cell RNA-seq (scRNA-seq) and single-cell Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) computational tools demands simulators to generate realistic sequencing reads. However, none of the few read simulators aim to mimic real data. To fill this gap, we introduce scReadSim, a single-cell RNA-seq and ATAC-seq read simulator that allows user-specified ground truths and generates synthetic sequencing reads (in a FASTQ or BAM file) by mimicking real data. At both read-sequence and read-count levels, scReadSim mimics real scRNA-seq and scATAC-seq data. Moreover, scReadSim provides ground truths, including unique molecular identifier (UMI) counts for scRNA-seq and open chromatin regions for scATAC-seq. In particular, scReadSim allows users to design cell-type-specific ground-truth open chromatin regions for scATAC-seq data generation. In benchmark applications of scReadSim, we show that UMI-tools achieves the top accuracy in scRNA-seq UMI deduplication, and HMMRATAC and MACS3 achieve the top performance in scATAC-seq peak calling. more »

Award ID(s):: 1846216 2113754

PAR ID:: 10474555

Author(s) / Creator(s):: Yan, Guanao; Song, Dongyuan; Li, Jingyi Jessica

Publisher / Repository:: Nature Publishing Group

Date Published:: 2023-11-18

Journal Name:: Nature Communications

Volume:: 14

Issue:: 1

ISSN:: 2041-1723

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1038/s41467-023-43162-w

More Like this