Best Practices in Designing, Sequencing, and Identifying Random DNA Barcodes

Johnson, Milo S.; Venkataram, Sandeep; Kryazhimskiy, Sergey (ORCID: 0000-0001-9128-8705)

doi:10.1007/s00239-022-10083-z

Abstract: Random DNA barcodes are a versatile tool for tracking cell lineages, with applications ranging from development to cancer to evolution. Here, we review and critically evaluate barcode designs as well as methods of barcode sequencing and initial processing of barcode data. We first demonstrate how various barcode design decisions affect data quality and propose a new design that balances all considerations that we are currently aware of. We then discuss various options for the preparation of barcode sequencing libraries, including inline indices and Unique Molecular Identifiers (UMIs). Finally, we test the performance of several established and new bioinformatic pipelines for the extraction of barcodes from raw sequencing reads and for error correction. We find that both alignment and regular expression-based approaches work well for barcode extraction, and that error-correction pipelines designed specifically for barcode data are superior to generic ones. Overall, this review will help researchers to approach their barcoding experiments in a deliberate and systematic way.
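To illustrate what a regular expression-based approach to barcode extraction can look like, here is a minimal Python sketch. It is not the authors' pipeline: the flanking sequences `LEFT_FLANK` and `RIGHT_FLANK`, the barcode length `BARCODE_LEN`, and the function names are all hypothetical, chosen only for illustration. It matches flanks exactly and does not attempt the error correction that the abstract says is better handled by purpose-built tools.

```python
import re
from collections import Counter

# Hypothetical constant regions around the random barcode (assumptions,
# not taken from the paper). Real designs use longer flanks.
LEFT_FLANK = "ACGTAC"
RIGHT_FLANK = "TTGCAA"
BARCODE_LEN = 8  # hypothetical barcode length

# Capture exactly BARCODE_LEN unambiguous bases between the two flanks.
BARCODE_RE = re.compile(
    f"{LEFT_FLANK}([ACGT]{{{BARCODE_LEN}}}){RIGHT_FLANK}"
)


def extract_barcode(read):
    """Return the barcode between the flanks, or None if no match."""
    match = BARCODE_RE.search(read)
    return match.group(1) if match else None


def count_barcodes(reads):
    """Count exact barcode sequences, skipping reads without a match."""
    counts = Counter()
    for read in reads:
        barcode = extract_barcode(read)
        if barcode is not None:
            counts[barcode] += 1
    return counts


if __name__ == "__main__":
    reads = [
        "GGACGTACAAAACCCCTTGCAAGG",  # barcode AAAACCCC with extra bases around
        "ACGTACAAAACCCCTTGCAA",      # same barcode again
        "ACGTACGGGGTTTTTTGCAA",      # barcode GGGGTTTT
        "ACGTACAAANCCCCTTGCAA",      # ambiguous base N: rejected
    ]
    for barcode, n in count_barcodes(reads).most_common():
        print(barcode, n)
```

Because the flanks are matched exactly, any read with a sequencing error in a flank is dropped in this sketch; tolerating such errors would need fuzzy matching or alignment, which is outside what this example shows.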

Award ID(s): 2109800

PAR ID: 10391941

Author(s) / Creator(s): Johnson, Milo S.; Venkataram, Sandeep; Kryazhimskiy, Sergey

Publisher / Repository: Springer Science + Business Media

Date Published: 2023-01-18

Journal Name: Journal of Molecular Evolution

Volume: 91

Issue: 3

ISSN: 0022-2844

Page Range / eLocation ID: p. 263-280

Sponsoring Org: National Science Foundation

Journal Article:
https://doi.org/10.1007/s00239-022-10083-z
