Variant calling and quality control of large-scale human genome sequencing data

Pellegrini, Matteo; Jew, Brandon; Sul, Jae Hoon

doi:10.1042/ETLS20190007

Citation Details

Variant calling and quality control of large-scale human genome sequencing data

Abstract Next-generation sequencing has allowed genetic studies to collect genome sequencing data from a large number of individuals. However, raw sequencing data are not usually interpretable due to fragmentation of the genome and technical biases; therefore, analysis of these data requires many computational approaches. First, for each sequenced individual, sequencing data are aligned and further processed to account for technical biases. Then, variant calling is performed to obtain information on the positions of genetic variants and their corresponding genotypes. Quality control (QC) is applied to identify individuals and genetic variants with sequencing errors. These procedures are necessary to generate accurate variant calls from sequencing data, and many computational approaches have been developed for these tasks. This review will focus on current widely used approaches for variant calling and QC. more »

Award ID(s):: 1705197

PAR ID:: 10155514

Author(s) / Creator(s):: Pellegrini, Matteo; Jew, Brandon; Sul, Jae Hoon

Date Published:: 2019-07-29

Journal Name:: Emerging Topics in Life Sciences

Volume:: 3

Issue:: 4

ISSN:: 2397-8554

Page Range / eLocation ID:: 399 to 409

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1042/ETLS20190007

More Like this