A data harmonization pipeline to leverage external controls and boost power in GWAS

Chen, Danfeng; Tashman, Katherine; Palmer, Duncan S; Neale, Benjamin; Roeder, Kathryn; Bloemendal, Alex; Churchhouse, Claire; Ke, Zheng Tracy

doi:10.1093/hmg/ddab261

Abstract: The use of external controls in genome-wide association studies (GWAS) can significantly increase the size and diversity of the control sample, enabling high-resolution ancestry matching and enhancing the power to detect association signals. However, the aggregation of controls from multiple sources is challenging due to batch effects, difficulty in identifying genotyping errors and the use of different genotyping platforms. These obstacles have impeded the use of external controls in GWAS and can lead to spurious results if not carefully addressed. We propose a unified data harmonization pipeline that includes an iterative approach to quality control and imputation, implemented before and after merging cohorts and arrays. We apply this harmonization pipeline to aggregate 27 517 European control samples from 16 collections within dbGaP. We leverage these harmonized controls to conduct a GWAS of Crohn’s disease. We demonstrate a boost in power over using the cohort samples alone, and show that our procedure results in summary statistics free of any significant batch effects. This harmonization pipeline for aggregating genotype data from multiple sources can also serve other applications where individual-level genotypes, rather than summary statistics, are required.

Award ID(s): 1943902

PAR ID: 10348690

Author(s) / Creator(s): Chen, Danfeng; Tashman, Katherine; Palmer, Duncan S; Neale, Benjamin; Roeder, Kathryn; Bloemendal, Alex; Churchhouse, Claire; Ke, Zheng Tracy

Date Published: 2021-09-11

Journal Name: Human Molecular Genetics

Volume: 31

Issue: 3

ISSN: 0964-6906

Page Range / eLocation ID: 481 to 489

Sponsoring Org: National Science Foundation

Journal Article: https://doi.org/10.1093/hmg/ddab261
