NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

An Efficient Linear Mixed Model Framework for Meta-Analytic Association Studies Across Multiple Contexts

Jew, Brandon; Li, Jiajin; Sankararaman, Sriram; Sul, Jae-Hoon (January 2021, 21st International Workshop on Algorithms in Bioinformatics, {WABI})

Linear mixed models (LMMs) can be applied in the meta-analyses of responses from individuals across multiple contexts, increasing power to detect associations while accounting for confounding effects arising from within-individual variation. However, traditional approaches to fitting these models can be computationally intractable. Here, we describe an efficient and exact method for fitting a multiple-context linear mixed model. Whereas existing exact methods may be cubic in their time complexity with respect to the number of individuals, our approach for multiple-context LMMs (mcLMM) is linear. These improvements allow for large-scale analyses requiring computing time and memory magnitudes of order less than existing methods. As examples, we apply our approach to identify expression quantitative trait loci from large-scale gene expression data measured across multiple tissues as well as joint analyses of multiple phenotypes in genomewide association studies at biobank scale.
more » « less
Full Text Available
Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative

https://doi.org/10.1186/s13073-022-01106-x

Johnson, Ruth; Ding, Yi; Venkateswaran, Vidhya; Bhattacharya, Arjun; Boulier, Kristin; Chiu, Alec; Knyazev, Sergey; Schwarz, Tommer; Freund, Malika; Zhan, Lingyu; et al (December 2022, Genome Medicine)

Abstract Background Large medical centers in urban areas, like Los Angeles, care for a diverse patient population and offer the potential to study the interplay between genetic ancestry and social determinants of health. Here, we explore the implications of genetic ancestry within the University of California, Los Angeles (UCLA) ATLAS Community Health Initiative—an ancestrally diverse biobank of genomic data linked with de-identified electronic health records (EHRs) of UCLA Health patients ( N =36,736). Methods We quantify the extensive continental and subcontinental genetic diversity within the ATLAS data through principal component analysis, identity-by-descent, and genetic admixture. We assess the relationship between genetically inferred ancestry (GIA) and >1500 EHR-derived phenotypes (phecodes). Finally, we demonstrate the utility of genetic data linked with EHR to perform ancestry-specific and multi-ancestry genome and phenome-wide scans across a broad set of disease phenotypes. Results We identify 5 continental-scale GIA clusters including European American (EA), African American (AA), Hispanic Latino American (HL), South Asian American (SAA) and East Asian American (EAA) individuals and 7 subcontinental GIA clusters within the EAA GIA corresponding to Chinese American, Vietnamese American, and Japanese American individuals. Although we broadly find that self-identified race/ethnicity (SIRE) is highly correlated with GIA, we still observe marked differences between the two, emphasizing that the populations defined by these two criteria are not analogous. We find a total of 259 significant associations between continental GIA and phecodes even after accounting for individuals’ SIRE, demonstrating that for some phenotypes, GIA provides information not already captured by SIRE. GWAS identifies significant associations for liver disease in the 22q13.31 locus across the HL and EAA GIA groups (HL p -value=2.32×10 −16 , EAA p -value=6.73×10 −11 ). A subsequent PheWAS at the top SNP reveals significant associations with neurologic and neoplastic phenotypes specifically within the HL GIA group. Conclusions Overall, our results explore the interplay between SIRE and GIA within a disease context and underscore the utility of studying the genomes of diverse individuals through biobank-scale genotyping linked with EHR-based phenotyping.
more » « less
Full Text Available
Variant calling and quality control of large-scale human genome sequencing data

https://doi.org/10.1042/ETLS20190007

Pellegrini, Matteo; Jew, Brandon; Sul, Jae Hoon (July 2019, Emerging Topics in Life Sciences)

Abstract Next-generation sequencing has allowed genetic studies to collect genome sequencing data from a large number of individuals. However, raw sequencing data are not usually interpretable due to fragmentation of the genome and technical biases; therefore, analysis of these data requires many computational approaches. First, for each sequenced individual, sequencing data are aligned and further processed to account for technical biases. Then, variant calling is performed to obtain information on the positions of genetic variants and their corresponding genotypes. Quality control (QC) is applied to identify individuals and genetic variants with sequencing errors. These procedures are necessary to generate accurate variant calls from sequencing data, and many computational approaches have been developed for these tasks. This review will focus on current widely used approaches for variant calling and QC.
more » « less
Full Text Available
ForestQC: Quality control on genetic variants from next-generation sequencing data using random forest

https://doi.org/10.1371/journal.pcbi.1007556

Li, Jiajin; Jew, Brandon; Zhan, Lingyu; Hwang, Sungoo; Coppola, Giovanni; Freimer, Nelson B.; Sul, Jae Hoon; Pertea, Mihaela (December 2019, PLOS Computational Biology)

Full Text Available
Leveraging allelic imbalance to refine fine-mapping for eQTL studies

https://doi.org/10.1371/journal.pgen.1008481

Zou, Jennifer; Hormozdiari, Farhad; Jew, Brandon; Castel, Stephane E.; Lappalainen, Tuuli; Ernst, Jason; Sul, Jae Hoon; Eskin, Eleazar; Wen, Xiaoquan (December 2019, PLOS Genetics)

Full Text Available
Accurate estimation of cell composition in bulk expression through robust integration of single-cell information

https://doi.org/10.1038/s41467-020-15816-6

Jew, Brandon; Alvarez, Marcus; Rahmani, Elior; Miao, Zong; Ko, Arthur; Garske, Kristina M.; Sul, Jae Hoon; Pietiläinen, Kirsi H.; Pajukanta, Päivi; Halperin, Eran (April 2020, Nature Communications)

Abstract We present Bisque, a tool for estimating cell type proportions in bulk expression. Bisque implements a regression-based approach that utilizes single-cell RNA-seq (scRNA-seq) or single-nucleus RNA-seq (snRNA-seq) data to generate a reference expression profile and learn gene-specific bulk expression transformations to robustly decompose RNA-seq data. These transformations significantly improve decomposition performance compared to existing methods when there is significant technical variation in the generation of the reference profile and observed bulk expression. Importantly, compared to existing methods, our approach is extremely efficient, making it suitable for the analysis of large genomic datasets that are becoming ubiquitous. When applied to subcutaneous adipose and dorsolateral prefrontal cortex expression datasets with both bulk RNA-seq and snRNA-seq data, Bisque replicates previously reported associations between cell type proportions and measured phenotypes across abundant and rare cell types. We further propose an additional mode of operation that merely requires a set of known marker genes.
more » « less
Contribution of common and rare variants to bipolar disorder susceptibility in extended pedigrees from population isolates

https://doi.org/10.1038/s41398-020-0758-1

Sul, Jae Hoon; Service, Susan K.; Huang, Alden Y.; Ramensky, Vasily; Hwang, Sun-Goo; Teshiba, Terri M.; Park, YoungJun; Ori, Anil P.; Zhang, Zhongyang; Mullins, Niamh; et al (December 2020, Translational Psychiatry)

Full Text Available
Synaptic processes and immune-related pathways implicated in Tourette syndrome

https://doi.org/10.1038/s41398-020-01082-z

Tsetsos, Fotis; Yu, Dongmei; Sul, Jae Hoon; Huang, Alden Y.; Illmann, Cornelia; Osiecki, Lisa; Darrow, Sabrina M.; Hirschtritt, Matthew E.; Greenberg, Erica; Muller-Vahl, Kirsten R.; et al (June 2021, Translational Psychiatry)
null (Ed.)
Abstract Tourette syndrome (TS) is a neuropsychiatric disorder of complex genetic architecture involving multiple interacting genes. Here, we sought to elucidate the pathways that underlie the neurobiology of the disorder through genome-wide analysis. We analyzed genome-wide genotypic data of 3581 individuals with TS and 7682 ancestry-matched controls and investigated associations of TS with sets of genes that are expressed in particular cell types and operate in specific neuronal and glial functions. We employed a self-contained, set-based association method (SBA) as well as a competitive gene set method (MAGMA) using individual-level genotype data to perform a comprehensive investigation of the biological background of TS. Our SBA analysis identified three significant gene sets after Bonferroni correction, implicating ligand-gated ion channel signaling, lymphocytic, and cell adhesion and transsynaptic signaling processes. MAGMA analysis further supported the involvement of the cell adhesion and trans-synaptic signaling gene set. The lymphocytic gene set was driven by variants in FLT3 , raising an intriguing hypothesis for the involvement of a neuroinflammatory element in TS pathogenesis. The indications of involvement of ligand-gated ion channel signaling reinforce the role of GABA in TS, while the association of cell adhesion and trans-synaptic signaling gene set provides additional support for the role of adhesion molecules in neuropsychiatric disorders. This study reinforces previous findings but also provides new insights into the neurobiology of TS.
more » « less
Full Text Available
Genome-wide Association Study Points to Novel Locus for Gilles de la Tourette Syndrome

https://doi.org/10.1016/j.biopsych.2023.01.023

Tsetsos, Fotis; Topaloudi, Apostolia; Jain, Pritesh; Yang, Zhiyu; Yu, Dongmei; Kolovos, Petros; Tumer, Zeynep; Rizzo, Renata; Hartmann, Andreas; Depienne, Christel; et al (February 2023, Biological Psychiatry)

Full Text Available
Interrogating the Genetic Determinants of Tourette’s Syndrome and Other Tic Disorders Through Genome-Wide Association Studies

https://doi.org/10.1176/appi.ajp.2018.18070857

Yu, Dongmei; Sul, Jae Hoon; Tsetsos, Fotis; Nawaz, Muhammad S.; Huang, Alden Y.; Zelaya, Ivette; Illmann, Cornelia; Osiecki, Lisa; Darrow, Sabrina M.; Hirschtritt, Matthew E.; et al (March 2019, American Journal of Psychiatry)

Full Text Available

« Prev Next »

Search for: All records