Information‐incorporated Gaussian graphical model for gene expression data

Yi, Huangdi; Zhang, Qingzhao; Lin, Cunjie (ORCID: 0000-0001-5043-0680); Ma, Shuangge (ORCID: 0000-0001-9001-4999)

doi:10.1111/biom.13428

Abstract

In the analysis of gene expression data, network approaches take a system perspective and have played an irreplaceably important role. Gaussian graphical models (GGMs) have been popular in the network analysis of gene expression data. They investigate the conditional dependence between genes and “transform” the problem of estimating network structures into a sparse estimation of precision matrices. When there is a moderate to large number of genes, the number of parameters to be estimated may overwhelm the limited sample size, leading to unreliable estimation and selection. In this article, we propose incorporating information from previous studies (for example, those deposited at PubMed) to assist in estimating the network structure in the present data. It is recognized that such information can be partial, biased, or even wrong. A penalization‐based estimation approach is developed, shown to have consistency properties, and realized using an effective computational algorithm. Simulation demonstrates its competitive performance under various information accuracy scenarios. The analysis of TCGA lung cancer prognostic genes leads to network structures different from the alternatives.
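
For orientation, the standard GGM setup that the abstract alludes to can be sketched as below. This is the generic l1-penalized (graphical lasso) formulation, not the information-incorporating penalty proposed in the article, which the abstract does not spell out. The symbols $X$, $\Sigma$, $\Theta$, $S$, and $\lambda$ are notation assumed here for illustration.

```latex
% Generic GGM background (assumed notation; not the article's specific estimator).
% Expression levels of p genes, with covariance \Sigma and precision matrix \Theta:
X = (X_1, \dots, X_p)^\top \sim N(0, \Sigma), \qquad \Theta = \Sigma^{-1} = (\theta_{jk})

% A zero off-diagonal entry of \Theta encodes conditional independence:
\theta_{jk} = 0 \iff X_j \perp X_k \mid X_{-\{j,k\}}

% Standard l1-penalized estimate, with S the sample covariance matrix
% and \lambda \ge 0 a tuning parameter:
\hat{\Theta} = \arg\min_{\Theta \succ 0}
  \left\{ \operatorname{tr}(S\Theta) - \log\det\Theta
  + \lambda \sum_{j \neq k} |\theta_{jk}| \right\}
```

Under this formulation, an edge between genes $j$ and $k$ is drawn when $\hat{\theta}_{jk} \neq 0$. A symmetric $p \times p$ precision matrix has $p(p+1)/2$ free parameters, which illustrates how the parameter count can overwhelm a limited sample size as the number of genes grows.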

NSF-PAR ID: 10368560

Author(s) / Creator(s): Yi, Huangdi; Zhang, Qingzhao; Lin, Cunjie; Ma, Shuangge

Publisher / Repository: Oxford University Press

Date Published: 2021-02-12

Journal Name: Biometrics

Volume: 78

Issue: 2

ISSN: 0006-341X

Format(s): Medium: X; Size: p. 512-523

Sponsoring Org: National Science Foundation

Journal Article:
https://doi.org/10.1111/biom.13428
