TGPred: efficient methods for predicting target genes of a transcription factor by integrating statistics, machine learning and optimization

Cao, Xuewei; Zhang, Ling (ORCID:0000000267990673); Islam, Md Khairul; Zhao, Mingxia (ORCID:0000000207933378); He, Cheng; Zhang, Kui (ORCID:0000000224412064); Liu, Sanzhen (ORCID:000000029513855X); Sha, Qiuying; Wei, Hairong (ORCID:0000000235514998)

doi:10.1093/nargab/lqad083

Abstract Four statistical selection methods for inferring transcription factor (TF)–target gene (TG) pairs were developed by coupling a mean squared error (MSE) or Huber loss function with an elastic net (ENET) or least absolute shrinkage and selection operator (Lasso) penalty. Two methods were also developed for inferring pathway gene regulatory networks (GRNs) by combining the Huber or MSE loss function with a network (Net)-based penalty. To solve these regressions, we improved an accelerated proximal gradient descent (APGD) algorithm to optimize parameter selection processes, resulting in an algorithm that is equally effective but much faster than the commonly used convex optimization solver. Synthetic data generated in a general setting were used to test the four TF–TG identification methods; ENET-based methods performed better than Lasso-based methods. Synthetic data generated from two network settings were used to test Huber-Net and MSE-Net, which outperformed all other methods. The TF–TG identification methods were also tested with SND1 and gl3 overexpression transcriptomic data; Huber-ENET and MSE-ENET outperformed all other methods when genome-wide predictions were performed. The TF–TG identification methods address the lack of a method for genome-wide TG prediction of a TF and have potential for validating ChIP/DAP-seq results, while the two Net-based methods are instrumental for predicting pathway GRNs.
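
As a rough illustration of the kind of regression the abstract describes, the following is a minimal sketch of accelerated proximal gradient descent for a Huber or MSE loss with an elastic net penalty. It is not the TGPred implementation. The function names (`soft_threshold`, `apgd_enet`), the parameters `lam`, `alpha`, `delta`, and `n_iter`, the 1/n scaling of the loss, the fixed step size, and the FISTA-style momentum schedule are all assumptions made for this example. Setting `delta` to infinity gives the MSE loss, and setting `alpha` to 1 gives the Lasso penalty.

```python
import numpy as np


def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1 (elementwise soft-thresholding)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def apgd_enet(X, y, lam=0.1, alpha=0.5, delta=np.inf, n_iter=500):
    """Sketch of accelerated proximal gradient descent (hypothetical helper).

    Assumed objective (not taken from the paper):
        (1/n) * sum_i huber_delta(y_i - x_i @ beta)
        + lam * (alpha * ||beta||_1 + (1 - alpha) / 2 * ||beta||_2^2)
    With delta = inf the Huber loss reduces to r^2 / 2, i.e. an MSE-type loss.
    """
    n, p = X.shape
    # The loss gradient is Lipschitz with constant ||X||_2^2 / n,
    # so 1 / L is a safe fixed step size.
    step = n / (np.linalg.norm(X, 2) ** 2)
    beta = np.zeros(p)
    z = beta.copy()  # extrapolated point used by the momentum step
    t = 1.0
    for _ in range(n_iter):
        r = y - X @ z
        # Derivative of the Huber loss is the residual clipped to [-delta, delta].
        grad = -(X.T @ np.clip(r, -delta, delta)) / n
        v = z - step * grad
        # Closed-form proximal operator of the elastic net penalty.
        beta_new = soft_threshold(v, step * lam * alpha) / (1.0 + step * lam * (1.0 - alpha))
        # FISTA-style momentum update.
        t_new = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        z = beta_new + ((t - 1.0) / t_new) * (beta_new - beta)
        beta, t = beta_new, t_new
    return beta
```

In a TF–TG setting one could, for instance, take `y` as the expression profile of the TF and the columns of `X` as candidate target genes, and then read putative targets off the nonzero coefficients; that mapping is likewise an assumption of this sketch rather than a description of the published method.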

Award ID(s): 1741090

PAR ID: 10462417

Author(s) / Creator(s): Cao, Xuewei; Zhang, Ling; Islam, Md Khairul; Zhao, Mingxia; He, Cheng; Zhang, Kui; Liu, Sanzhen; Sha, Qiuying; Wei, Hairong

Publisher / Repository: Oxford University Press

Date Published: 2023-09-13

Journal Name: NAR Genomics and Bioinformatics

Volume: 5

Issue: 3

ISSN: 2631-9268

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Journal Article:
https://doi.org/10.1093/nargab/lqad083
