TripletGO: Integrating Transcript Expression Profiles with Protein Homology Inferences for Gene Function Prediction

Zhu, Yi-Heng (ORCID:0000000238571533); Zhang, Chengxin (ORCID:0000000172901324); Liu, Yan (ORCID:0000000253313655); Omenn, Gilbert S. (ORCID:0000000289766074); Freddolino, Peter L. (ORCID:0000000258214226); Yu, Dong-Jun (ORCID:0000000267868053); Zhang, Yang (ORCID:0000000227391916)

doi:10.1016/j.gpb.2022.03.001

Citation Details

Abstract: Gene Ontology (GO) has been widely used to annotate functions of genes and gene products. Here, we propose a new method, TripletGO, to deduce GO terms of protein-coding and non-coding genes through the integration of four complementary pipelines built on transcript expression profile, genetic sequence alignment, protein sequence alignment, and naïve probability. TripletGO was tested on a large set of 5754 genes from 8 species (human, mouse, Arabidopsis, rat, fly, budding yeast, fission yeast, and nematode) and 2433 proteins with available expression data from the third Critical Assessment of Protein Function Annotation challenge (CAFA3). Experimental results show that TripletGO achieves function annotation accuracy significantly higher than that of current state-of-the-art approaches. Detailed analyses show that the major advantage of TripletGO lies in the coupling of a new triplet network-based profiling method with the feature space mapping technique, which can accurately recognize function patterns from transcript expression profiles. Meanwhile, the combination of multiple complementary models, especially those from transcript expression and protein-level alignments, improves the coverage and accuracy of the final GO annotation results. The standalone package and an online server of TripletGO are freely available at https://zhanggroup.org/TripletGO/.
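For readers unfamiliar with the term, the "triplet network" mentioned in the abstract refers to a family of models usually trained with a triplet margin loss, which pulls an anchor sample toward a similar (positive) sample and pushes it away from a dissimilar (negative) one. The sketch below shows only that generic loss. It is not taken from the TripletGO package; the function names, the Euclidean distance, the margin value, and the toy embedding vectors are all assumptions made for illustration.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Generic triplet margin loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is at least `margin` farther
    from the anchor than the positive is.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


# Hypothetical 3-dimensional embeddings of expression profiles (made-up numbers).
anchor = [0.0, 0.0, 0.0]
positive = [0.0, 3.0, 4.0]   # distance 5 from the anchor
negative = [6.0, 8.0, 0.0]   # distance 10 from the anchor

# Negative is already far enough away, so no penalty is incurred.
well_separated = triplet_loss(anchor, positive, negative, margin=1.0)

# Swapping the roles violates the margin and yields a positive loss.
violating = triplet_loss(anchor, negative, positive, margin=1.0)

print(well_separated, violating)
```

In this toy setting, genes sharing a function would play the role of anchor and positive, and a gene with a different function would be the negative; how the published method actually constructs triplets and maps features is described in the article itself, not here.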

Award ID(s): 2025426

PAR ID: 10506764

Author(s) / Creator(s): Zhu, Yi-Heng; Zhang, Chengxin; Liu, Yan; Omenn, Gilbert S.; Freddolino, Peter L.; Yu, Dong-Jun; Zhang, Yang

Publisher / Repository: Oxford University Press

Date Published: 2022-05-11

Journal Name: Genomics, Proteomics & Bioinformatics

Volume: 20

Issue: 5

ISSN: 1672-0229

Format(s): Medium: X

Size(s): p. 1013-1027

Sponsoring Org: National Science Foundation

Journal Article:
https://doi.org/10.1016/j.gpb.2022.03.001
