Majority Vote Cascading: A Semi-Supervised Framework for Improving Protein Function Prediction

Lazarsfeld, John; Rodriguez, Jonathan; Erden, Mert; Liu, Yuelin; Cowen, Lenore J.

doi:10.1109/TCBB.2021.3059812

A method to improve protein function prediction for sparsely annotated protein-protein interaction (PPI) networks is introduced. The method extends the diffusion state distance (DSD) majority vote algorithm introduced by Cao et al. to give confidence scores on predicted labels and to use high-confidence predictions to predict the labels of other nodes in subsequent rounds. We call this a majority vote cascade. Several cascade variants are tested in a stringent cross-validation experiment on PPI networks from S. cerevisiae and D. melanogaster, and we show that for many different settings with several alternative confidence functions, cascading improves the accuracy of the predictions. A list of the most confident new label predictions in the two networks is also reported. Code and networks for the cross-validation experiments appear at http://bcb.cs.tufts.edu/cascade.
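
The cascade idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a generic precomputed distance matrix in place of DSD, uses the winning label's share of the votes as the confidence function (the paper tests several alternatives), and the names `majority_vote`, `cascade`, `k`, and `threshold` are hypothetical.

```python
from collections import Counter


def majority_vote(node, dist, labels, k):
    """Vote among the k nearest labeled nodes; return (label, confidence).

    Confidence here is the winning label's fraction of the votes,
    an assumed stand-in for the paper's confidence functions.
    """
    voters = sorted((v for v in labels if v != node),
                    key=lambda v: dist[node][v])[:k]
    if not voters:
        return None, 0.0
    counts = Counter(labels[v] for v in voters)
    # Break ties deterministically by label name.
    label, votes = max(counts.items(), key=lambda kv: (kv[1], kv[0]))
    return label, votes / len(voters)


def cascade(dist, known, k=3, threshold=0.9, max_rounds=10):
    """Repeatedly accept confident predictions and let them vote later."""
    labels = dict(known)  # copy so the caller's annotations are untouched
    unlabeled = [n for n in range(len(dist)) if n not in labels]
    for _ in range(max_rounds):
        if not unlabeled:
            break
        preds = {n: majority_vote(n, dist, labels, k) for n in unlabeled}
        confident = {n: lab for n, (lab, conf) in preds.items()
                     if lab is not None and conf >= threshold}
        if not confident:
            break  # nothing new to propagate
        labels.update(confident)
        unlabeled = [n for n in unlabeled if n not in confident]
    # Any node still unlabeled falls back to a plain majority vote.
    final = {n: majority_vote(n, dist, labels, k)[0] for n in unlabeled}
    labels.update(final)
    return labels


# Toy data: nodes placed on a line, distance = absolute difference.
coords = [0, 10, 20, 100, 110, 120, 30, 40, 45, 64]
dist = [[abs(a - b) for b in coords] for a in coords]
known = {0: "A", 1: "A", 2: "A", 3: "B", 4: "B", 5: "B"}

print(majority_vote(9, dist, known, 3)[0])  # plain vote on node 9: B
print(cascade(dist, known)[9])              # after cascading: A
```

In the toy data, node 9 initially sees two "B" voters and one "A" voter among its three nearest annotated nodes, so it is not confident in the first round. Nodes 6 to 8 are labeled "A" with full agreement, and in the second round they become node 9's nearest voters. Whether such a change is an improvement depends on the network and the confidence function; the toy example shows only the mechanism.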

Award ID(s): 1812503, 1934553

PAR ID: 10346852

Date Published: 2021-02-01

Journal Name: IEEE/ACM Transactions on Computational Biology and Bioinformatics

ISSN: 1545-5963

Page Range / eLocation ID: 1 to 1

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.1109/TCBB.2021.3059812