COBRAC: a fast implementation of convex biclustering with compression

Yi, Haidong; Huang, Le; Mishne, Gal; Chi, Eric C

doi:10.1093/bioinformatics/btab248

Citation Details

COBRAC: a fast implementation of convex biclustering with compression

Abstract Biclustering is a generalization of clustering used to identify simultaneous grouping patterns in observations (rows) and features (columns) of a data matrix. Recently, the biclustering task has been formulated as a convex optimization problem. While this convex recasting of the problem has attractive properties, existing algorithms do not scale well. To address this problem and make convex biclustering a practical tool for analyzing larger data, we propose an implementation of fast convex biclustering called COBRAC to reduce the computing time by iteratively compressing problem size along the solution path. We apply COBRAC to several gene expression datasets to demonstrate its effectiveness and efficiency. Besides the standalone version for COBRAC, we also developed a related online web server for online calculation and visualization of the downloadable interactive results. Availability The source code and test data are available at https://github.com/haidyi/cvxbiclustr or https://zenodo.org/record/4620218. The web server is available at https://cvxbiclustr.ericchi.com. Supplementary information Supplementary data are available at Bioinformatics online. more »

Award ID(s):: 1752692 2201136

PAR ID:: 10252252

Author(s) / Creator(s):: Yi, Haidong; Huang, Le; Mishne, Gal; Chi, Eric C

Editor(s):: Mathelier, Anthony

Date Published:: 2021-04-27

Journal Name:: Bioinformatics

ISSN:: 1367-4803

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1093/bioinformatics/btab248

More Like this