NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Random Walks, Conductance, and Resistance for the Connection Graph Laplacian

https://doi.org/10.1137/23M1595400

Cloninger, Alexander; Mishne, Gal; Oslandsbotn, Andreas; Robertson, Sawyer J; Wan, Zhengchao; Wang, Yusu (September 2024, SIAM Journal on Matrix Analysis and Applications)

Full Text Available
LDLE: Low Distortion Local Eigenmaps

Kohli, Dhruv; Cloninger, Alexander; Mishne, Gal (November 2021, Journal of machine learning research)

Full Text Available
Multi‐scale affinities with missing data: Estimation and applications

https://doi.org/10.1002/sam.11561

Zhang, Min; Mishne, Gal; Chi, Eric C. (November 2021, Statistical Analysis and Data Mining: The ASA Data Science Journal)

Full Text Available
Scalable Algorithms for Convex Clustering

https://doi.org/10.1109/DSLW51110.2021.9523411

Zhou, Weilian; Yi, Haidong; Mishne, Gal; Chi, Eric (June 2021, IEEE Data Science Workshop (DSW))

Full Text Available
COBRAC: a fast implementation of convex biclustering with compression

https://doi.org/10.1093/bioinformatics/btab248

Yi, Haidong; Huang, Le; Mishne, Gal; Chi, Eric C (April 2021, Bioinformatics)
Mathelier, Anthony (Ed.)
Abstract Biclustering is a generalization of clustering used to identify simultaneous grouping patterns in observations (rows) and features (columns) of a data matrix. Recently, the biclustering task has been formulated as a convex optimization problem. While this convex recasting of the problem has attractive properties, existing algorithms do not scale well. To address this problem and make convex biclustering a practical tool for analyzing larger data, we propose an implementation of fast convex biclustering called COBRAC to reduce the computing time by iteratively compressing problem size along the solution path. We apply COBRAC to several gene expression datasets to demonstrate its effectiveness and efficiency. Besides the standalone version for COBRAC, we also developed a related online web server for online calculation and visualization of the downloadable interactive results. Availability The source code and test data are available at https://github.com/haidyi/cvxbiclustr or https://zenodo.org/record/4620218. The web server is available at https://cvxbiclustr.ericchi.com. Supplementary information Supplementary data are available at Bioinformatics online.
more » « less
Full Text Available
LDLE: Low Distortion Local Eigenmaps

Kohli, Dhruv; Cloninger, Alexander; Mishne, Gal (January 2021, Journal of machine learning research)

Full Text Available
Multiway Graph Signal Processing on Tensors: Integrative Analysis of Irregular Geometries

https://doi.org/10.1109/MSP.2020.3013555

Stanley III, Jay S.; Chi, Eric C.; Mishne, Gal (November 2020, IEEE Signal Processing Magazine)
null (Ed.)
Full Text Available
Spectral Embedding Norm: Looking Deep into the Spectrum of the Graph Laplacian

https://doi.org/10.1137/18M1283160

Cheng, Xiuyuan; Mishne, Gal (January 2020, SIAM Journal on Imaging Sciences)
null (Ed.)
Full Text Available
Co-manifold learning with missing data

Mishne, Gal; Chi, Eric C; Coifman, Ronald R. (January 2019, Proceedings of the 36th International Conference on Machine Learning)

Representation learning is typically applied to only one mode of a data matrix, either its rows or columns. Yet in many applications, there is an underlying geometry to both the rows and the columns. We propose utilizing this coupled structure to perform co-manifold learning: uncovering the underlying geometry of both the rows and the columns of a given matrix, where we focus on a missing data setting. Our unsupervised approach consists of three components. We first solve a family of optimization problems to estimate a complete matrix at multiple scales of smoothness. We then use this collection of smooth matrix estimates to compute pairwise distances on the rows and columns based on a new multi-scale metric that implicitly introduces a coupling between the rows and the columns. Finally, we construct row and column representations from these multi- scale metrics. We demonstrate that our approach outperforms competing methods in both data visualization and clustering.
more » « less
Full Text Available

Search for: All records