NSF PAR Search | NSF Public Access Repository

Supervised Matrix Factorization: Local Landscape Analysis and Applications

Lee, Joowon; Lyu, Hanbaek; Yao, Weixin (July 2024, Proceedings of Machine Learning Research)

Supervised matrix factorization (SMF) is a classical machine learning method that performs low-dimensional feature extraction and classification at the same time. Training an SMF model involves solving a non-convex, factor-wise constrained optimization problem with at least three blocks of parameters. Because of the high non-convexity and the constraints, theoretical understanding of the optimization landscape of SMF has been limited. In this paper, we provide an extensive local landscape analysis for SMF and derive several theoretical and practical applications. Analyzing the diagonal blocks of the Hessian naturally leads to a block coordinate descent (BCD) algorithm with adaptive step sizes. We provide global convergence and iteration complexity guarantees for this algorithm. Full Hessian analysis gives the minimum $L_2$-regularization needed to guarantee local strong convexity and robustness of parameters. We establish a local estimation guarantee under a statistical SMF model. We also propose a novel GPU-friendly neural implementation of the BCD algorithm and validate our theoretical findings through numerical experiments. Our work contributes to a deeper understanding of SMF optimization, offering insights into the optimization landscape and providing practical solutions to enhance its performance.
Full Text Available
A new bandwidth selection method for nonparametric modal regression based on generalized hyperbolic distributions

https://doi.org/10.1007/s00180-023-01435-4

Yuan, Hongpeng; Xiang, Sijia; Yao, Weixin (November 2023, Computational Statistics)

Full Text Available
Semiparametric partially linear varying coefficient modal regression

https://doi.org/10.1016/j.jeconom.2022.09.002

Ullah, Aman; Wang, Tao; Yao, Weixin (August 2023, Journal of Econometrics)

Full Text Available
Exponentially Convergent Algorithms for Supervised Matrix Factorization

Lee, Joowon; Lyu, Hanbaek; Yao, Weixin (August 2023, Advances in Neural Information Processing Systems)

Supervised matrix factorization (SMF) is a classical machine learning method that simultaneously performs feature extraction and classification, which are not necessarily a priori aligned objectives. Our goal is to use SMF to learn low-rank latent factors that offer interpretable, data-reconstructive, and class-discriminative features, addressing challenges posed by high-dimensional data. Training an SMF model involves solving a nonconvex and possibly constrained optimization problem with at least three blocks of parameters. Known algorithms are either heuristic or provide weak convergence guarantees for special cases. In this paper, we provide a novel framework that ‘lifts’ SMF as a low-rank matrix estimation problem in a combined factor space and propose an efficient algorithm that provably converges exponentially fast to a global minimizer of the objective with arbitrary initialization under mild assumptions. Our framework applies to a wide range of SMF-type problems for multi-class classification with auxiliary features. To showcase an application, we demonstrate that our algorithm successfully identifies well-known cancer-associated gene groups for various cancers.
Full Text Available
Energy Efficient Building HVAC Control Algorithm with Real-time Occupancy Prediction

https://doi.org/10.1016/j.egypro.2017.03.028

Shi, Jie; Yu, Nanpeng; Yao, Weixin (March 2017, Energy Procedia)

Full Text Available
Pursuing sources of heterogeneity in modeling clustered population

https://doi.org/10.1111/biom.13434

Li, Yan; Yu, Chun; Zhao, Yize; Yao, Weixin; Aseltine, Robert H.; Chen, Kun (February 2021, Biometrics)

Full Text Available
Impact of aerosols on reservoir inflow: A case study for Big Creek Hydroelectric System in California

https://doi.org/10.1002/hyp.13265

Kabir, Farzana; Yu, Nanpeng; Yao, Weixin; Wu, Longtao; Jiang, Jonathan H.; Gu, Yu; Su, Hui (September 2018, Hydrological Processes)
