Guided Semi-Supervised Non-Negative Matrix Factorization

Li, Pengyu; Tseng, Christine; Zheng, Yaxuan; Chew, Joyce A.; Huang, Longxiu; Jarman, Benjamin; Needell, Deanna

doi:10.3390/a15050136

Citation Details

Guided Semi-Supervised Non-Negative Matrix Factorization

Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform classification and topic modeling tasks; however, most methods that can perform both do not allow for guidance of the topics or features. In this paper, we propose a novel method, namely Guided Semi-Supervised Non-negative Matrix Factorization (GSSNMF), that performs both classification and topic modeling by incorporating supervision from both pre-assigned document class labels and user-designed seed words. We test the performance of this method on legal documents provided by the California Innocence Project and the 20 Newsgroups dataset. Our results show that the proposed method improves both classification accuracy and topic coherence in comparison to past methods such as Semi-Supervised Non-negative Matrix Factorization (SSNMF), Guided Non-negative Matrix Factorization (Guided NMF), and Topic Supervised NMF. more »

Award ID(s):: 2011140

PAR ID:: 10418274

Author(s) / Creator(s):: Li, Pengyu; Tseng, Christine; Zheng, Yaxuan; Chew, Joyce A.; Huang, Longxiu; Jarman, Benjamin; Needell, Deanna

Date Published:: 2022-05-01

Journal Name:: Algorithms

Volume:: 15

Issue:: 5

ISSN:: 1999-4893

Page Range / eLocation ID:: 136

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.3390/a15050136

More Like this