A Hyperplane-Based Algorithm for Semi-Supervised Dimension Reduction

Fang, Huang; Cheng, Minhao; Hsieh, Cho-Jui

doi:10.1109/ICDM.2017.19

We consider the semi-supervised dimension reduction problem: given a high-dimensional dataset with a small number of labeled examples and a large number of unlabeled examples, the goal is to find a low-dimensional embedding that yields good classification results. Most previous algorithms for this task are linkage-based: they enforce must-link and cannot-link constraints during dimension reduction, which leads to a nearest-neighbor classifier in the low-dimensional space. In this paper, we propose a new hyperplane-based semi-supervised dimension reduction method whose main objective is to learn low-dimensional features that both approximate the original data and form a good separating hyperplane. We formulate this as a non-convex optimization problem and propose an efficient algorithm to solve it. The algorithm scales to problems with millions of features and can easily incorporate non-negativity constraints in order to learn interpretable non-negative features. Experiments on real-world datasets demonstrate that our hyperplane-based dimension reduction method outperforms state-of-the-art linkage-based methods when very few labels are available.
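The abstract does not state the exact formulation, so the following is only a rough sketch of the general idea: a low-rank reconstruction term is combined with a margin term on the few labeled points, so that the learned features both approximate the data and admit a separating hyperplane. The function name `fit_sketch`, the squared hinge loss, the weight `lam`, and the use of plain (projected) gradient descent are all assumptions made for illustration; they are not taken from the paper and may differ from the authors' algorithm.

```python
import numpy as np

def fit_sketch(X, y, labeled_idx, k=2, lam=1.0, lr=1e-3, n_iter=500,
               nonneg=False, seed=0):
    """Illustrative (hypothetical) joint objective, NOT the paper's method:
        0.5 * ||X - U V^T||_F^2
        + lam * sum_{i labeled} max(0, 1 - y_i * (u_i . w + b))^2
    U holds the low-dimensional features, (w, b) the hyperplane.
    labeled_idx is assumed to contain unique row indices; y is in {-1, +1}.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    U = 0.1 * rng.standard_normal((n, k))
    V = 0.1 * rng.standard_normal((d, k))
    if nonneg:
        U, V = np.abs(U), np.abs(V)
    w, b = np.zeros(k), 0.0
    yl = y[labeled_idx]

    for _ in range(n_iter):
        # Gradient of the reconstruction term.
        R = U @ V.T - X
        gU, gV = R @ V, R.T @ U

        # Gradient of the squared hinge term on labeled rows only.
        Ul = U[labeled_idx]
        slack = np.maximum(1.0 - yl * (Ul @ w + b), 0.0)
        coef = -2.0 * lam * yl * slack
        gU[labeled_idx] += coef[:, None] * w[None, :]
        gw, gb = Ul.T @ coef, coef.sum()

        U -= lr * gU
        V -= lr * gV
        w -= lr * gw
        b -= lr * gb

        if nonneg:
            # Projection step: keep both factors non-negative.
            np.maximum(U, 0.0, out=U)
            np.maximum(V, 0.0, out=V)
    return U, V, w, b
```

In this sketch, unlabeled rows influence the features only through the reconstruction term, while labeled rows are additionally pushed to the correct side of the hyperplane. Setting `nonneg=True` projects both factors onto the non-negative orthant after each step, which is one simple way to impose the non-negativity constraints the abstract mentions.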

Award ID(s): 1719097

PAR ID: 10058227

Author(s) / Creator(s): Fang, Huang; Cheng, Minhao; Hsieh, Cho-Jui

Date Published: 2017-11-01

Journal Name: IEEE International Conference on Data Mining (ICDM)

Page Range / eLocation ID: 101 to 110

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper: https://doi.org/10.1109/ICDM.2017.19