Probabilistic methods for approximate archetypal analysis

Han, Ruijian; Osting, Braxton; Wang, Dong; Xu, Yiming

doi:10.1093/imaiai/iaac008

Abstract: Archetypal analysis (AA) is an unsupervised learning method for exploratory data analysis. One major challenge that limits the applicability of AA in practice is the inherent computational complexity of the existing algorithms. In this paper, we provide a novel approximation approach to partially address this issue. Utilizing probabilistic ideas from high-dimensional geometry, we introduce two preprocessing techniques to reduce the dimension and representation cardinality of the data, respectively. We prove that, provided the data are approximately embedded in a low-dimensional linear subspace and the convex hull of the corresponding representations is well approximated by a polytope with a few vertices, our method can effectively reduce the scaling of AA. Moreover, the solution of the reduced problem is near-optimal in terms of prediction errors. Our approach can be combined with other acceleration techniques to further mitigate the intrinsic complexity of AA. We demonstrate the usefulness of our results by applying our method to summarize several moderately large-scale datasets.
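
The abstract names two preprocessing steps (reducing the dimension, then reducing the representation cardinality) without giving their construction. The following is only a minimal sketch of that general idea, assuming a Gaussian random projection for the first step and random linear functionals that pick candidate extreme points for the second. Neither choice is confirmed by this record as the authors' method, and the function names, sizes, and synthetic data are hypothetical.

```python
import numpy as np


def random_projection(X, k, rng):
    """Map the rows of X (n x d) to k dimensions with a scaled Gaussian matrix.

    Assumption: a plain Gaussian sketch, chosen for illustration only.
    """
    d = X.shape[1]
    G = rng.standard_normal((d, k)) / np.sqrt(k)
    return X @ G


def extreme_point_candidates(Y, num_directions, rng):
    """Return indices of rows of Y that maximize random linear functionals.

    A maximizer of a linear functional over a finite point set lies on the
    boundary of its convex hull, so these rows are candidate hull vertices.
    """
    U = rng.standard_normal((Y.shape[1], num_directions))
    return np.unique(np.argmax(Y @ U, axis=0))


rng = np.random.default_rng(0)

# Synthetic data (hypothetical): convex combinations of 5 vertices lying in
# a 3-dimensional subspace of R^50, plus small noise.
n, d = 500, 50
basis = rng.standard_normal((3, d))
vertices = rng.standard_normal((5, 3))
weights = rng.dirichlet(np.ones(5), size=n)
X = weights @ vertices @ basis + 0.01 * rng.standard_normal((n, d))

Y = random_projection(X, k=10, rng=rng)                      # step 1: fewer columns
idx = extreme_point_candidates(Y, num_directions=200, rng=rng)
Y_reduced = Y[idx]                                           # step 2: fewer rows

print(X.shape, Y.shape, Y_reduced.shape)
```

An archetypal analysis solver would then be run on `Y_reduced` instead of `X`; how the reduced solution relates to the original problem is the subject of the paper's guarantees and is not reproduced here.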

Award ID(s): 1752202; 2136198

PAR ID: 10367051

Author(s) / Creator(s): Han, Ruijian; Osting, Braxton; Wang, Dong; Xu, Yiming

Publisher / Repository: Oxford University Press

Date Published: 2022-05-12

Journal Name: Information and Inference: A Journal of the IMA

Volume: 12

Issue: 1

ISSN: 2049-8772

Page Range / eLocation ID: p. 466-493

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Journal Article:
https://doi.org/10.1093/imaiai/iaac008
