Local Regularization of Noisy Point Clouds: Improved Global Geometric Estimates and Data Analysis

Garcia Trillos, N; Sanz-Alonso, D; Yang, R

Citation Details

Several data analysis techniques employ similarity relationships between data points to uncover the intrinsic dimension and geometric structure of the underlying data-generating mechanism. In this paper we work under the model assumption that the data is made of random perturbations of feature vectors lying on a low-dimensional manifold. We study two questions: how to define the similarity relationships over noisy data points, and what is the resulting impact of the choice of similarity in the extraction of global geometric information from the underlying manifold. We provide concrete mathematical evidence that using a local regularization of the noisy data to define the similarity improves the ap- proximation of the hidden Euclidean distance between unperturbed points. Furthermore, graph-based objects constructed with the locally regularized similarity function satisfy bet- ter error bounds in their recovery of global geometric ones. Our theory is supported by numerical experiments that demonstrate that the gain in geometric understanding facili- tated by local regularization translates into a gain in classification accuracy in simulated and real data. more »

Award ID(s):: 1912818

PAR ID:: 10155932

Author(s) / Creator(s):: Garcia Trillos, N; Sanz-Alonso, D; Yang, R

Date Published:: 2019-08-01

Journal Name:: Journal of machine learning research

Volume:: 20

ISSN:: 1532-4435

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this