No Dimensional Sampling Coresets for Classification

Alishahi, Meysam; Phillips, Jeff M


We refine and generalize what is known about coresets for classification problems via the sensitivity sampling framework. Such coresets seek the smallest possible subsets of input data, so that one can optimize a loss function on the coreset and ensure approximation guarantees with respect to the original data. Our analysis provides the first no dimensional coresets, meaning the size does not depend on the dimension. Moreover, our results are general: they apply to distributional input and can use iid samples, so they provide sample complexity bounds, and they work for a variety of loss functions. A key tool we develop is a Rademacher complexity version of the main sensitivity sampling approach, which may be of independent interest.
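As a rough illustration of the generic sensitivity sampling idea the abstract refers to, the following minimal Python sketch draws a weighted subset of the data and compares the weighted loss against the full loss. It is not the authors' algorithm or their bounds: the function names (`logistic_loss`, `sensitivity_sample`, `coreset_loss`, `full_loss`), the choice of logistic loss, and especially the placeholder scores (a uniform term plus a norm-based term) are assumptions made here for illustration only.

```python
import math
import random

def logistic_loss(w, x, y):
    """Logistic loss of linear classifier w on point x with label y in {-1, +1}."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    # log(1 + exp(-margin)), written to avoid overflow for large |margin|
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))

def full_loss(w, X, Y):
    """Average loss over the whole data set."""
    return sum(logistic_loss(w, x, y) for x, y in zip(X, Y)) / len(X)

def sensitivity_sample(scores, m, rng):
    """Draw m indices iid with probability proportional to `scores`.

    Returns (index, weight) pairs. The weight 1 / (n * m * p_i) makes the
    weighted sum an unbiased estimate of the average loss.
    """
    n = len(scores)
    total = sum(scores)
    probs = [s / total for s in scores]
    idx = rng.choices(range(n), weights=probs, k=m)
    return [(i, 1.0 / (n * m * probs[i])) for i in idx]

def coreset_loss(w, X, Y, coreset):
    """Weighted loss evaluated only on the sampled points."""
    return sum(u * logistic_loss(w, X[i], Y[i]) for i, u in coreset)

# Usage on synthetic data (all values below are illustrative assumptions).
rng = random.Random(0)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(1000)]
Y = [1 if x[0] + 0.5 * x[1] > 0 else -1 for x in X]

# Placeholder scores: uniform term plus a norm-based term. This is NOT a
# proven sensitivity upper bound, only a stand-in for one.
norms = [math.hypot(*x) for x in X]
scores = [1.0 / len(X) + nx / sum(norms) for nx in norms]

coreset = sensitivity_sample(scores, 100, rng)
w = [1.0, 0.5]
print("full loss:   ", full_loss(w, X, Y))
print("coreset loss:", coreset_loss(w, X, Y, coreset))
```

With uniform scores this reduces to plain iid sampling with weight 1/m per point; non-uniform scores shift the sample toward points that can contribute more to the loss, which is the intuition behind sensitivity-based coresets.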

Award ID(s): 2115677

PAR ID: 10537703

Author(s) / Creator(s): Alishahi, Meysam; Phillips, Jeff M

Publisher / Repository: Proceedings of the 41st International Conference on Machine Learning (ICML)

Date Published: 2024-07-22

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
The DOI is not currently available.
