Rd-dpp: Rate-distortion theory meets determinantal point process to diversify learning data samples

Chen, Xiwen; Li, Huayu; Qiu, Peijie; Zhu, Wenhui; Amin, Rahul; Razi, Abolfazl

Citation Details

Selecting representative samples plays an indispensable role in many machine learning and computer vision applications under limited resources (e.g., limited communication bandwidth and computational power). Determinantal Point Process (DPP) is a widely used method for selecting the most diverse representative samples that can summarize a dataset. However, its adaptability to different tasks remains an open challenge, as it is challenging for DPP to perform task-specific tuning. In contrast, Rate-Distortion (RD) theory provides a way to measure task-specific diversity. However, optimizing RD for a data selection problem remains challenging because the quantity that needs to be optimized is the index set of the selected samples. To tackle these challenges, we first draw an inherent relationship between DPP and RD theory. Our theoretical derivation paves the way for taking advantage of both RD and DPP for a task-specific data selection. To this end, we propose a novel method for task-specific data selection for multi-level classification tasks, named RD-DPP. Empirical studies on seven different datasets using five benchmark models demonstrate the effectiveness of the proposed RD-DPP method. Our method also outperforms recent strong competing methods, while exhibiting high generalizability to a variety of learning tasks. more »

Award ID(s):: 2204721

PAR ID:: 10542194

Author(s) / Creator(s):: Chen, Xiwen; Li, Huayu; Qiu, Peijie; Zhu, Wenhui; Amin, Rahul; Razi, Abolfazl

Publisher / Repository:: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Date Published:: 2024-08-20

ISSN:: 2472-6737

Format(s):: Medium: X

Location:: Tucson, Arizona

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this