Diverse Data Selection under Fairness Constraints

Moumoulidou, Zafeiria; McGregor, Andrew

doi:10.4230/LIPIcs.ICDT.2021.13

Citation Details

Diverse Data Selection under Fairness Constraints

Diversity is an important principle in data selection and summarization, facility location, and recommendation systems. Our work focuses on maximizing diversity in data selection, while offering fairness guarantees. In particular, we offer the first study that augments the Max-Min diversification objective with fairness constraints. More specifically, given a universe 𝒰 of n elements that can be partitioned into m disjoint groups, we aim to retrieve a k-sized subset that maximizes the pairwise minimum distance within the set (diversity) and contains a pre-specified k_i number of elements from each group i (fairness). We show that this problem is NP-complete even in metric spaces, and we propose three novel algorithms, linear in n, that provide strong theoretical approximation guarantees for different values of m and k. Finally, we extend our algorithms and analysis to the case where groups can be overlapping. more »

Award ID(s):: 1763423 1943971 1453543

PAR ID:: 10287166

Author(s) / Creator(s):: Moumoulidou, Zafeiria; McGregor, Andrew

Date Published:: 2021-01-01

Journal Name:: International Conference on Database Theory

Volume:: 186

Page Range / eLocation ID:: 13:1--13:25

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.4230/LIPIcs.ICDT.2021.13

More Like this