Scalable Spectral Clustering with Group Fairness Constraints

Ji Wang; Ding Lu; Ian Davidson; Zhaojun Bai

Citation Details

There are synergies of research interests and industrial efforts in modeling fairness and correcting algorithmic bias in machine learning. In this paper, we present a scalable algorithm for spectral clustering (SC) with group fairness constraints. Group fairness is also known as statistical parity where in each cluster, each protected group is represented with the same proportion as in the entirety. While FairSC algorithm (Kleindessner et al., 2019) is able to find the fairer clustering, it is compromised by high computational costs due to the algorithm’s kernels of computing nullspaces and the square roots of dense matrices explicitly. We present a new formulation of the underlying spectral computation of FairSC by incorporating nullspace projection and Hotelling’s deflation such that the resulting algorithm, called s-FairSC, only involves the sparse matrix-vector products and is able to fully exploit the sparsity of the fair SC model. The experimental results on the modified stochastic block model demonstrate that while it is comparable with FairSC in recovering fair clustering, s-FairSC is 12× faster than FairSC for moderate model sizes. s-FairSC is further demonstrated to be scalable in the sense that the computational costs of s-FairSC only increase marginally compared to the SC without fairness constraints. more »

Award ID(s):: 1913364

PAR ID:: 10482970

Author(s) / Creator(s):: Ji Wang; Ding Lu; Ian Davidson; Zhaojun Bai

Editor(s):: Ruiz, Francisco; Dy, Jennife; van de Meent, Jan-Willem

Publisher / Repository:: PMLR

Date Published:: 2023-04-25

Journal Name:: Proceedings of The 26th International Conference on Artificial Intelligence and Statistics

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this