Group Fairness in Bandits with Biased Feedback

Schumann, Candice; Lang, Zhi; Mattei, Nicholas; Dickerson, John P.

Citation Details

We propose a novel formulation of group fairness with biased feedback in the contextual multi-armed bandit (CMAB) setting. In the CMAB setting, a sequential decision maker must, at each time step, choose an arm to pull from a finite set of arms after observing some context for each of the potential arm pulls. In our model, arms are partitioned into two or more sensitive groups based on some protected feature(s) (e.g., age, race, or socio-economic status). Initial rewards received from pulling an arm may be distorted due to some unknown societal or measurement bias. We assume that in reality these groups are equal despite the biased feedback received by the agent. To alleviate this, we learn a societal bias term which can be used to both find the source of bias and to potentially fix the problem outside of the algorithm. We provide a novel algorithm that can accommodate this notion of fairness for an arbitrary number of groups, and provide a theoretical bound on the regret for our algorithm. We validate our algorithm using synthetic data and two real-world datasets for intervention settings wherein we want to allocate resources fairly across groups. more »

Award ID(s):: 2007955

PAR ID:: 10386118

Author(s) / Creator(s):: Schumann, Candice; Lang, Zhi; Mattei, Nicholas; Dickerson, John P.

Date Published:: 2022-05-09

Journal Name:: 21st International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2022

Page Range / eLocation ID:: 1155-1163

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this