Title: Causal Graph Fuzzing for Fair ML Software Development
Machine learning (ML) is increasingly used in high-stakes areas such as autonomous driving, finance, and criminal justice. However, it often unintentionally perpetuates biases against marginalized groups. To address this, the software engineering community has developed fairness testing and debugging methods and established best practices for fair ML software. These practices focus on model training design, including the selection of sensitive and non-sensitive attributes and hyperparameter configuration. However, applying these practices across different socio-economic and cultural contexts is challenging, as societal constraints vary. Our study proposes a search-based software engineering approach to evaluate the robustness of these fairness practices. We formulate the practices as first-order logic properties and search for two neighborhood datasets such that a practice is satisfied in one dataset but fails in the other. Our key observation is that these practices should be general and robust to various uncertainties such as noise, faulty labeling, and demographic shifts. To generate datasets, we shift to causal graph representations of datasets and apply perturbations over the causal graphs to generate neighborhood datasets. In this short paper, we illustrate our methodology with an example of predicting risk in a car insurance application.
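The abstract only outlines the methodology, but the core loop can be sketched. The following is a minimal, hypothetical Python example; everything in it (the toy causal graph, the attribute names, the perturbed edge, and the demographic-parity predicate standing in for a "practice") is an illustrative assumption rather than the paper's implementation. It samples a small car-insurance-style dataset from a hand-written causal graph, perturbs one causal edge to obtain a neighborhood dataset simulating a demographic shift, and checks whether the encoded property holds on both datasets.

    # Hypothetical sketch of causal-graph fuzzing for a fairness practice.
    # The causal graph, attributes, and checked property are illustrative
    # assumptions, not taken from the paper.
    import numpy as np

    rng = np.random.default_rng(0)

    def sample_dataset(n, gender_income_edge):
        """Sample from a toy causal graph: gender -> income -> risk, age -> risk."""
        gender = rng.integers(0, 2, n)                    # sensitive attribute
        age = rng.normal(40, 12, n)                       # non-sensitive attribute
        income = 30 + gender_income_edge * gender + rng.normal(0, 10, n)
        score = 0.2 * (age - 40) - 0.3 * (income - 30) + rng.normal(0, 5, n)
        return gender, (score > 0).astype(int)            # (sensitive attribute, risk label)

    def practice_holds(gender, risk, tol=0.1):
        """A toy 'practice' written as a logical predicate: demographic parity within tol."""
        rates = [risk[gender == g].mean() for g in (0, 1)]
        return abs(rates[0] - rates[1]) <= tol

    # Original dataset and a perturbed neighborhood dataset: one causal edge is
    # strengthened, i.e., a shift in how the sensitive attribute influences income.
    g0, r0 = sample_dataset(5000, gender_income_edge=1.0)
    g1, r1 = sample_dataset(5000, gender_income_edge=10.0)

    if practice_holds(g0, r0) != practice_holds(g1, r1):
        print("Practice is not robust: it holds on one neighborhood dataset but not the other.")
    else:
        print("Practice holds (or fails) consistently on both neighborhood datasets.")

In the paper's formulation the property would be a first-order logic formula and the perturbations would be driven by a search procedure rather than chosen by hand; the sketch only conveys the shape of the robustness check.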
Award ID(s):
2317207
PAR ID:
10651337
Author(s) / Creator(s):
Publisher / Repository:
ACM
Date Published:
Page Range / eLocation ID:
402 to 403
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. In this paper, we propose a causal modeling approach to intersectional fairness, and a flexible, task-specific method for computing intersectionally fair rankings. Rankings are used in many contexts, ranging from Web search to college admissions, but causal inference for fair rankings has received limited attention. Additionally, the growing literature on causal fairness has directed little attention to intersectionality. By bringing these issues together in a formal causal framework, we make the application of intersectionality in algorithmic fairness explicit, connected to important real-world effects and domain knowledge, and transparent about technical limitations. We experimentally evaluate our approach on real and synthetic datasets, exploring its behavior under different structural assumptions.
  2. Data-driven software is increasingly being used as a critical component of automated decision-support systems. Since this class of software learns its logic from historical data, it can encode or amplify discriminatory practices. Previous research on algorithmic fairness has focused on improving “average-case” fairness. On the other hand, fairness at the extreme ends of the spectrum, which often signals lasting and impactful shifts in societal attitudes, has received significantly less emphasis. Leveraging the statistics of extreme value theory (EVT), we propose a novel fairness criterion called extreme counterfactual discrimination (ECD). This criterion estimates the worst-case amount of disadvantage in outcomes for individuals solely based on their membership in a protected group. Utilizing tools from search-based software engineering and generative AI, we present a randomized algorithm that samples a statistically significant set of points from the tail of ML outcome distributions even if the input dataset lacks a sufficient number of relevant samples. We conducted several experiments on four ML models (deep neural networks, logistic regression, and random forests) over 10 socially relevant tasks from the literature on algorithmic fairness. First, we evaluate the generative AI methods and find that they generate sufficient samples to infer a valid EVT distribution in 95% of cases. Remarkably, we found that prevalent bias mitigators reduce average-case discrimination but significantly increase worst-case discrimination in 35% of cases. We also observed that even the tail-aware mitigation algorithm MiniMax-Fairness increased worst-case discrimination in 30% of cases. We propose a novel ECD-based mitigator that improves fairness in the tail in 90% of cases with no degradation of average-case discrimination. We hope that the EVT framework serves as a robust tool for evaluating fairness in both the average case and the worst case.
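    The ECD criterion and mitigator are specific to that work, but the underlying EVT step, fitting a tail model to per-individual disadvantage scores and reading off a worst-case estimate, can be sketched. This is a hypothetical illustration in which the synthetic scores, the 95% tail threshold, and the 99.9% quantile are all assumptions, not values from the paper:

        # Illustrative peaks-over-threshold sketch: estimate a worst-case (tail)
        # discrimination level by fitting a generalized Pareto distribution to
        # exceedances over a high threshold. Data and levels are assumptions.
        import numpy as np
        from scipy import stats

        rng = np.random.default_rng(1)

        # Per-individual disadvantage scores attributable to protected-group
        # membership (e.g., counterfactual outcome gaps); here simply simulated.
        gaps = rng.gamma(shape=2.0, scale=0.05, size=20_000)

        threshold = np.quantile(gaps, 0.95)          # tail starts at the 95th percentile
        exceedances = gaps[gaps > threshold] - threshold

        # Generalized Pareto fit with the location fixed at zero.
        c, loc, scale = stats.genpareto.fit(exceedances, floc=0.0)

        # Worst-case estimate at an extreme quantile of the fitted tail model.
        p_tail = 0.999
        q = (p_tail - 0.95) / (1 - 0.95)             # quantile within the exceedance law
        worst_case = threshold + stats.genpareto.ppf(q, c, loc=0.0, scale=scale)

        print(f"average-case gap: {gaps.mean():.3f}")
        print(f"estimated worst-case gap (99.9th percentile): {worst_case:.3f}")

    The contrast between the two printed numbers is the point of the criterion: an intervention can improve the average while worsening the tail.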
  3. Fairness-aware machine learning has attracted a surge of attention in many domains, such as online advertising, personalized recommendation, and social media analysis in web applications. Fairness-aware machine learning aims to eliminate the biases of learning models against certain subgroups described by protected (sensitive) attributes such as race, gender, and age. Among many existing fairness notions, counterfactual fairness is a popular notion defined from a causal perspective. It measures the fairness of a predictor by comparing the prediction for each individual in the original world with that in counterfactual worlds in which the value of the sensitive attribute is modified. A prerequisite for existing methods to achieve counterfactual fairness is prior human knowledge of the causal model for the data. However, in real-world scenarios the underlying causal model is often unknown, and acquiring such knowledge can be very difficult. In these scenarios, it is risky to directly trust causal models obtained from information sources of unknown reliability, or even from causal discovery methods, as incorrect causal models can introduce biases into the predictor and lead to unfair predictions. In this work, we address the problem of counterfactually fair prediction from observational data without a given causal model by proposing a novel framework, CLAIRE. Specifically, under certain general assumptions, CLAIRE effectively mitigates the biases from the sensitive attribute with a representation learning framework based on counterfactual data augmentation and an invariant penalty. Experiments conducted on both synthetic and real-world datasets validate the superiority of CLAIRE in terms of both counterfactual fairness and prediction performance.
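    CLAIRE itself is not reproduced here, but the two ingredients the abstract names, counterfactual data augmentation and an invariance penalty, can be sketched in a generic form. The following hypothetical PyTorch snippet simply flips the sensitive attribute to build counterfactual copies and penalizes prediction differences between the copies; a faithful implementation would work in representation space and would not assume that flipping the attribute alone yields a valid counterfactual:

        # Generic sketch of (1) counterfactual data augmentation by flipping the
        # sensitive attribute and (2) an invariance penalty tying predictions on
        # original and flipped copies. Model, data, and weights are assumptions.
        import torch
        import torch.nn as nn

        torch.manual_seed(0)
        n, d = 1000, 5
        x = torch.randn(n, d)                        # non-sensitive features
        s = torch.randint(0, 2, (n, 1)).float()      # sensitive attribute
        y = ((x[:, :1] + 0.5 * s + 0.1 * torch.randn(n, 1)) > 0).float()

        model = nn.Sequential(nn.Linear(d + 1, 16), nn.ReLU(), nn.Linear(16, 1))
        opt = torch.optim.Adam(model.parameters(), lr=1e-2)
        bce = nn.BCEWithLogitsLoss()
        lam = 1.0                                    # weight of the invariance penalty

        for _ in range(200):
            opt.zero_grad()
            logits = model(torch.cat([x, s], dim=1))
            logits_cf = model(torch.cat([x, 1 - s], dim=1))   # counterfactual copies
            loss = bce(logits, y) + lam * (logits - logits_cf).pow(2).mean()
            loss.backward()
            opt.step()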
  4. Extensive efforts have been made to understand and improve the fairness of machine learning models based on observational metrics, especially in high-stakes domains such as medical insurance, education, and hiring decisions. However, there is a lack of certified fairness that considers the end-to-end performance of an ML model. In this paper, we first formulate the certified fairness of an ML model trained on a given data distribution as an optimization problem based on the model's performance loss bound on a fairness-constrained distribution, which lies within a bounded distributional distance of the training distribution. We then propose a general fairness certification framework and instantiate it for both sensitive-shifting and general-shifting scenarios. In particular, we propose to solve the optimization problem by decomposing the original data distribution into analytical subpopulations and proving the convexity of the subproblems in order to solve them. We evaluate our certified fairness on six real-world datasets and show that our certification is tight in the sensitive-shifting scenario and provides non-trivial certification under general shifting. Our framework can flexibly integrate additional non-skewness constraints, and we show that it then provides even tighter certification under different real-world scenarios. We also compare our certified fairness bound with adapted existing distributional robustness bounds on Gaussian data and demonstrate that our method is significantly tighter.
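    As a reading aid, the certification problem described above can be written schematically; the notation here is ours, not the paper's:

        \max_{Q} \; \mathbb{E}_{(x,y)\sim Q}\,[\ell(f_\theta(x), y)]
        \quad \text{s.t.} \quad D(Q, P_{\mathrm{train}}) \le \rho, \qquad Q \in \mathcal{F}_{\mathrm{fair}},

    i.e., certified fairness is an upper bound on the model's loss over every fairness-constrained distribution Q within distributional distance \rho of the training distribution P_{\mathrm{train}}; the decomposition into analytical subpopulations is what makes this worst-case optimization tractable.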
  5. A recent trend in fair machine learning is to define fairness through causality-based notions, which concern the causal connection between protected attributes and decisions. However, a common challenge for all causality-based fairness notions is identifiability, i.e., whether they can be uniquely measured from observational data, which is a critical barrier to applying these notions in real-world situations. In this paper, we develop a framework for measuring different causality-based fairness notions. We propose a unified definition that covers most previous causality-based fairness notions, namely path-specific counterfactual fairness (PC fairness). Based on that, we propose a general method, in the form of a constrained optimization problem, for bounding path-specific counterfactual fairness in all unidentifiable situations. Experiments on synthetic and real-world datasets show the correctness and effectiveness of our method.
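    Schematically, and in our notation rather than the paper's, the bounding step can be read as optimizing the path-specific counterfactual effect over all causal models consistent with the observed data:

        \text{bounds} \;=\; \min_{M \in \mathcal{M}(P_{\mathrm{obs}})} \Big/ \max_{M \in \mathcal{M}(P_{\mathrm{obs}})} \; \mathrm{PCE}^{M}_{\pi}(s_1, s_0 \mid \mathbf{o}),

    where \mathcal{M}(P_{\mathrm{obs}}) is the set of structural causal models consistent with the observational distribution, \pi is the set of unfair paths, and \mathrm{PCE}^{M}_{\pi} is the path-specific counterfactual effect of changing the protected attribute from s_0 to s_1 under model M. When the effect is identifiable the two extrema coincide; otherwise the constrained optimization yields an interval.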