Search for: All records

Creators/Authors contains: "Thomas, Philip"

« Prev Next »

Total Resources

16

Resource Type
Conference Paper

12

Conference Proceeding

0

Dataset

0

Journal Article

4

Workshop Report

0

Availability
Full Text / Resource Available

14

Citation Only

2

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Seldonian Toolkit: Building Software with Safe and Fair Machine Learning

Hoag, Austin ; Kostas, James E. ; Castro da Silva, Bruno ; Thomas, Philip S. ; Brun, Yuriy ( May 2023 , 2023 IEEE/ACM 45th International Conference on Software Engineering: Companion Proceedings)

Abstract—We present the Seldonian Toolkit, which enables software engineers to integrate provably safe and fair machine learning algorithms into their systems. Software systems that use data and machine learning are routinely deployed in a wide range of settings from medical applications, autonomous vehicles, the criminal justice system, and hiring processes. These systems, however, can produce unsafe and unfair behavior, such as suggesting potentially fatal medical treatments, making racist or sexist predictions, or facilitating radicalization and polarization. To reduce these undesirable behaviors, software engineers need the ability to easily integrate their machine- learning-based systems with domain-specific safety and fairness requirements defined by domain experts, such as doctors and hiring managers. The Seldonian Toolkit provides special machine learning algorithms that enable software engineers to incorporate such expert-defined requirements of safety and fairness into their systems, while provably guaranteeing those requirements will be satisfied. A video demonstrating the Seldonian Toolkit is available at https://youtu.be/wHR-hDm9jX4/.
more » « less
Free, publicly-accessible full text available May 14, 2024
Seldonian Toolkit: Building Software with Safe and Fair Machine Learning

https://doi.org/10.1109/ICSE-Companion58688.2023.00035

Hoag, Austin ; Kostas, James E. ; da Silva, Bruno Castro ; Thomas, Philip S. ; Brun, Yuriy ( May 2023 , Proceedings of the Demonstrations Track at the 45th International Conference on Software Engineering (ICSE))

Free, publicly-accessible full text available May 1, 2024
Mechanizing Soundness of Off-Policy Evaluation

https://doi.org/10.4230/LIPIcs.ITP.2022.32

Yeager, Jared ; Moss, J. Eliot ; Norrish, Michael ; Thomas, Philip S. ( August 2022 , Leibniz international proceedings in informatics)
Andronick, June ; de Moura, Leonardo (Ed.)
There are reinforcement learning scenarios - e.g., in medicine - where we are compelled to be as confident as possible that a policy change will result in an improvement before implementing it. In such scenarios, we can employ off-policy evaluation (OPE). The basic idea of OPE is to record histories of behaviors under the current policy, and then develop an estimate of the quality of a proposed new policy, seeing what the behavior would have been under the new policy. As we are evaluating the policy without actually using it, we have the "off-policy" of OPE. Applying a concentration inequality to the estimate, we derive a confidence interval for the expected quality of the new policy. If the confidence interval lies above that of the current policy, we can change policies with high confidence that we will do no harm. We focus here on the mathematics of this method, by mechanizing the soundness of off-policy evaluation. A natural side effect of the mechanization is both to clarify all the result’s mathematical assumptions and preconditions, and to further develop HOL4’s library of verified statistical mathematics, including concentration inequalities. Of more significance, the OPE method relies on importance sampling, whose soundness we prove using a measure-theoretic approach. In fact, we generalize the standard result, showing it for contexts comprising both discrete and continuous probability distributions.
more » « less
Full Text Available
Fairness Guarantees Under Demographic Shift

Giguere, Stephen ; Metevier, Blossom ; Brun, Yuriy ; Castro da Silva, Bruno ; Thomas, Philip ; Niekum, Scott ( April 2022 , International Conference on Learning Representations)

Full Text Available
Fairness Guarantees under Demographic Shift

Giguere, Stephen ; Metevier, Blossom ; Brun, Yuriy ; da Silva, Bruno Castro ; Thomas, Philip S. ; Niekum, Scott ( April 2022 , Proceedings of the 10th International Conference on Learning Representations (ICLR))

Recent studies found that using machine learning for social applications can lead to injustice in the form of racist, sexist, and otherwise unfair and discriminatory outcomes. To address this challenge, recent machine learning algorithms have been designed to limit the likelihood such unfair behavior occurs. However, these approaches typically assume the data used for training is representative of what will be encountered in deployment, which is often untrue. In particular, if certain subgroups of the population become more or less probable in deployment (a phenomenon we call demographic shift), prior work's fairness assurances are often invalid. In this paper, we consider the impact of demographic shift and present a class of algorithms, called Shifty algorithms, that provide high-confidence behavioral guarantees that hold under demographic shift when data from the deployment environment is unavailable during training. Shifty, the first technique of its kind, demonstrates an effective strategy for designing algorithms to overcome demographic shift's challenges. We evaluate Shifty using the UCI Adult Census dataset, as well as a real-world dataset of university entrance exams and subsequent student success. We show that the learned models avoid bias under demographic shift, unlike existing methods. Our experiments demonstrate that our algorithm's high-confidence fairness guarantees are valid in practice and that our algorithm is an effective tool for training models that are fair when demographic shift occurs.
more » « less
Full Text Available
Fairness Guarantees Under Demographic Shift

Giguere, Stephen ; Metevier, Blossom ; Castro da Silva, Bruno ; Brun, Yuriy ; Thomas, Philip ; Niekum, Scott ( January 2022 , International Conference on Learning Representations)

Full Text Available
Universal Off-Policy Evaluation

Chandak, Yash ; Niekum, Scott ; Castro da Silva, Bruno ; Learned-Miller, Erik ; Brunskill, Emma ; Thomas, Philip ( December 2021 , Advances in neural information processing systems)

When faced with sequential decision-making problems, it is often useful to be able to predict what would happen if decisions were made using a new policy. Those predictions must often be based on data collected under some previously used decision-making rule. Many previous methods enable such off-policy (or counterfactual) estimation of the expected value of a performance measure called the return. In this paper, we take the first steps towards a universal off-policy estimator (UnO)—one that provides off-policy estimates and high-confidence bounds for any parameter of the return distribution. We use UnO for estimating and simultaneously bounding the mean, variance, quantiles/median, inter-quantile range, CVaR, and the entire cumulative distribution of returns. Finally, we also discuss UnO’s applicability in various settings, including fully observable, partially observable (i.e., with unobserved confounders), Markovian, non-Markovian, stationary, smoothly non-stationary, and discrete distribution shifts.
more » « less
Full Text Available
Towards Practical Mean Bounds for Small Samples

Phan, My ; Thomas, Philip ; Learned-Miller, Erik ( July 2021 , Proceedings of Machine Learning Research)
null (Ed.)
Full Text Available
High Confidence Generalization for Reinforcement Learning

Kostas, James ; Chandak, Yash ; Jordan, Scott ; Theocharous, Georgios ; Thomas, Philip ( July 2021 , Proceedings of Machine Learning Research)
null (Ed.)
Full Text Available
High-Confidence Off-Policy (or Counterfactual) Variance Estimation

Chandak, Yash ; Shankar, Shiv ; Thomas, Philip ( April 2021 , Proceedings of the AAAI Conference on Artificial Intelligence)
null (Ed.)
Full Text Available

« Prev Next »