The Bayesian Prophet: A Low-Regret Framework for Online Decision Making

Vera, Alberto; Banerjee, Siddhartha

doi:10.1287/mnsc.2020.3624

We develop a new framework for designing online policies given access to an oracle providing statistical information about an off-line benchmark. Having access to such prediction oracles enables simple and natural Bayesian selection policies and raises the question as to how these policies perform in different settings. Our work makes two important contributions toward this question: First, we develop a general technique we call compensated coupling, which can be used to derive bounds on the expected regret (i.e., additive loss with respect to a benchmark) for any online policy and off-line benchmark. Second, using this technique, we show that a natural greedy policy, which we call the Bayes selector, has constant expected regret (i.e., independent of the number of arrivals and resource levels) for a large class of problems we refer to as “online allocation with finite types,” which includes widely studied online packing and online matching problems. Our results generalize and simplify several existing results for online packing and online matching and suggest a promising pathway for obtaining oracle-driven policies for other online decision-making settings. This paper was accepted by George Shanthikumar, big data analytics.
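To make the abstract's notions of an off-line benchmark, a prediction oracle, and additive regret concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a toy selection problem (accept at most `budget` of the arrivals, each drawn from a finite set of reward types), a Monte Carlo oracle that estimates the probability that the hindsight optimum keeps the current arrival, and an acceptance threshold of 1/2. All function names, parameters, and numbers are hypothetical and chosen only for illustration.

```python
import random


def offline_value(rewards, budget):
    """Hindsight benchmark: keep the `budget` largest rewards."""
    return sum(sorted(rewards, reverse=True)[:budget])


def accept_probability(current, remaining, budget, types, probs, rng, samples=200):
    """Monte Carlo estimate (assumed oracle) of the probability that the
    hindsight optimum would keep the current arrival."""
    if budget <= 0:
        return 0.0
    hits = 0
    for _ in range(samples):
        # Sample one possible future of `remaining` arrivals.
        future = rng.choices(types, weights=probs, k=remaining)
        # The current arrival is kept in hindsight if fewer than `budget`
        # future arrivals are strictly better than it.
        better = sum(1 for r in future if r > current)
        if better < budget:
            hits += 1
    return hits / samples


def run_policy(arrivals, budget, types, probs, rng):
    """Greedy rule: accept when the oracle says acceptance is at least as
    likely as rejection to agree with the hindsight optimum."""
    value, accepted, left = 0.0, 0, budget
    for t, reward in enumerate(arrivals):
        remaining = len(arrivals) - t - 1
        p = accept_probability(reward, remaining, left, types, probs, rng)
        if left > 0 and p >= 0.5:
            value += reward
            accepted += 1
            left -= 1
    return value, accepted


rng = random.Random(0)          # fixed seed for reproducibility
types = [1.0, 2.0, 5.0]         # hypothetical reward types
probs = [0.5, 0.3, 0.2]         # hypothetical arrival probabilities
horizon, budget = 40, 10

arrivals = rng.choices(types, weights=probs, k=horizon)
online_value, accepted = run_policy(arrivals, budget, types, probs, rng)
regret = offline_value(arrivals, budget) - online_value
print(f"accepted={accepted}, online={online_value}, regret={regret}")
```

Regret here is the additive gap between the hindsight value and the online value on a single sample path; averaging it over many sampled arrival sequences would approximate the expected regret that the abstract refers to.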

Award ID(s): 1847393 1955997 1839346

PAR ID: 10240305

Author(s) / Creator(s): Vera, Alberto; Banerjee, Siddhartha

Date Published: 2021-03-01

Journal Name: Management Science

Volume: 67

Issue: 3

ISSN: 0025-1909

Page Range / eLocation ID: 1368 to 1391

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.1287/mnsc.2020.3624
