Regulatory jurisdiction and policy coordination: A bi-level modeling approach for performance-based environmental policy
- Award ID(s):
- 1832683
- PAR ID:
- 10409981
- Date Published:
- Journal Name:
- Journal of the Operational Research Society
- Volume:
- 73
- Issue:
- 3
- ISSN:
- 0160-5682
- Page Range / eLocation ID:
- 509 to 524
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- Ravikumar, Pradeep (Ed.) We consider the task of evaluating a policy for a Markov decision process (MDP). The standard unbiased technique for evaluating a policy is to deploy the policy and observe its performance. We show that the data collected from deploying a different policy, commonly called the behavior policy, can be used to produce unbiased estimates with lower mean squared error than this standard technique. We derive an analytic expression for a minimal variance behavior policy, that is, a behavior policy that minimizes the mean squared error of the resulting estimates. Because this expression depends on terms that are unknown in practice, we propose a novel policy evaluation sub-problem, behavior policy search: searching for a behavior policy that reduces mean squared error. We present two behavior policy search algorithms and empirically demonstrate their effectiveness in lowering the mean squared error of policy performance estimates.
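The claim that a different behavior policy can give unbiased estimates with lower mean squared error can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a hypothetical single-step, two-action problem with deterministic rewards, and it uses the standard importance-sampling result that sampling actions in proportion to the target probability times the absolute reward minimizes variance. The names `is_estimate_stats`, `target`, `behavior`, and `rewards` are illustrative only.

```python
def is_estimate_stats(target, behavior, rewards):
    """Exact mean and variance of the one-sample importance-sampling estimate.

    Assumes a single-step problem with deterministic rewards (an illustrative
    simplification, not the general MDP setting of the abstract).
    """
    # Action a is drawn with probability behavior[a] and contributes the
    # importance-weighted return (target[a] / behavior[a]) * rewards[a].
    values = [t / b * r for t, b, r in zip(target, behavior, rewards)]
    mean = sum(b * v for b, v in zip(behavior, values))
    var = sum(b * (v - mean) ** 2 for b, v in zip(behavior, values))
    return mean, var


# Hypothetical target policy and rewards, chosen only for illustration.
target = [0.9, 0.1]
rewards = [1.0, 10.0]

# Standard technique: deploy the target policy itself (all weights equal 1).
on_policy = is_estimate_stats(target, target, rewards)

# Behavior policy proportional to target[a] * |rewards[a]|.
norm = sum(t * abs(r) for t, r in zip(target, rewards))
behavior = [t * abs(r) / norm for t, r in zip(target, rewards)]
off_policy = is_estimate_stats(target, behavior, rewards)

print("on-policy  (mean, variance):", on_policy)
print("off-policy (mean, variance):", off_policy)
```

Both estimators have the same mean, so both are unbiased, but in this toy case the reweighted behavior policy drives the variance to essentially zero. The behavior policy here is built from the true rewards, which are unknown in practice; that is the gap the abstract's behavior policy search is meant to address.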

