On Trajectory Augmentations for Off‑Policy Evaluation
Introduces OAT (Offline with Augmented Trajectories), a generative sub-trajectory augmentation method designed to enhance off-policy evaluation accuracy. Experiments across robotics, healthcare, and e-learning show substantial performance gains over baselines.
more »
« less
- Award ID(s):
- 2013502
- PAR ID:
- 10609445
- Publisher / Repository:
- Proceedings of the Twelfth International Conference on Learning Representations / OpenReview (ICLR)
- Date Published:
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
No document suggestions found
An official website of the United States government

