NSF PAR Search | NSF Public Access Repository

Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

https://doi.org/10.1609/aaai.v39i15.33680

Agrawal, Rishabh; Dahlin, Nathan; Jain, Rahul; Nayyar, Ashutosh (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Imitation learning (IL) is notably effective for robotic tasks where directly programming behaviors or defining optimal control costs is challenging. In this work, we address a scenario where the imitator relies solely on observed behavior and cannot make environmental interactions during learning. It does not have additional supplementary datasets beyond the expert's dataset nor any information about the transition dynamics. Unlike state-of-the-art (SOTA) IL methods, this approach tackles the limitations of conventional IL by operating in a more constrained and realistic setting. Our method uses the Markov balance equation and introduces a novel conditional density estimation-based imitation learning framework. It employs conditional normalizing flows for transition dynamics estimation and aims at satisfying a balance equation for the environment. Through a series of numerical experiments on Classic Control and MuJoCo environments, we demonstrate consistently superior empirical performance compared to many SOTA IL algorithms.
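
For orientation, a balance condition of the kind the abstract mentions can be sketched as below. The notation is an assumption and does not come from the abstract: \pi denotes the imitator's policy, T the environment's transition density, and P_\pi the transition density of the Markov chain that the policy induces over state-action pairs. The paper's exact formulation may differ.

```latex
% Assumed notation (not from the abstract):
%   \pi(a' | s')          policy: density of the next action given the next state
%   T(s' | s, a)          environment transition density
%   P_\pi(s', a' | s, a)  induced transition density over state-action pairs
P_\pi(s', a' \mid s, a) \;=\; \pi(a' \mid s') \, T(s' \mid s, a)
```

Under this reading, both P_\pi and T would be estimated from the expert's demonstrations with conditional density estimators (the abstract names conditional normalizing flows for the dynamics), and the policy would be fit so that the two sides agree on the observed transitions.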
Free, publicly-accessible full text available April 11, 2026

Optimal Communication and Control Strategies in a Cooperative Multiagent MDP Problem

https://doi.org/10.1109/TAC.2024.3386454

Sudhakara, Sagar; Kartik, Dhruva; Jain, Rahul; Nayyar, Ashutosh (October 2024, IEEE Transactions on Automatic Control)

Full Text Available

A Bayesian Learning Algorithm for Unknown Zero-sum Stochastic Games with an Arbitrary Opponent

Jahromi, Mehdi J; Jain, Rahul; Nayyar, Ashutosh (July 2024, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:3880-3888, 2024)

Full Text Available

A Novel Point-Based Algorithm for Multi-Agent Control Using the Common Information Approach

https://doi.org/10.1109/CDC49753.2023.10383439

Tang, Dengwang; Nayyar, Ashutosh; Jain, Rahul (December 2023, IEEE)

Optimal Control of Logically Constrained Partially Observable and Multi-Agent Markov Decision Processes

https://doi.org/10.1109/TAC.2024.3422213

Kalagarla, Krishna C; Kartik, Dhruva; Shen, Dongming; Jain, Rahul; Nayyar, Ashutosh; Nuzzo, Pierluigi (January 2024, IEEE Transactions on Automatic Control)

Full Text Available

Ubi-TOUCH: Ubiquitous Tangible Object Utilization through Consistent Hand-object interaction in Augmented Reality

https://doi.org/10.1145/3586183.3606793

Jain, Rahul; Shi, Jingyu; Duan, Runlin; Zhu, Zhengzhe; Qian, Xun; Ramani, Karthik (October 2023, ACM)

Full Text Available

Interacting Objects: A Dataset of Object-Object Interactions for Richer Dynamic Scene Representations

https://doi.org/10.1109/LRA.2023.3332554

Unmesh, Asim; Jain, Rahul; Shi, Jingyu; Chaithanya Manam, V. K.; Chi, Hyung-Gun; Chidambaram, Subramanian; Quinn, Alexander; Ramani, Karthik (January 2024, IEEE Robotics and Automation Letters)

Full Text Available

One-Shot Quantum State Redistribution and Quantum Markov Chains

https://doi.org/10.1109/TIT.2023.3271316

Anshu, Anurag; Bab Hadiashar, Shima; Jain, Rahul; Nayak, Ashwin; Touchette, Dave (September 2023, IEEE Transactions on Information Theory)

Full Text Available

Compositional Planning for Logically Constrained Multi-Agent Markov Decision Processes

https://doi.org/10.1109/CDC56724.2024.10885812

Kalagarla, Krishna C; Low, Matthew; Jain, Rahul; Nayyar, Ashutosh; Nuzzo, Pierluigi (December 2024, IEEE)

Full Text Available

Online Learning for Cooperative Multi-Player Multi-Armed Bandits

https://doi.org/10.1109/CDC51059.2022.9992885

Chang, William; Jafarnia-Jahromi, Mehdi; Jain, Rahul (December 2022, IEEE)
