NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Beyond Marginals: Capturing Correlated Returns through Joint Distributional Reinforcement Learning

Kaya, Ege; Ghasemi, Mahsa; Hashemi, Abolfazl (December 2025, 2nd Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET 2025).)

Free, publicly-accessible full text available December 2, 2026
Identification of Average Outcome under Interventions in Confounded Additive Noise Models

Elahi, Muhammad Qasim; Ghasemi, Mahsa; Kocaoglu, Murat (November 2025, Transactions on Machine Learning Research)

Free, publicly-accessible full text available November 13, 2026
Partial Structure Discovery is Sufficient for No-regret Learning in Causal Bandits

Elahi, Muhammad_Qasim; Ghasemi, Mahsa; Kocaoglu, Murat (September 2024, The Thirty-Eighth Annual Conference on Neural Information Processing Systems)

Full Text Available
Adaptive Online Experimental Design for Causal Discovery

Elahi, Muhammad_Qasim; Wei, Lai; Kocaoglu, Murat; Ghasemi, Mahsa (May 2024, Proceedings of the 41st International Conference on Machine Learning)

Full Text Available
Role of Resilience in Cyber-Physical Systems: A Roundtable Panel

https://doi.org/10.1109/MC.2024.3394108

Bagchi, Saurabh; Ghasemi, Mahsa; Shin, Kang G; Venkatasubramanian, Nalini; Xu, Dongyan; Zonouz, Saman (July 2024, Computer)

The panel was held on 14 November 2023 at Purdue University as part of a Grand Challenges in Resilience Workshop sponsored by the U.S. National Science Foundation and organized by our center, the Center for Resilient Infrastructures, Systems, and Processes (CRISP).
more » « less
Full Text Available
No-Regret Learning in Dynamic Stackelberg Games

https://doi.org/10.1109/TAC.2023.3330797

Lauffer, Niklas; Ghasemi, Mahsa; Hashemi, Abolfazl; Savas, Yagiz; Topcu, Ufuk (March 2024, IEEE Transactions on Automatic Control)

In a Stackelberg game, a leader commits to a randomized strategy and a follower chooses their best strategy in response. We consider an extension of a standard Stackelberg game, called a discrete-time dynamic Stackelberg game, that has an underlying state space that affects the leader’s rewards and available strategies and evolves in a Markovian manner depending on both the leader and follower’s selected trategies. Although standard Stackelberg games have been utilized to improve scheduling in security domains, their deployment is often limited by requiring complete information of the follower’s utility function. In contrast, we consider scenarios where the follower’s utility function is unknown to the leader; however, it can be linearly parameterized. Our objective is then to provide an algorithm that prescribes a randomized strategy to the leader at each step of the game based on observations of how the follower responded in previous steps. We design an online learning algorithm that, with high probability, is no-regret, i.e., achieves a regret bound (when compared to the best policy in hindsight), which is sublinear in the number of time steps; the degree of sublinearity depends on the number of features representing the follower’s utility function. The regret of the proposed learning algorithm is independent of the size of the state space and polynomial in the rest of the parameters of the game. We show that the proposed learning algorithm outperforms existing model-free reinforcement learning approaches.
more » « less
Full Text Available
On the Complexity and Approximability of Optimal Sensor Selection for Mixed-Observable Markov Decision Processes

https://doi.org/10.23919/ACC55779.2023.10156299

Bhargav, Jayanth; Ghasemi, Mahsa; Sundaram, Shreyas (May 2023, 2023 American Control Conference (ACC))

Full Text Available
Formal Methods for Autonomous Systems

https://doi.org/10.1561/2600000029

Wongpiromsarn, Tichakorn; Ghasemi, Mahsa; Cubuktepe, Murat; Bakirtzis, Georgios; Carr, Steven; Karabag, Mustafa O.; Neary, Cyrus; Gohari, Parham; Topcu, Ufuk (September 2023, Foundations and Trends® in Systems and Control)

Full Text Available
Randomized Greedy Sensor Selection: Leveraging Weak Submodularity

https://doi.org/10.1109/TAC.2020.2980924

Hashemi, Abolfazl; Ghasemi, Mahsa; Vikalo, Haris; Topcu, Ufuk (January 2021, IEEE Transactions on Automatic Control)
null (Ed.)
Full Text Available
A Barrier Pair Method for Safe Human-Robot Shared Autonomy

He, Binghan; Ghasemi, Mahsa; Topcu, Ufuk; and Sentis, Luis (January 2021, IEEE Conference on Decision and Control)
null (Ed.)
Shared autonomy provides a framework where a human and an automated system, such as a robot, jointly control the system’s behavior, enabling an effective solution for various applications, including human-robot interaction and remote operation of a semi-autonomous system. However, a challenging problem in shared autonomy is safety because the human input may be unknown and unpredictable, which affects the robot’s safety constraints. If the human input is a force applied through physical contact with the robot, it also alters the robot’s behavior to maintain safety. We address the safety issue of shared autonomy in real-time applications by proposing a two-layer control framework. In the first layer, we use the history of human input measurements to infer what the human wants the robot to do and define the robot’s safety constraints according to that inference. In the second layer, we formulate a rapidly-exploring random tree of barrier pairs, with each barrier pair composed of a barrier function and a controller. Using the controllers in these barrier pairs, the robot is able to maintain its safe operation under the intervention from the human input. This proposed control framework allows the robot to assist the human while preventing them from encountering safety issues. We demonstrate the proposed control framework on a simulation of a two-linkage manipulator robot.
more » « less
Full Text Available

« Prev Next »

Search for: All records