NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Hypothesis Testing-based Cyber Deception Framework under Sequential Reconnaissance

Kumar, Abhinav; Brahma, Swastik; Kamhoua, Charles A (March 2025, Annual Conference on Information Sciences and Systems (CISS))

Free, publicly-accessible full text available March 19, 2026
Adaptive Incentive Design for Markov Decision Processes with Unknown Rewards

MA, Haoxiang; Han, Shuo; Hemida, Ahmed; Kamhoua, Charles; Fu, Jie (March 2025, Transactions on machine learning research)

Free, publicly-accessible full text available March 28, 2026
Adaptive Incentive Design for Markov Decision Processes with Unknown Rewards

Ma, Haoxiang; Han, Shuo; Hemida, Ahmed; Kamhoua, Charles A; Fu, Jie (March 2025, Transactions on Machine Learning Research)
Poupart, Pascal (Ed.)
Incentive design, also known as model design or environment design for Markov decision processes(MDPs), refers to a class of problems in which a leader can incentivize his follower by modifying the follower's reward function, in anticipation that the follower's optimal policy in the resulting MDP can be desirable for the leader's objective. In this work, we propose gradient-ascent algorithms to compute the leader's optimal incentive design, despite the lack of knowledge about the follower's reward function. First, we formulate the incentive design problem as a bi-level optimization problem and demonstrate that, by the softmax temporal consistency between the follower's policy and value function, the bi-level optimization problem can be reduced to single-level optimization, for which a gradient-based algorithm can be developed to optimize the leader's objective. We establish several key properties of incentive design in MDPs and prove the convergence of the proposed gradient-based method. Next, we show that the gradient terms can be estimated from observations of the follower's best response policy, enabling the use of a stochastic gradient-ascent algorithm to compute a locally optimal incentive design without knowing or learning the follower's reward function. Finally, we analyze the conditions under which an incentive design remains optimal for two different rewards which are policy invariant. The effectiveness of the proposed algorithm is demonstrated using a small probabilistic transition system and a stochastic gridworld.
more » « less
Free, publicly-accessible full text available March 28, 2026
A Hypothesis Testing-based Framework for Cyber Deception with Sludging

https://doi.org/10.1109/Allerton63246.2024.10735267

Kumar, Abhinav; Brahma, Swastik; Geng, Baocheng; Kamhoua, Charles A; Varshney, Pramod K (September 2024, 2024 60th Annual Allerton Conference on Communication, Control, and Computing)

Full Text Available
Optimal Resource Allocation for Proactive Defense with Deception in Probabilistic Attack Graphs

https://doi.org/10.1007/978-3-031-50670-3_11

Ma, Haoxiang; Han, Shuo; Kamhoua, Charles; Fu, Jie (December 2023, Decision and Game Theory for Security)
Fu, J. (Ed.)
Full Text Available
Optimizing Honeypot Placement Strategies with Graph Neural Networks for Enhanced Resilience via Cyber Deception

https://doi.org/10.1145/3630049.3630169

Osman, Mohamed; Nadeem, Tamer; Hemida, Ahmed; Kamhoua, Charles (December 2023, ACM)

Full Text Available
IoTFlowGenerator: Crafting Synthetic IoT Device Traffic Flows for Cyber Deception

https://doi.org/10.32473/flairs.36.133376

Bao, Joseph; Kantarcioglu, Murat; Vorobeychik, Yevgeniy; Kamhoua, Charles (May 2023, The International FLAIRS Conference Proceedings)

Over the years, honeypots emerged as an important security tool to understand attacker intent and deceive attackers to spend time and resources. Recently, honeypots are being deployed for Internet of things (IoT) devices to lure attackers, and learn their behavior. However, most of the existing IoT honeypots, even the high interaction ones, are easily detected by an attacker who can observe honeypot traffic due to lack of real network traffic originating from the honeypot. This implies that, to build better honeypots and enhance cyber deception capabilities, IoT honeypots need to generate realistic network traffic flows. To achieve this goal, we propose a novel deep learning based approach for generating traffic flows that mimic real network traffic due to user and IoT device interactions.A key technical challenge that our approach overcomes is scarcity of device-specific IoT traffic data to effectively train a generator.We address this challenge by leveraging a core generative adversarial learning algorithm for sequences along with domain specific knowledge common to IoT devices.Through an extensive experimental evaluation with 18 IoT devices, we demonstrate that the proposed synthetic IoT traffic generation tool significantly outperforms state of the art sequence and packet generators in remaining indistinguishable from real traffic even to an adaptive attacker.
more » « less
Full Text Available
Optimizing Sensor Allocation Against Attackers With Uncertain Intentions: A Worst-Case Regret Minimization Approach

https://doi.org/10.1109/LCSYS.2023.3290489

Ma, Haoxiang; Han, Shuo; Kamhoua, Charles A.; Fu, Jie (January 2023, IEEE Control Systems Letters)

This letter focuses on the optimal allocation of multi-stage attacks with the uncertainty in attacker’s intention. We model the attack planning problem using a Markov decision process and characterize the uncertainty in the attacker’s intention using a finite set of reward functions—each reward represents a type of attacker. Based on this modeling, we employ the paradigm of the worst-case absolute regret minimization from robust game theory and develop mixed-integer linear program (MILP) formulations for solving the worst-case regret minimizing sensor allocation strategies for two classes of attack-defend interactions: one where the defender and attacker engage in a zero-sum game and another where they engage in a non-zero-sum game. We demonstrate the effectiveness of our algorithm using a stochastic gridworld example.
more » « less
Full Text Available
Game and Prospect Theoretic Hardware Trojan Testing

Nan, Satyaki; Njilla, Laurent; Brahma, Swastik; Kamhoua, Charles A. (January 2023, Annual Conference on Information Sciences and Systems)

Full Text Available
Designing a supervised feature selection technique for mixed attribute data analysis

https://doi.org/10.1016/j.mlwa.2022.100431

Jeong, Dong Hyun; Jeong, Bong Keun; Leslie, Nandi; Kamhoua, Charles; Ji, Soo-Yeon (December 2022, Machine Learning with Applications)

Full Text Available

« Prev Next »

Search for: All records