In this paper, a distributed swarm control problem is studied for large-scale multi-agent systems (LS-MASs). Different than classical multi-agent systems, an LS-MAS brings new challenges to control design due to its large number of agents. It might be more difficult for developing the appropriate control to achieve complicated missions such as collective swarming. To address these challenges, a novel mixed game theory is developed with a hierarchical learning algorithm. In the mixed game, the LS-MAS is represented as a multi-group, large-scale leader–follower system. Then, a cooperative game is used to formulate the distributed swarm control for multi-group leaders, and a Stackelberg game is utilized to couple the leaders and their large-scale followers effectively. Using the interaction between leaders and followers, the mean field game is used to continue the collective swarm behavior from leaders to followers smoothly without raising the computational complexity or communication traffic. Moreover, a hierarchical learning algorithm is designed to learn the intelligent optimal distributed swarm control for multi-group leader–follower systems. Specifically, a multi-agent actor–critic algorithm is developed for obtaining the distributed optimal swarm control for multi-group leaders first. Furthermore, an actor–critic–mass method is designed to find the decentralized swarm control for large-scale followers. Eventually, a series of numerical simulations and a Lyapunov stability proof of the closed-loop system are conducted to demonstrate the performance of the developed scheme.
more »
« less
This content will become publicly available on May 14, 2026
Co-Regulated Hierarchical Reinforcement Learning for Uncrewed Aircraft System Swarms
Deploying decentralized control strategies for outdoor multi-agent Uncrewed Aircraft Systems (UASs) is challenging due to timing variations, packet loss, and computing resource limitations. In this work we address robustness to these conditions through a novel co-regulated control strategy that varies the periodicity of control inputs and communication with other agents. Co-regulation is applied to a decentralized hierarchical controller consisting of a global component governing inter-group coordination to multiple targets while a local component governs intra-group coordination of the agents as they progress to the target of interest. The control gains are “gain scheduled” according to current conditions while a cyber controller schedules the control and communication tasks for execution based on swarm performance. The control gains are found via reinforcement learning and the entire algorithm is deployed on a swarm consisting of 7 custom agents. Our results show the impact of rethinking swarming algorithms with computation and communication resource limitations in mind and indicate we can provide exceptional swarm control utilizing fewer resources while also improving the quality of service for an onboard, anytime collision avoidance algorithm.
more »
« less
- PAR ID:
- 10596596
- Publisher / Repository:
- IEEE
- Date Published:
- ISBN:
- 979-8-3315-1328-3
- Page Range / eLocation ID:
- 1002 to 1010
- Subject(s) / Keyword(s):
- swarms, unmanned aircraft vehicles, control, decentralized
- Format(s):
- Medium: X
- Location:
- Charlotte, NC, USA
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The rise of mobile multi-agent robotic platforms is outpacing control paradigms for tasks that require operating in complex, realistic environments. To leverage inertial, energetic, and cost bene fits of small-scale robots, critical future applications may depend on coordinating large numbers of agents with minimal onboard sensing and communication resources. In this article, we present the perspective that adaptive and resilient autonomous control of swarms of minimal agents might follow from a direct analogy with the neural circuits of spatial cognition in rodents. We focus on spatial neurons such as place cells found in the hippocampus. Two major emergent hippocampal phenomena, self-stabilizing attractor maps and temporal organization by shared oscillations, reveal theoretical solutions for decentralized self-organization and distributed communication in the brain. We consider that autonomous swarms of minimal agents with low-bandwidth communication are analogous to brain circuits of oscillatory neurons with spike-based propagation of information. The resulting notion of `neural swarm control' has the potential to be scalable, adaptive to dynamic environments, and resilient to communication failures and agent attrition. We illustrate a path toward extending this analogy into multi-agent systems applications and discuss implications for advances in decentralized swarm control.more » « less
-
null (Ed.)Intelligent utilization of resources and improved mission performance in an autonomous agent require consideration of cyber and physical resources. The allocation of these resources becomes more complex when the system expands from one agent to multiple agents, and the control shifts from centralized to decentralized. Consensus is a distributed algorithm that lets multiple agents agree on a shared value, but typically does not leverage mobility. We propose a coupled consensus control strategy that co-regulates computation, communication frequency, and connectivity of the agents to achieve faster convergence times at lower communication rates and computational costs. In this strategy, agents move towards a common location to increase connectivity. Simultaneously, the communication frequency is increased when the shared state error between an agent and its connected neighbors is high. When the shared state converges (i.e., consensus is reached), the agents withdraw to the initial positions and the communication frequency is decreased. Convergence properties of our algorithm are demonstrated under the proposed co-regulated control algorithm. We evaluated the proposed approach through a new set of cyber-physical, multi-agent metrics and demonstrated our approach in a simulation of unmanned aircraft systems measuring temperatures at multiple sites. The results demonstrate that, compared with fixed-rate and event-triggered consensus algorithms, our co-regulation scheme can achieve improved performance with fewer resources, while maintaining high reactivity to changes in the environment and system.more » « less
-
We study the performance of a decentralized integral control scheme for joint power grid frequency regulation and economic dispatch. We show that by properly designing the controller gains, after a power flow perturbation, the control achieves near-optimal economic dispatch while recovering the nominal frequency, without requiring any communication. We quantify the gap between the controllable power generation cost under the decentralized control scheme and the optimal cost, based on the DC power flow model. Moreover, we study the tradeoff between the cost and the convergence time, by adjusting parameters of the control scheme. Communication between generators reduces the convergence time. We identify key communication links whose failures have more significant impacts on the performance of a distributed power grid control scheme that requires information exchange between neighbors.more » « less
-
We study the performance of a decentralized inte- gral control scheme for joint power grid frequency regulation and economic dispatch. We show that by properly designing the controller gains, after a power flow perturbation, the control achieves near-optimal economic dispatch while recovering the nominal frequency, without requiring any communication. We quantify the gap between the controllable power generation cost under the decentralized control scheme and the optimal cost, based on the DC power flow model. Moreover, we study the tradeoff between the cost and the convergence time, by adjusting parameters of the control scheme. Communication between generators reduces the convergence time. We identify key communication links whose failures have more significant impacts on the performance of a distributed power grid control scheme that requires information exchange between neighbors.more » « less
An official website of the United States government
