In this paper, a distributed swarm control problem is studied for large-scale multi-agent systems (LS-MASs). Different than classical multi-agent systems, an LS-MAS brings new challenges to control design due to its large number of agents. It might be more difficult for developing the appropriate control to achieve complicated missions such as collective swarming. To address these challenges, a novel mixed game theory is developed with a hierarchical learning algorithm. In the mixed game, the LS-MAS is represented as a multi-group, large-scale leader–follower system. Then, a cooperative game is used to formulate the distributed swarm control for multi-group leaders, and a Stackelberg game is utilized to couple the leaders and their large-scale followers effectively. Using the interaction between leaders and followers, the mean field game is used to continue the collective swarm behavior from leaders to followers smoothly without raising the computational complexity or communication traffic. Moreover, a hierarchical learning algorithm is designed to learn the intelligent optimal distributed swarm control for multi-group leader–follower systems. Specifically, a multi-agent actor–critic algorithm is developed for obtaining the distributed optimal swarm control for multi-group leaders first. Furthermore, an actor–critic–mass method is designed to find the decentralized swarm control for large-scale followers. Eventually, a series of numerical simulations and a Lyapunov stability proof of the closed-loop system are conducted to demonstrate the performance of the developed scheme.
more »
« less
Density-Driven Multi-Agent Swarm Control with Potential Field Based Collision Avoidance
In this paper, a density-driven multi-agent swarm control problem is investigated. Robot swarms can provide a great benefit, especially for applications where a single robot cannot effectively achieve a given task. For large spatial-scale applications such as search and rescue, environmental monitoring, and surveillance, a new multi-agent swarm control strategy is necessary because of physical constraints including a robot number and operation time. This paper provides a novel density-driven swarm control strategy for multi-agent systems based on the Optimal Transport theory, to cover a spacious domain with limited resources. In such a scenario, \textit{efficiency} will likely be a key point in achieving an efficient robot swarm behavior rather than uniform coverage that might be infeasible. With the given reference density, pre-constructed from available information, the proposed swarm control method will drive the multi-agent system such that their time-averaged behavior becomes similar to the reference density. In this way, density-driven swarm control will enable the multiple agents to spend most of their time on high-priority regions that are reflected in the reference density, leading to efficiency. To protect the agents from collisions, the Artificial Potential Field method is employed and combined with the proposed density-driven swarm control scheme. Simulations are conducted to validate density-driven swarm control as well as to test collision avoidance. Also, the swarm performance is analyzed by varying the agent number in the simulation.
more »
« less
- Award ID(s):
- 2145810
- PAR ID:
- 10508975
- Publisher / Repository:
- IEEE
- Date Published:
- ISBN:
- 979-8-3503-0413-8
- Page Range / eLocation ID:
- 0714 to 0721
- Format(s):
- Medium: X
- Location:
- New York, NY, USA
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
This paper outlines a methodology for the construction of vector fields that can enable a multi-robot system moving on the plane to generate multiple dynamical behaviors by adjusting a single scalar parameter. This parameter essentially triggers a Hopf bifurcation in an underlying time-varying dynamical system that steers a robotic swarm. This way, the swarm can exhibit a variety of behaviors that arise from the same set of continuous differential equations. Other approaches to bifurcation-based swarm coordination rely on agent interaction which cannot be realized if the swarm members cannot sense or communicate with one another. The contribution of this paper is to offer an alternative method for steering minimally instrumented multi-robot collectives with a control strategy that can realize a multitude of dynamical behaviors without switching their constituent equations. Through this approach, analytical solutions for the bifurcation parameter are provided, even for more complex cases that are described in the literature, along with the process to apply this theory in a multi-agent setup. The theoretical predictions are confirmed via simulation and experimental results with the latter also demonstrating real-world applicability.more » « less
-
In this article, we consider the problem of stabilizing a class of degenerate stochastic processes, which are constrained to a bounded Euclidean domain or a compact smooth manifold, to a given target probability density. This stabilization problem arises in the field of swarm robotics, for example, in applications where a swarm of robots is required to cover an area according to a target probability density. Most existing works on modeling and control of robotic swarms that use partial differential equation (PDE) models assume that the robots' dynamics are holonomic and, hence, the associated stochastic processes have generators that are elliptic. We relax this assumption on the ellipticity of the generator of the stochastic processes, and consider the more practical case of the stabilization problem for a swarm of agents whose dynamics are given by a controllable driftless control-affine system. We construct state-feedback control laws that exponentially stabilize a swarm of nonholonomic agents to a target probability density that is sufficiently regular. State-feedback laws can stabilize a swarm only to target probability densities that are positive everywhere. To stabilize the swarm to probability densities that possibly have disconnected supports, we introduce a semilinear PDE model of a collection of interacting agents governed by a hybrid switching diffusion process. The interaction between the agents is modeled using a (mean-field) feedback law that is a function of the local density of the swarm, with the switching parameters as the control inputs. We show that under the action of this feedback law, the semilinear PDE system is globally asymptotically stable about the given target probability density. The stabilization strategies with and without agent interactions are verified numerically for agents that evolve according to the Brockett integrator; the strategy with interactions is additionally verified for agents that evolve according to an underactuated s...more » « less
-
null (Ed.)Abstract Swarm robotic search aims at searching targets using a large number of collaborating simple mobile robots, with applications to search and rescue and hazard localization. In this regard, decentralized swarm systems are touted for their coverage scalability, time efficiency, and fault tolerance. To guide the behavior of such swarm systems, two broad classes of approaches are available, namely, nature-inspired swarm heuristics and multi-robotic search methods. However, the ability to simultaneously achieve efficient scalability and provide fundamental insights into the exhibited behavior (as opposed to exhibiting a black-box behavior) remains an open problem. To address this problem, this paper extends the underlying search approach in batch-Bayesian optimization to perform search with embodied swarm agents operating in a (simulated) physical 2D arena. Key contributions lie in (1) designing an acquisition function that not only balances exploration and exploitation across the swarm but also allows modeling knowledge extraction over trajectories and (2) developing its distributed implementation to allow asynchronous task inference and path planning by the swarm robots. The resulting collective informative path planning approach is tested on target-search case studies of varying complexity, where the target produces a spatially varying (measurable) signal. Notably, superior performance, in terms of mission completion efficiency, is observed compared to exhaustive search and random walk baselines as well as a swarm optimization-based state-of-the-art method. Favorable scalability characteristics are also demonstrated.more » « less
-
This paper investigates motion planning for one or more robot(s) that attempt to harvest agents from a moving swarm. Generating motion paths that maximize the number of agents harvested differs from many traditional coverage problems because the agents move. This movement allows previously cleared areas to become recontaminated. We assume that the swarm agents prefer certain regions over others, and that we can represent the swarm by a Markov Process that encodes the agents' preferred regions and their speed of motion. We exploit this model to design and simulate robotic coverage paths that maximize the number of agents harvested by a fleet of robots in a given time budget.more » « less