NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Federated Learning for Crowd Counting in Smart Surveillance Systems

https://doi.org/10.1109/JIOT.2023.3305933

Pang, Yiran; Ni, Zhen; Zhong, Xiangnan (February 2024, IEEE Internet of Things Journal)

Full Text Available
Microgrid energy scheduling under uncertain extreme weather: Adaptation from parallelized reinforcement learning agents

https://doi.org/10.1016/j.ijepes.2023.109210

Das, Avijit; Ni, Zhen; Zhong, Xiangnan (October 2023, International Journal of Electrical Power & Energy Systems)

Full Text Available
An Automated Statistical Evaluation Framework of Rapidly-Exploring Random Tree Frontier Detector for Indoor Space Exploration

https://doi.org/10.1109/ICCR55715.2022.10053918

Andy, Wen-Chung Cheng; Marty, Wen-Yu Cheng; Ni, Zhen; Zhong, Xiangnan (December 2022, 2022 4th International Conference on Control and Robotics (ICCR))

Full Text Available
Multi-Virtual-Agent Reinforcement Learning for a Stochastic Predator-Prey Grid Environment

https://doi.org/10.1109/IJCNN55064.2022.9891898

Lin, Yanbin; Ni, Zhen; Zhong, Xiangnan (September 2022, 2022 International Joint Conference on Neural Networks (IJCNN))

Generalization problem of reinforcement learning is crucial especially for dynamic environments. Conventional reinforcement learning methods solve the problems with some ideal assumptions and are difficult to be applied in dynamic environments directly. In this paper, we propose a new multi-virtual- agent reinforcement learning (MVARL) approach for a predator-prey grid game. The designed method can find the optimal solution even when the predator moves. Specifically, we design virtual agents to interact with simulated changing environments in parallel instead of using actual agents. Moreover, a global agent learns information from these virtual agents and interacts with the actual environment at the same time. This method can not only effectively improve the generalization performance of reinforcement learning in dynamic environments, but also reduce the overall computational cost. Two simulation studies are considered in this paper to validate the effectiveness of the designed method. We also compare the results with the conventional reinforcement learning methods. The results indicate that our proposed method can improve the robustness of reinforcement learning method and contribute to the generalization to certain extent.
more » « less
Full Text Available
An Intelligent and Secure Control Approach for Nonlinear Systems under Attacks

https://doi.org/10.1109/SSCI50451.2021.9659857

Zhong, Xiangnan; Ni, Zhen (December 2021, 2021 IEEE Symposium Series on Computational Intelligence (SSCI))

Full Text Available
Aggregating Learning Agents for Microgrid Energy Scheduling During Extreme Weather Events

https://doi.org/10.1109/PESGM46819.2021.9637949

Das, Avijit; Ni, Zhen; Zhong, Xiangnan (July 2021, 2021 IEEE Power & Energy Society General Meeting (PESGM))

Full Text Available
Semicentralized Deep Deterministic Policy Gradient in Cooperative StarCraft Games

https://doi.org/10.1109/TNNLS.2020.3042943

Xie, Dong; Zhong, Xiangnan (December 2020, IEEE Transactions on Neural Networks and Learning Systems)
null (Ed.)
In this article, we propose a novel semicentralized deep deterministic policy gradient (SCDDPG) algorithm for cooperative multiagent games. Specifically, we design a two-level actor-critic structure to help the agents with interactions and cooperation in the StarCraft combat. The local actor-critic structure is established for each kind of agents with partially observable information received from the environment. Then, the global actor-critic structure is built to provide the local design an overall view of the combat based on the limited centralized information, such as the health value. These two structures work together to generate the optimal control action for each agent and to achieve better cooperation in the games. Comparing with the fully centralized methods, this design can reduce the communication burden by only sending limited information to the global level during the learning process. Furthermore, the reward functions are also designed for both local and global structures based on the agents' attributes to further improve the learning performance in the stochastic environment. The developed method has been demonstrated on several scenarios in a real-time strategy game, i.e., StarCraft. The simulation results show that the agents can effectively cooperate with their teammates and defeat the enemies in various StarCraft scenarios.
more » « less
Full Text Available
Comprehensive cooperative deep deterministic policy gradients for multi-agent systems in unstable environment

https://doi.org/10.1117/12.2519153

Xie, Dong; Zhong, Xiangnan; Yang, Qing; Huang, Yan (May 2019, SPIE Defense + Commercial Sensing)

Full Text Available
Deep Deterministic Policy Gradients with Transfer Learning Framework in StarCraft Micromanagement

https://doi.org/10.1109/EIT.2019.8833742

Xie, Dong; Zhong, Xiangnan (May 2019, 2019 IEEE International Conference on Electro Information Technology (EIT))

This paper proposes an intelligent multi-agent approach in a real-time strategy game, StarCraft, based on the deep deterministic policy gradients (DDPG) techniques. An actor and a critic network are established to estimate the optimal control actions and corresponding value functions, respectively. A special reward function is designed based on the agents' own condition and enemies' information to help agents make intelligent control in the game. Furthermore, in order to accelerate the learning process, the transfer learning techniques are integrated into the training process. Specifically, the agents are trained initially in a simple task to learn the basic concept for the combat, such as detouring moving, avoiding and joining attacking. Then, we transfer this experience to the target task with a complex and difficult scenario. From the experiment, it is shown that our proposed algorithm with transfer learning can achieve better performance.
more » « less
Full Text Available

Search for: All records