Integrated Actor-Critic for Deep Reinforcement Learning

Jiaohao Zheng, Mehmet Necip

Citation Details

We propose a new deep deterministic actor-critic algorithm with an integrated network architecture and an integrated objective function. We address stabilization of the learning procedure via a novel adaptive objective that roughly keeps the actor unchanged while the critic makes large errors. We reduce the number of network parameters and propose an improved exploration strategy over bounded action spaces. Moreover, we incorporate some recent advances in deep learning into our algorithm. Experiments illustrate that our algorithm speeds up the learning process and considerably reduces the sample complexity relative to state-of-the-art algorithms including TD3, SAC, PPO, and A2C in continuous control tasks.

Award ID(s): 1954549

PAR ID: 10333252

Author(s) / Creator(s): Jiaohao Zheng, Mehmet Necip

Editor(s): I. Farkaš et al.

Date Published: 2021-01-01

Journal Name: ICANN 2021

Page Range / eLocation ID: 505–518

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
The DOI is not currently available.
