Journal Home Online First Current Issue Archive For Authors Journal Information 中文版

Frontiers of Information Technology & Electronic Engineering >> 2024, Volume 25, Issue 6 doi: 10.1631/FITEE.2300394

Multi-agent reinforcement learning behavioral control for nonlinear second-order systems

Affiliation(s): College of Electrical Engineering and Automation, Fuzhou University, Fuzhou 350108, China; 5G+ Industrial Internet Institute, Fuzhou University, Fuzhou 350108, China; less

Received: 2023-06-01 Accepted: 2024-07-05 Available online: 2024-07-05

Next Previous

Abstract

(RLBC) is limited to an individual agent without any swarm mission, because it models the behavior priority learning as a Markov decision process. In this paper, a novel multi-agent (MARLBC) method is proposed to overcome such limitations by implementing joint learning. Specifically, a multi-agent (MARLMS) is designed for a group of nonlinear to assign the behavior priorities at the decision layer. Through modeling behavior priority switching as a cooperative Markov game, the MARLMS learns an optimal joint behavior priority to reduce dependence on human intelligence and high-performance computing hardware. At the control layer, a group of second-order controllers are designed to learn the optimal control policies to track position and velocity signals simultaneously. In particular, input saturation constraints are strictly implemented via designing a group of adaptive compensators. Numerical simulation results show that the proposed MARLBC has a lower switching frequency and control cost than finite-time and fixed-time and RLBC methods.

Related Research