Title: Multi-robot concurrent learning of cooperative behaviours for the tracking of multiple moving targets

Authors: Zheng Liu, Marcelo H. Ang Jr, Winston Khoon Guan Seah

Addresses: Department of Electrical and Computer Engineering, National University of Singapore, 119260, Singapore; Institute for Infocomm Research, Singapore. ' Department of Mechanical Engineering, National University of Singapore, 119260, Singapore. ' Network Technology Department, Institute for Infocomm Research, Agency for Science, Technology and Research, 119613 Singapore

Abstract: Reinforcement learning has been extensively studied and applied for generating cooperative behaviours in multi-robot systems. However, traditional reinforcement learning algorithms assume discrete state and action spaces with finite number of elements. This limits the learning to discrete behaviours and cannot be applied to most real multi-robot systems that inherently require appropriate combinations of different elementary behaviours. To address this problem, we design a distributed learning controller that integrates reinforcement learning with behaviour-based control networks. This learning controller can enable the robots to generate appropriate control policy without the need for human design or hardcoding. Furthermore, to address the problems in concurrent learning, we propose a distributed learning control algorithm to coordinate the concurrent learning processes. The distributed learning controller and learning control algorithm are applied to multi-robot tracking of multiple moving targets. The efficacy of our proposed scheme is shown through simulations.

Keywords: behaviour-based control; cooperative tracking; multi-robot systems; concurrent learning; reinforcement learning; vehicle autonomous systems; autonomous vehicles; robot learning; multiple robots; robot tracking; tracking control; robot control; cooperative behaviours; distributed learning; multiple moving targets; simulation.

DOI: 10.1504/IJVAS.2006.012207

International Journal of Vehicle Autonomous Systems, 2006 Vol.4 No.2/3/4, pp.196 - 215

Published online: 28 Jan 2007 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article