Abstract

Collective behaviours such as swarm formation of autonomous agents offer the advantages of efficient movement, redundancy, and potential for human guidance of a single swarm organism. However, tuning the behaviour of a group of agents so that they swarm, is difficult. Behaviour-bootstrapping algorithms permit agents to self-tune behaviour adapted for their physical form and associated movement constraints. This paper proposes a reinforcement learning framework to tune collective motion behaviours from random behaviours. The learning process is guided by a novel reward function capable of autonomously detecting generic collective motion behaviours from sensor data about the relative velocity and position of neighbouring agents. Our reward function is designed using a meta-learner trained on a human-labelled collective motion dataset. We demonstrate that our reinforcement learner can tune the behaviour of randomly moving groups so that structured collective motion emerges. We compare our framework to an existing developmental evolutionary framework for this purpose. Our results demonstrate that the proposed learning framework can generate behaviours with different collective motion characteristics more quickly than existing approaches. In addition, the trained reinforcement learner can tune the behaviour of robots with movement characteristics that it has not been trained on.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.