LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning

Je Yang, Joo-Young Kim, Jaeuk Kim

doi:10.1109/icfpt56656.2022.9974543

Abstract

Multi-agent reinforcement learning (MARL) is a powerful technology for constructing interactive artificial intelligence systems in applications such as multi-robot control and self-driving cars. Supervised models and single-agent reinforcement learning actively exploit network pruning, but it is unclear how pruning will work in multi-agent reinforcement learning, given its cooperative and interactive characteristics. In this paper, we present a real-time sparse training acceleration system named LearningGroup, which adopts network pruning in the training of MARL for the first time with an algorithm/architecture co-design approach. We create sparsity using a weight grouping algorithm and propose an on-chip sparse data encoding loop (OSEL) that enables fast encoding with efficient implementation. Based on OSEL's encoding format, LearningGroup performs efficient weight compression and computation workload allocation to multiple cores, where each core handles multiple sparse rows of the weight matrix simultaneously with vector processing units. As a result, the LearningGroup system reduces the cycle time and memory footprint for sparse data generation by up to 5.72x and 6.81x, respectively. Its FPGA accelerator shows 257.40-3629.48 GFLOPS throughput and 7.10-100.12 GFLOPS/W energy efficiency for various conditions in MARL, which are 7.13x higher and 12.43x more energy efficient than an Nvidia Titan RTX GPU, thanks to the fully on-chip training and highly optimized dataflow/data format provided by the FPGA. Most importantly, the accelerator shows a speedup of up to 12.52x for processing sparse data over the dense case, which is the highest among state-of-the-art sparse training accelerators.
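
The abstract names three steps (grouping-based sparsity, row-wise sparse encoding, and workload allocation across cores) without giving their details. The following Python sketch is only a toy illustration of how such a pipeline could fit together. It assumes that a weight is kept when its input and output neurons share a group, that each row is stored as (column, value) pairs, and that rows are assigned greedily to the least-loaded core. None of these choices is confirmed by the abstract, the row format is a generic one rather than the OSEL format, and all function names are hypothetical.

```python
def group_mask(in_groups, out_groups):
    """Assumed rule: mask[r][c] is 1 when output r and input c share a group."""
    return [[1 if g_out == g_in else 0 for g_in in in_groups]
            for g_out in out_groups]


def encode_rows(weights, mask):
    """Generic row compression (not OSEL): keep (column, value) where mask is 1."""
    return [[(c, w) for c, (w, m) in enumerate(zip(w_row, m_row)) if m]
            for w_row, m_row in zip(weights, mask)]


def sparse_matvec(encoded, x):
    """Multiply the compressed matrix by vector x, touching only kept weights."""
    return [sum(w * x[c] for c, w in row) for row in encoded]


def assign_rows(encoded, num_cores):
    """Hypothetical policy: give each row to the core with the fewest nonzeros."""
    loads = [0] * num_cores
    plan = [[] for _ in range(num_cores)]
    # Place the heaviest rows first so the greedy choice balances better.
    for r, row in sorted(enumerate(encoded), key=lambda t: -len(t[1])):
        core = loads.index(min(loads))
        plan[core].append(r)
        loads[core] += len(row)
    return plan, loads


# Toy layer: 6 inputs, 4 outputs, 3 groups (group ids here are fixed by hand;
# in a learnable scheme they would come from trained parameters).
in_groups = [0, 1, 0, 1, 2, 2]
out_groups = [0, 1, 2, 0]
weights = [[float(r * 10 + c + 1) for c in range(6)] for r in range(4)]
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]

mask = group_mask(in_groups, out_groups)
encoded = encode_rows(weights, mask)
y_sparse = sparse_matvec(encoded, x)
y_dense = [sum(w * m * xi for w, m, xi in zip(w_row, m_row, x))
           for w_row, m_row in zip(weights, mask)]
plan, loads = assign_rows(encoded, num_cores=2)
```

In this toy setup every row keeps 2 of its 6 weights, so the two cores end up with equal nonzero counts, and the sparse product matches the masked dense product while skipping the pruned entries.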

Similar Papers

Effective control of two-dimensional Rayleigh–Bénard convection: Invariant multi-agent reinforcement learning is all you need
Colin Vignon ... Mikael Mortensen
Physics of Fluids | VOL. 35
01 Jun 2023

Deep Reinforcement Learning for Energy Efficiency Maximization in Cache-Enabled Cell-Free Massive MIMO Networks: Single- and Multi-Agent Approaches
Yu-Chieh Chuang ... Ronald Y Chang
IEEE Transactions on Vehicular Technology | VOL. 72
01 Aug 2023

Lessons learned in single-agent and multiagent learning with robot foraging
Z Ren ... A.B Williams
10 Nov 2003

Simulation of football sport PID controller based on BP neural network
Qiangguo Lv ... Simon K.S Cheung
Journal of Intelligent & Fuzzy Systems | VOL. 40
01 Jan 2020
