Towards Real-Time Segmentation on the Edge

Yanyu Li,Geng Yuan,Jiexiong Guan,Bin Ren,Pu Zhao,Hao Tang,Wei Niu,Changdi Yang,Yanzhi Wang,Xue Lin,Qing Jin,Minghai Qin

doi:10.1609/aaai.v37i2.25232

Abstract

The research in real-time segmentation mainly focuses on desktop GPUs. However, autonomous driving and many other applications rely on real-time segmentation on the edge, and current arts are far from the goal. In addition, recent advances in vision transformers also inspire us to re-design the network architecture for dense prediction task. In this work, we propose to combine the self attention block with lightweight convolutions to form new building blocks, and employ latency constraints to search an efficient sub-network. We train an MLP latency model based on generated architecture configurations and their latency measured on mobile devices, so that we can predict the latency of subnets during search phase. To the best of our knowledge, we are the first to achieve over 74% mIoU on Cityscapes with semi-real-time inference (over 15 FPS) on mobile GPU from an off-the-shelf phone.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Towards Real-Time Segmentation on the Edge

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 5

Similar Papers

ESTIMATION OF POWER CONSUMPTION OF MOBILE DEVICES IN CLOUD COMPUTING
Oleksandr Mamchych ... Maksym Volk
Innovative Technologies and Scientific Solutions for Industries | VOL. -
Oleksandr Mamchych, et. al.Oleksandr Mamchych ... Maksym Volk
20 Apr 2023
Innovative Technologies and Scientific Solutions for Industries | VOL. -

Latency Estimation Tool and Investigation of Neural Networks Inference on Mobile GPU
Evgeny Ponomarev ... Sergey Matveev
Computers | VOL. 10
Evgeny Ponomarev, et. al.Evgeny Ponomarev ... Sergey Matveev
23 Aug 2021
Computers | VOL. 10

Energy-efficient design of a presbyopia correction wearable powered by mobile GPUs and FPGAs
Juan Mompeán ... Juan L Aragón
The Journal of Supercomputing | VOL. 78
Juan Mompeán, et. al.Juan Mompeán ... Juan L Aragón
16 Feb 2022
The Journal of Supercomputing | VOL. 78

A model of architecture for estimating GPU processing performance and power
Saman Payvar ... Maxime Pelcat
Design Automation for Embedded Systems | VOL. 25
Saman Payvar, et. al.Saman Payvar ... Maxime Pelcat
16 Jan 2021
Design Automation for Embedded Systems | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards Real-Time Segmentation on the Edge

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence