Unifying Visual Perception by Dispersible Points Learning

Jianming Liang,Yu Liu,Guanglu Song,Biao Leng

doi:10.1007/978-3-031-20077-9_26

Abstract

AbstractWe present a conceptually simple, flexible, and universal visual perception head for variant visual tasks, e.g., classification, object detection, instance segmentation and pose estimation, and different frameworks, such as one-stage or two-stage pipelines. Our approach effectively identifies an object in an image while simultaneously generating a high-quality bounding box or contour-based segmentation mask or set of keypoints. The method, called UniHead, views different visual perception tasks as the dispersible points learning via the transformer encoder architecture. Given a fixed spatial coordinate, UniHead adaptively scatters it to different spatial points and reasons about their relations by transformer encoder. It directly outputs the final set of predictions in the form of multiple points, allowing us to perform different visual tasks in different frameworks with the same head design. We show extensive evaluations on ImageNet classification and all three tracks of the COCO suite of challenges, including object detection, instance segmentation and pose estimation. Without bells and whistles, UniHead can unify these visual tasks via a single visual head design and achieve comparable performance compared to expert models developed for each task. We hope our simple and universal UniHead will serve as a solid baseline and help promote universal visual perception research. Code and models are available at https://github.com/Sense-X/UniHead. KeywordsDispersible points learningTransformer encoderGeneral visual perception

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unifying Visual Perception by Dispersible Points Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

MPViT: Multi-Path Vision Transformer for Dense Prediction
Youngwan Lee ... Jonghee Kim
-
Youngwan Lee, et. al.Youngwan Lee ... Jonghee Kim
01 Jun 2022
01 Jun 2022

Functional Magnetic Resonance Imaging during Visual Perception Tasks in Adolescents Born Prematurely.
Annika Lind ... Virva Saunavaara
Journal of the International Neuropsychological Society | VOL. 27
Annika Lind, et. al.Annika Lind ... Virva Saunavaara
15 Sep 2020
Journal of the International Neuropsychological Society | VOL. 27

Deficits in auditory and visual temporal perception in schizophrenia
Deana B Davalos ... Randal G Ross
Cognitive Neuropsychiatry | VOL. 7
Deana B Davalos, et. al.Deana B Davalos ... Randal G Ross
01 Nov 2002
Cognitive Neuropsychiatry | VOL. 7

Application of digital practice to improve head movement, visual perception and activities of daily living for subacute stroke patients with unilateral spatial neglect: Preliminary results of a single-blinded, randomized controlled trial.
Ho-Suk Choi ... Dae-Hyouk Bang
Medicine | VOL. 100
Ho-Suk Choi, et. al.Ho-Suk Choi ... Dae-Hyouk Bang
12 Feb 2021
Medicine | VOL. 100

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unifying Visual Perception by Dispersible Points Learning

Abstract

Talk to us

Similar Papers