High-Level Semantic Networks for Multi-Scale Object Detection

Jiale Cao,Yanwei Pang,Xuelong Li,Shengjie Zhao

doi:10.1109/tcsvt.2019.2950526

Abstract

To better solve scale variance problem, deep multi-scale methods usually detect objects of different scales by different in-network layers. However, the semantic levels of features from different layers are usually inconsistent. In this paper, we propose a multi-branch and high-level semantic network by gradually splitting a base network into multiple different branches. As a result, the different branches have same depth and the output features of different branches have similarly high-level semantics. Due to the difference of receptive fields, the different branches are suitable to detect objects of different scales. Meanwhile, the multi-branch network does not introduce additional parameters by sharing the convolutional weights of different branches. To further improve detection performance, skip-layer connections are used to add context to the branch of relatively small receptive field, and dilated convolution is incorporated to enlarge the resolutions of output feature maps. When they are embedded into Faster RCNN architecture, the weighted scores of proposal generation network and proposal classification network are further proposed. Experiments on three pedestrian datasets (i.e., the KITTI dataset, the Caltech dataset, and the Citypersons dataset), one face dataset (i.e., the WIDER FACE dataset), and two general object datasets (i.e., the COCO benchmark and the PASCAL VOC dataset) demonstrate the effectiveness and generality of proposed method. On these datasets, our method achieves state-of-the-art performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

High-Level Semantic Networks for Multi-Scale Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Nov 22, 2019
Citations: 140

Similar Papers

Retinal Vessels Segmentation Based on Dilated Multi-Scale Convolutional Neural Network
Yun Jiang ... Hai Zhang
IEEE Access | VOL. 7
Yun Jiang, et. al.Yun Jiang ... Hai Zhang
01 Jan 2019
IEEE Access | VOL. 7

RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild
Jiankang Deng ... Evangelos Ververas
-
Jiankang Deng, et. al.Jiankang Deng ... Evangelos Ververas
01 Jun 2020
01 Jun 2020

ADYOLOv5-Face: An Enhanced YOLO-Based Face Detector for Small Target Faces
Linrunjia Liu ... Qiguang Miao
Electronics | VOL. 13
Linrunjia Liu, et. al.Linrunjia Liu ... Qiguang Miao
25 Oct 2024
Electronics | VOL. 13

Masked Face Detection Algorithm in the Dense Crowd Based on Federated Learning
Rui Zhu ... Guangqiang Yin
Wireless Communications and Mobile Computing | VOL. 2021
Rui Zhu, et. al.Rui Zhu ... Guangqiang Yin
01 Jan 2020
Wireless Communications and Mobile Computing | VOL. 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

High-Level Semantic Networks for Multi-Scale Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology