Multi-modal pedestrian detection with misalignment based on modal-wise regression and multi-modal IoU

Napat Wanchaitanawong,Masatoshi Okutomi,Masayuki Tanaka,Takashi Shibata

doi:10.1117/1.jei.32.1.013025

Napat Wanchaitanawong, Masatoshi Okutomi + Show 2 more

https://doi.org/10.1117/1.jei.32.1.013025

Copy DOI

Abstract

Multi-modal pedestrian detection, which integrates visible and thermal sensors, has been developed to overcome many limitations of visible-modal pedestrian detection, such as poor illumination, cluttered background, and occlusion. By adopting the combination of multiple modalities, we can efficiently detect pedestrians even with poor visibility. Nevertheless, the critical assumption of multi-modal pedestrian detection is that multi-modal images are perfectly aligned. In general, however, this assumption often becomes invalid in real-world situations. Viewpoints of the different modal sensors are usually different. Then, the positions of pedestrians on the different modal images have disparities. We proposed a multi-modal faster-RCNN specifically designed to handle misalignment between two modalities. The faster-RCNN consists of a region proposal network (RPN) and a detector. We introduce position regressors for both modalities in the RPN and the detector. Intersection over union (IoU) is one of the useful metrics for object detection but is defined only for a single-modal image. We extend it into multi-modal IoU to evaluate the preciseness of both modalities. Our experimental results with the proposed evaluation metrics demonstrate that the proposed method has comparable performance with state-of-the-art methods and outperforms them for data with significant misalignment.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Electronic Imaging	Publication Date: Feb 7, 2023
Citations: 3	License type: cc-by

R Discovery Prime

R Discovery Prime

Multi-modal pedestrian detection with misalignment based on modal-wise regression and multi-modal IoU

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging

Lead the way for us

Similar Papers

A benchmark bone marrow aspirate smear dataset and a multi-scale cell detection model for the diagnosis of hematological disorders.
Jie Su ... Jinjun Han
Computerized Medical Imaging and Graphics | VOL. 90
Jie Su, et. al.Jie Su ... Jinjun Han
02 Apr 2021
Computerized Medical Imaging and Graphics | VOL. 90

Pedestrian Detection Using Regional Proposal Network with Feature Fusion
Xiaogang Lv ... Jianxin Zhang
-
Xiaogang Lv, et. al.Xiaogang Lv ... Jianxin Zhang
01 Nov 2018
01 Nov 2018

Refine pedestrian detections by referring to features in different ways
Jaemyung Lee ... Janghyeon Lee
-
Jaemyung Lee, et. al.Jaemyung Lee ... Janghyeon Lee
01 Jun 2017
01 Jun 2017

Pedestrian detection with dilated convolution, region proposal network and boosted decision trees
Jiqian Li ... Chen Ye
-
Jiqian Li, et. al.Jiqian Li ... Chen Ye
01 May 2017
01 May 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-modal pedestrian detection with misalignment based on modal-wise regression and multi-modal IoU

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging