Abstract

Most current research on human recognition focuses on re-identification of whole-body images captured by multiple cameras in outdoor environments, while indoor human recognition has received comparatively little attention. Previous research on indoor recognition has mainly focused on face recognition, because indoor cameras are usually closer to a person than outdoor cameras. However, indoor surveillance cameras are typically installed near the ceiling and capture images from above in a downward direction, so in most cases people do not look directly at them. As a result, frontal face images are often unavailable, and facial recognition accuracy is greatly reduced. One way to overcome this problem is to use both the face and the body for human recognition. However, with indoor cameras, in many cases only part of the target's body falls within the camera's viewing angle, which again reduces recognition accuracy. To address these problems, this paper proposes a multimodal human recognition method that uses both the face and body and is based on deep convolutional neural networks (CNNs). Specifically, to compensate for partially captured bodies, the face and body are recognized by separate CNNs (VGG Face-16 and ResNet-50), and the resulting scores are combined by score-level fusion using a weighted-sum rule to improve recognition performance. Experiments conducted on the custom-made Dongguk face and body database (DFB-DB1) and the open ChokePoint database demonstrate that the proposed method achieves high recognition accuracy (equal error rates of 1.52% and 0.58%, respectively) compared to face- or body-only single-modality recognition and methods used in previous studies.
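The weighted-sum score-level fusion described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, the min-max normalization step, and the example weight of 0.7 are assumptions for clarity (the actual weight would be tuned experimentally on a validation set).

```python
def min_max_normalize(score, lo, hi):
    # Map a raw matching score into [0, 1] so that scores produced by
    # different CNNs (face vs. body) become comparable before fusion.
    return (score - lo) / (hi - lo)

def weighted_sum_fusion(face_score, body_score, w_face=0.7):
    # Weighted-sum rule: the fused score is a convex combination of the
    # two normalized modality scores; w_face controls the face's weight.
    return w_face * face_score + (1.0 - w_face) * body_score

# Example: a confident face match combined with a weaker body match.
face = min_max_normalize(9.0, 0.0, 10.0)   # 0.9
body = min_max_normalize(4.0, 0.0, 10.0)   # 0.4
fused = weighted_sum_fusion(face, body, w_face=0.7)  # 0.63 + 0.12 = 0.75
```

The fused score is then compared against a decision threshold; sweeping that threshold over genuine and impostor score distributions yields the equal error rates reported above.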

Highlights

  • Previous biometrics studies have used various modalities, including the face, fingerprints, body, irises, retinas, veins, and voice [1,2,3,4,5,6,7,8,9]

  • This study focuses on cases that frequently occur in indoor surveillance camera environments, in which a person approaches or moves away from the camera; the proposed method is the first multimodal human recognition approach that separately recognizes the face and body regions in a single image and combines the results

  • This paper proposes a multimodal human recognition method that uses both the face and body regions in indoor surveillance camera environments and is based on a deep convolutional neural network (CNN)


Introduction

Previous biometrics studies have used various modalities, including the face, fingerprints, body, irises, retinas, veins, and voice [1,2,3,4,5,6,7,8,9]. In a typical surveillance camera environment, fingerprint or vein recognition is impractical, so face, body, and iris recognition have been considered. Because the camera is normally installed above the user and captures images in a downward direction, it mainly produces off-angle images of the user's iris, and recognition accuracy is greatly reduced in such circumstances [9].

