CNN-Based Mask-Pose Fusion for Detecting Specific Persons on Heterogeneous Embedded Systems

Jeongjun Lee, Jinhong Lee, Hyun Kim, Jihoon Jang, Dayoung Chun

doi:10.1109/access.2021.3108776


Open Access

https://doi.org/10.1109/access.2021.3108776


Abstract

In recent years, numerous convolutional neural network (CNN)-based detection models have been proposed and have shown excellent performance. However, because these models are generally developed to detect objects at the class level (e.g., person, car), finding a specific object requires additional training on large datasets. This paper proposes a model that accurately detects specific persons by using the color of their upper-body clothing, without any additional training. The proposed method combines CNN-based instance segmentation and pose estimation to exploit the advantages of both techniques. To avoid redundant computation, the two schemes are implemented as a filtering-based sequential operation structure. As a result, the proposed method achieves 92.57% accuracy in detecting a specific person with only a slight decrease in processing speed. Furthermore, the proposed model is efficiently ported to a heterogeneous embedded platform (i.e., NVIDIA Jetson AGX Xavier) with a parallel processing technique to maximize hardware utilization.
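The filtering-based sequential structure mentioned in the abstract can be illustrated with a minimal sketch. This is only one possible reading, not the authors' implementation: it assumes that person masks from the segmentation stage are first screened with an inexpensive whole-mask color check, and that the more expensive second stage runs only on the candidates that pass. The names `sequential_filter`, `mean_color`, `color_distance`, and the `refine` callback (a stand-in for the pose-based check) are hypothetical, as is the use of Euclidean distance in RGB space.

```python
from typing import Callable, List, Optional, Sequence

import numpy as np


def mean_color(image: np.ndarray, mask: np.ndarray) -> Optional[np.ndarray]:
    """Mean RGB value of the pixels selected by a boolean mask (None if empty)."""
    if not mask.any():
        return None
    return image[mask].mean(axis=0)


def color_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance in RGB space (an assumed, simplistic color metric)."""
    return float(np.linalg.norm(np.asarray(a, dtype=float) - np.asarray(b, dtype=float)))


def sequential_filter(
    image: np.ndarray,
    person_masks: Sequence[np.ndarray],
    target_color: Sequence[float],
    threshold: float,
    refine: Callable[[np.ndarray, np.ndarray], bool],
) -> List[int]:
    """Return indices of masks that pass both stages.

    Stage 1 (cheap): discard masks whose overall color is far from the target.
    Stage 2 (expensive): call `refine` only on the surviving candidates.
    """
    matches = []
    for idx, mask in enumerate(person_masks):
        color = mean_color(image, mask)
        if color is None or color_distance(color, target_color) > threshold:
            continue  # filtered out; the expensive stage is never run
        if refine(image, mask):
            matches.append(idx)
    return matches
```

With this structure, the cost of the second stage scales with the number of candidates that survive the color filter rather than with the number of persons in the frame.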

Highlights

With the development of hardware accelerators like graphics processing units (GPUs), deep learning (DL) has become widely used in various computer vision (CV) tasks, such as image classification [1]–[3], object detection [4]–[7], segmentation [8]–[12], and pose estimation [13]–[17], and has shown remarkable performance.
Network structure: This paper proposes a mask-pose fusion model that combines the representative instance segmentation model, YOLACT++ [10], and the representative pose estimation model, AlphaPose [17], to identify a specific person in real time using the precise position of the upper body.
Experimental environments: To verify the performance of the proposed design, the accuracy and processing speed are evaluated on an RTX-2080 GPU with the COCO pre-trained weights of YOLACT++ and AlphaPose.
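To make the idea of fusing a mask with pose keypoints more concrete, the following is a minimal sketch of how an upper-body color could be read from the two outputs. It is an illustration under stated assumptions, not the paper's method: the torso is approximated by the bounding box of the shoulder and hip keypoints, the keypoint names follow COCO-style naming, and `torso_box` and `upper_body_color` are hypothetical helpers. The mask and keypoints are plain NumPy inputs here; no YOLACT++ or AlphaPose API is called.

```python
from typing import Dict, Optional, Tuple

import numpy as np

# Assumed COCO-style keypoint names that delimit the torso.
TORSO_KEYS = ("left_shoulder", "right_shoulder", "left_hip", "right_hip")


def torso_box(keypoints: Dict[str, Tuple[float, float]]) -> Tuple[int, int, int, int]:
    """Axis-aligned box (x0, y0, x1, y1) enclosing the shoulder and hip keypoints."""
    pts = np.array([keypoints[k] for k in TORSO_KEYS], dtype=float)
    x0, y0 = np.floor(pts.min(axis=0)).astype(int)
    x1, y1 = np.ceil(pts.max(axis=0)).astype(int)
    return int(x0), int(y0), int(x1), int(y1)


def upper_body_color(
    image: np.ndarray,
    mask: np.ndarray,
    keypoints: Dict[str, Tuple[float, float]],
) -> Optional[np.ndarray]:
    """Mean RGB of pixels that are inside both the person mask and the torso box."""
    h, w = mask.shape
    x0, y0, x1, y1 = torso_box(keypoints)
    # Clip the box to the image bounds.
    x0, y0 = max(x0, 0), max(y0, 0)
    x1, y1 = min(x1, w - 1), min(y1, h - 1)
    region = np.zeros((h, w), dtype=bool)
    if x0 <= x1 and y0 <= y1:
        region[y0 : y1 + 1, x0 : x1 + 1] = True
    torso = region & mask  # the mask removes background pixels inside the box
    if not torso.any():
        return None
    return image[torso].mean(axis=0)
```

Intersecting the box with the instance mask is the point of the fusion in this sketch: the keypoints localize the upper body, while the mask excludes background and occluding pixels that a box alone would include.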

Summary

Introduction

With the development of hardware accelerators like graphics processing units (GPUs), deep learning (DL) has become widely used in various computer vision (CV) tasks, such as image classification [1]–[3], object detection [4]–[7], segmentation [8]–[12], and pose estimation [13]–[17], and has shown remarkable performance. Several prior studies have focused on DL-based facial recognition schemes [18], [22]–[24]. In practical environments, however, it may be necessary to identify specific persons in CCTV images, where resolution and noise problems significantly limit the accuracy of facial recognition [25]. All these approaches can be used for specific person detection. Their characteristics are as follows: YOLOv3 [5], a representative object detection model, predicts classes using binary cross-entropy loss and creates anchor boxes through clustering to detect bounding boxes. It is not suitable for use as an upper-body garment color discrimination model.



Journal: IEEE Access
Publication Date: Jan 1, 2021
Citations: 12
License type: CC BY 4.0
