Human recognition by utilizing voice recognition and visual recognition

Sukaina Sh Altyar ,Samera Shams Hussein ,Mahir Jasem Mohammed

doi:10.22075/ijnaa.2022.5501

Abstract

Audio-visual detection and recognition system is thought to become the most promising methods for many applications includes surveillance, speech recognition, eavesdropping devices, intelligence operations, etc. In the recent field of human recognition, the majority of the research be- coming performed presently is focused on the reidentification of various body images taken by several cameras or its focuses on recognized audio-only. However, in some cases these traditional methods can- not be useful when used alone such as in indoor surveillance systems, that are installed close to the ceiling and capture images right from above in a downwards direction and in some cases people don't look straight the cameras or it cannot be added in some area such as W.C. or sleeping room. Thus, its commonly difficult to identify any movement or breakthrough process, on the other hand when need to pursue suspect when enter a building or party to identify his location and/or listen to his speech only and isolate it from other voices or noises, the other. Hence, the use of the hybrid combination technique is very effective. In this work, we proposed a multimodal human recognition approach that utilizes both the face and audio and is based upon a deep convolutional neural network (CNN). Mainly, to solve the challenge of not capturing part of the body, final results of recognizing via separate CNNs of VGG Face16 and ResNet50 are joined together depending on the score-level combination by Weighted Sum rule to enhance recognition performance. The results show that the proposed system success to recognise each person from his voice and/or his face captured. In addition, the system can separate the person voice and isolate it from noisy environment and determine the existence of desired person.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Human recognition by utilizing voice recognition and visual recognition

Abstract

Talk to us

Similar Papers

More From: International Journal of Nonlinear Analysis and Applications

Lead the way for us

Similar Papers

CNN-Based Multimodal Human Recognition in Surveillance Environments.
Ja Hyung Koo ... Min Cheol Kim
Sensors | VOL. 18
Ja Hyung Koo, et. al.Ja Hyung Koo ... Min Cheol Kim
11 Sep 2018
Sensors | VOL. 18

Effects of Noise on RASTA-PLP and MFCC based Bangla ASR Using CNN
Md Raffael Maruf ... Nazmun Nahar Nelima
-
Md Raffael Maruf, et. al.Md Raffael Maruf ... Nazmun Nahar Nelima
01 Jan 2020
01 Jan 2020

Audio-Visual Speech Recognition Using LSTM and CNN
Eslam E El Maghraby ... M Hesham Farouk
Recent Advances in Computer Science and Communications | VOL. 14
Eslam E El Maghraby, et. al.Eslam E El Maghraby ... M Hesham Farouk
20 Oct 2021
Recent Advances in Computer Science and Communications | VOL. 14

A Light-weight Convolutional Neural Network based Speech Recognition for Spoken Content Retrieval Task
Nirayo Hailu Gebreegziabher ... Andreas Nurnberger
-
Nirayo Hailu Gebreegziabher, et. al.Nirayo Hailu Gebreegziabher ... Andreas Nurnberger
11 Oct 2020
11 Oct 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Human recognition by utilizing voice recognition and visual recognition

Abstract

Talk to us

Similar Papers

More From: International Journal of Nonlinear Analysis and Applications