Hierarchical Picture of existing Audio-Visual Speech Database

Bibish Kumar K T*, Sunil John, R K Sunil Kumar, Muraleedharan K M

doi:10.35940/ijrte.c6483.098319

Abstract

Despite technological improvements and the arrival of new methodologies in the different processes of speech-based applications, a parallel development is not observed in the availability of audio-visual speech databases. This paper provides a detailed hierarchical picture of the existing audio-visual speech databases. The performance of a speech-based application depends heavily on the parameters involved in creating a database for a specific task, such as the number of speakers, speaker variability, phonetically balanced sentences, and recording quality. This paper therefore gives more importance to these parameters in the existing audio-visual speech databases than to the experimental side, which requires linguistic knowledge of the language concerned for the feature extraction and classification tasks. The paper is arranged so that a newcomer to this field can find what is needed to build a speech database in their own language. In addition, this paper differs from other review papers in that it gives equal importance to the audio-visual speech databases available in resourced and under-resourced languages.

Journal: International Journal of Recent Technology and Engineering (IJRTE)
