Chapter 6 - Linguistically involved data-driven approach for Malayalam phoneme-to-viseme mapping

K.T Bibish Kumar,Sunil John,K.M Muraleedharan,R.K Sunil Kumar

doi:10.1016/b978-0-12-823898-1.00003-5

Abstract

Knowledge of phonemes and visemes in language is a vital component of speech-based applications. A phoneme is the nuclear sound unit necessary to symbolize all words in a particular speech. The present definition of viseme is a visual language unit that describes the state of different speech articulators. This chapter discusses the primary task of identifying visemes and the number of frames required to encode the temporal evolution of vowel and consonant phonemes. For this work, an audio-visual Malayalam speech database is created from 23 native speakers of Kerala (18 females and five males). The tongue plays a vital role in the utterance of Malayalam, regarding flexibility and speed, which makes it distinct from other languages. The appearance of teeth and the oral cavity and the shape of the lips can be modeled using geometric features of lips obtained from the hue, saturation, value (HSV) color space, and deformation in the appearance of the lips and tongue can be modeled using the discrete cosine transform (DCT) feature. A linguistically involved, data-driven approach can model individual perception from a linguistic approach with the computational ease of a data-driven approach. The visual speech attributes are then clustered to identify the visual equivalent of the phoneme employing K-means cluster and Gap statistic. To study the temporal variation, we analyzed three phoneme-to-viseme mappings and compared them with the linguistic mapping and visual speech duration.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Chapter 6 - Linguistically involved data-driven approach for Malayalam phoneme-to-viseme mapping

Abstract

Talk to us

Similar Papers

More From: Applied Speech Processing

Lead the way for us

Similar Papers

Comparative study of color iris recognition: DCT vs. vector quantization approaches in rgb and hsv color spaces
Suchitra Patil ... Nilkamal More
-
Suchitra Patil, et. al.Suchitra Patil ... Nilkamal More
01 Sep 2017
01 Sep 2017

<title>New HSL and HSV color spaces and applications</title>
Gabriel G Marcu ... Satoshi Abe
-
Gabriel G Marcu, et. al.Gabriel G Marcu ... Satoshi Abe
07 Feb 1997
07 Feb 1997

Viseme set identification from Malayalam phonemes and allophones
K T Bibish Kumar ... V L Lajish
International Journal of Speech Technology | VOL. 22
K T Bibish Kumar, et. al.K T Bibish Kumar ... V L Lajish
04 Nov 2019
International Journal of Speech Technology | VOL. 22

Color Conversion Formulae between RGB Color Space and HSI Color Space for Color Image Processing
Taichi Oinosho ... Akira Taguchi
-
Taichi Oinosho, et. al.Taichi Oinosho ... Akira Taguchi
16 Nov 2021
16 Nov 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Chapter 6 - Linguistically involved data-driven approach for Malayalam phoneme-to-viseme mapping

Abstract

Talk to us

Similar Papers

More From: Applied Speech Processing