Lip Reading Using Various Deep Learning Models with Visual Turkish Data

Ali Berkol,Talya Tümer Si̇vri̇,Hamit Erdem

doi:10.35378/gujs.1239207

Abstract

In the Human-Computer Interaction field, lip reading is essential and still an open research problem. In the last decades, there have been many studies in the field of Automatic Lip-Reading (ALR) in different languages which is important for societies where the essential applications developed. Similarly to other machine learning and artificial intelligence applications, Deep Learning (DL) based classification algorithms have been applied for ALR in order to improve the performance of ALR. In the field of ALR, few studies have been done on the Turkish language. In this study, firstly an original data set was provided. Also, three image data augmentation techniques, which are sigmoidal transform, horizontal flip, and inverse transform, were applied to increase the data quality and variety. Then three deep learning models: Convolutional Neural Networks (CNN), Long-Short Term Memory (LSTM), and Bidirectional Gated Recurrent Unit (BGRU), were performed with a visual Turkish lip reading dataset. The performance of the applied method has been compared regarding precision, recall, and F1 metrics. According to experiment results, BGRU and LSTM models gave the same results up to the fifth decimal, and BGRU had the fastest training time.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Lip Reading Using Various Deep Learning Models with Visual Turkish Data

Abstract

Published Version

Talk to us

Similar Papers

More From: GAZI UNIVERSITY JOURNAL OF SCIENCE

Lead the way for us

Journal: GAZI UNIVERSITY JOURNAL OF SCIENCE	Publication Date: Nov 15, 2023
Citations: 1

Similar Papers

Sentiment Analysis of Self Driving Car Dataset: A comparative study of Deep Learning approaches
Devshri Pandya ... Ankit Thakkar
Procedia Computer Science | VOL. 235
Devshri Pandya, et. al.Devshri Pandya ... Ankit Thakkar
01 Jan 2024
Procedia Computer Science | VOL. 235

The Design of an Intelligent Lightweight Stock Trading System Using Deep Learning Models: Employing Technical Analysis Methods
Seongjae Yu ... Sang-Hyeak Yoon
Systems | VOL. 11
Seongjae Yu, et. al.Seongjae Yu ... Sang-Hyeak Yoon
13 Sep 2023
Systems | VOL. 11

Hidden Markov guided Deep Learning models for forecasting highly volatile agricultural commodity prices
G. Avinash ... Mir Asif Iquebal
Applied Soft Computing | VOL. 158
G. Avinash, et. al.G. Avinash ... Mir Asif Iquebal
01 Apr 2024
Applied Soft Computing | VOL. 158

Forecasting of Bicycle and Pedestrian Traffic Using Flexible and Efficient Hybrid Deep Learning Approach
Fouzi Harrou ... Abdelhafid Zeroual
Applied Sciences | VOL. 12
Fouzi Harrou, et. al.Fouzi Harrou ... Abdelhafid Zeroual
28 Apr 2022
Applied Sciences | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Lip Reading Using Various Deep Learning Models with Visual Turkish Data

Abstract

Published Version

Talk to us

Similar Papers

More From: GAZI UNIVERSITY JOURNAL OF SCIENCE