Lip Reading in Cantonese

Yewei Xiao, Aosu Zhu, Lianwei Teng, Picheng Tian, Xuanming Liu

doi:10.1109/access.2022.3204677

Abstract

Lip reading aims to recognize text from a talking face without audio information. Owing to the rapid development of deep learning techniques, researchers have made major breakthroughs in both word-level and sentence-level English lip reading in recent years. Unlike in English, lexical meanings in Chinese are difficult to distinguish, because Chinese is a tonal language. In addition, most existing Chinese lip reading datasets are designed for Mandarin; few exist for Cantonese. In this paper, we propose a word-level Cantonese lip reading dataset called CLRW, which contains 800 word classes with 400,000 samples. For better practical applicability, we do not restrict gender, age, posture, lighting conditions, or speech speed, so that CLRW is closer to the real-scene distribution. First, we give a detailed description of the data collection process. Next, we propose a novel two-branch network, named TBGL, which consists of a global branch and a local branch. The global branch models the whole lip, and the local branch divides the feature into three parts to focus on subtle local lip motion. We jointly train these two branches and achieve comparable performance on LRW, CAS-VSR-W1K, and CLRW. Finally, we benchmark our dataset and perform a comprehensive analysis of the results, which demonstrates that CLRW is challenging and will have a positive impact on further Cantonese lip reading tasks.
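
The global/local idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' TBGL implementation: the single-channel feature map, the split along the height axis, the use of average pooling, and the names `mean_pool`, `split_into_parts`, and `two_branch_descriptor` are all assumptions made for illustration, since the abstract only states that the local branch divides the feature into three parts.

```python
from typing import List

# Hypothetical single-channel feature map: H rows x W columns of activations.
FeatureMap = List[List[float]]


def mean_pool(region: FeatureMap) -> float:
    """Average all values in a 2-D region."""
    values = [v for row in region for v in row]
    return sum(values) / len(values)


def split_into_parts(feature: FeatureMap, parts: int = 3) -> List[FeatureMap]:
    """Split the rows of a feature map into `parts` contiguous stripes.

    Splitting along the height axis is an assumption; the abstract only
    says the feature is divided into three parts.
    """
    height = len(feature)
    if height < parts:
        raise ValueError("feature map has fewer rows than parts")
    # Integer boundaries guarantee every stripe has at least one row.
    bounds = [i * height // parts for i in range(parts + 1)]
    return [feature[bounds[i]:bounds[i + 1]] for i in range(parts)]


def two_branch_descriptor(feature: FeatureMap) -> List[float]:
    """Concatenate one global descriptor with three local descriptors."""
    global_desc = mean_pool(feature)                 # global branch: whole lip region
    local_descs = [mean_pool(p) for p in split_into_parts(feature)]  # local branch
    return [global_desc] + local_descs


# Toy 6 x 2 feature map whose three stripes have different activation levels.
feature = [
    [1.0, 1.0],
    [1.0, 1.0],
    [2.0, 2.0],
    [2.0, 2.0],
    [6.0, 6.0],
    [6.0, 6.0],
]
descriptor = two_branch_descriptor(feature)
print(descriptor)  # [3.0, 1.0, 2.0, 6.0]
```

In this toy example the global value averages away the difference between stripes, while the three local values keep it, which is the intuition behind pairing a whole-lip branch with a part-based branch.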


Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 5	License type: CC BY 4.0


Similar Papers

Thorax disease classification with attention guided convolutional neural network
Qingji Guan ... Yi Yang
Pattern Recognition Letters | VOL. 131
30 Nov 2019

Deep Patch-based Global Normal Orientation
Shiyao Wang ... Shuhua Li
Computer-Aided Design | VOL. 150
01 Sep 2022

Lip Motion Magnification Network for Lip Reading
Xueyi Zhang ... Li Liu
29 Oct 2021

An Efficient High-Resolution Global–Local Network to Detect Lunar Features for Space Energy Discovery
Yutong Jia ... Gang Wan
Remote Sensing | VOL. 14
13 Mar 2022
