Abstract

Lip Reading is the technology of obtaining the language content by analyzing the change of the speaker’s lip shape and recognizing the information of the lip movement. Lip reading helps people with hearing disabilities understand what other people are saying, which is difficult for humans. This paper proposes a novel Lip Reading model using Transformer network, to achieve a lip reading model with high accuracy. The main process of the model includes the processing of data sets, the extraction of lip features using the pre-trained neural network, and then input into Transformer network for training. Finally, our model achieves a word-level lip reading accuracy of 45.81% on the open source GRID corpus.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.