Abstract

Sign language is a crucial form of communication for the hearing impaired, presenting unique challenges for technological interpretation. This project focuses on developing a Sign Language Detection Using Action Recognition (SLDAR) system to address these challenges. Leveraging Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and multimodal fusion techniques, our system aims to accurately detect and recognize sign language gestures in real-time. The project involves collecting and annotating a diverse dataset comprising video, audio, and text data. We then design and train a deep learning model that combines features from these modalities to improve accuracy and robustness.The project involves collecting and annotating a diverse dataset, designing a deep learning model for multimodal fusion, and implementing a user-friendly interface for real-time interaction. The system's performance is evaluated using metrics such as accuracy, precision, and recall. Our goal is to enhance accessibility and communication for the hearing impaired through the development of this advanced sign language recognition system. Keywords:Acoustic Features, Linguistic Context, Mean Opinion Score (MOS), Natural Language Processing, NLP Algorithms, Signal-to-Noise Ratio (SNR), Speech Enhancement, Speech Quality, Telecommunication Systems, Voice Assistants.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call