RFaNet: Receptive Field-Aware Network with Finger Attention for Fingerspelling Recognition Using a Depth Sensor

Shih-Hung Yang,Yao-Mao Cheng,Jyun-We Huang,Yon-Ping Chen

doi:10.3390/math9212815

Shih-Hung Yang, Yao-Mao Cheng + Show 2 more

Open Access

https://doi.org/10.3390/math9212815

Copy DOI

Abstract

Automatic fingerspelling recognition tackles the communication barrier between deaf and hearing individuals. However, the accuracy of fingerspelling recognition is reduced by high intra-class variability and low inter-class variability. In the existing methods, regular convolutional kernels, which have limited receptive fields (RFs) and often cannot detect subtle discriminative details, are applied to learn features. In this study, we propose a receptive field-aware network with finger attention (RFaNet) that highlights the finger regions and builds inter-finger relations. To highlight the discriminative details of these fingers, RFaNet reweights the low-level features of the hand depth image with those of the non-forearm image and improves finger localization, even when the wrist is occluded. RFaNet captures neighboring and inter-region dependencies between fingers in high-level features. An atrous convolution procedure enlarges the RFs at multiple scales and a non-local operation computes the interactions between multi-scale feature maps, thereby facilitating the building of inter-finger relations. Thus, the representation of a sign is invariant to viewpoint changes, which are primarily responsible for intra-class variability. On an American Sign Language fingerspelling dataset, RFaNet achieved 1.77% higher classification accuracy than state-of-the-art methods. RFaNet achieved effective transfer learning when the number of labeled depth images was insufficient. The fingerspelling representation of a depth image can be effectively transferred from large- to small-scale datasets via highlighting the finger regions and building inter-finger relations, thereby reducing the requirement for expensive fingerspelling annotations.

Highlights

For deaf people, sign language is a means to communicate
We evaluated the effectiveness of receptive field-aware network with finger attention (RFaNet) in transferring knowledge from the large-scale ASL dataset to the small-scale
We proposed and evaluated RFaNet, a network that highlights the finger regions and builds inter-finger relations for fingerspelling recognition

Summary

Introduction

Sign language is a means to communicate. Communication between deaf and hearing people remains challenging. Automatic sign language recognition tackles this communication barrier by translating sign language to text or speech. Fingerspelling is a sign language that signals words letter by letter. Fingerspelling enables the communication of technical terms and other terms lacking a representation in sign language. Note that ~35% of words in social interactions refer to technical topics requiring fingerspelling [1]

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mathematics	Publication Date: Nov 5, 2021
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

RFaNet: Receptive Field-Aware Network with Finger Attention for Fingerspelling Recognition Using a Depth Sensor

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Mathematics

Lead the way for us

Similar Papers

Towards multi-scale feature detection repeatable over intensity and depth images
Hatem A Rashwan ... Geraldine Morin
-
Hatem A Rashwan, et. al.Hatem A Rashwan ... Geraldine Morin
01 Sep 2016
01 Sep 2016

A Fine-Grained Visual Attention Approach for Fingerspelling Recognition in the Wild
Kamala Gajurel ... Cuncong Zhong
-
Kamala Gajurel, et. al.Kamala Gajurel ... Cuncong Zhong
17 Oct 2021
17 Oct 2021

Assembly Monitoring Using Semantic Segmentation Network Based on Multiscale Feature Maps and Trainable Guided Filter
Chengjun Chen ... Changzhi Li
IEEE Transactions on Instrumentation and Measurement | VOL. 71
Chengjun Chen, et. al.Chengjun Chen ... Changzhi Li
01 Jan 2021
IEEE Transactions on Instrumentation and Measurement | VOL. 71

A Crossmodal Multiscale Fusion Network for Semantic Segmentation of Remote Sensing Data
Xianping Ma ... Man-On Pun
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 15
Xianping Ma, et. al.Xianping Ma ... Man-On Pun
01 Jan 2021
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RFaNet: Receptive Field-Aware Network with Finger Attention for Fingerspelling Recognition Using a Depth Sensor

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Mathematics