A cross-lingual adaptation approach for rapid development of speech recognizers for learning disabled users

Marek Bohac,Michaela Kucharova,Petr Červa,Zoraida Callejas,Jan Nouza

doi:10.1186/s13636-014-0039-0

Abstract

Building a voice-operated system for learning disabled users is a difficult task that requires a considerable amount of time and effort. Due to the wide spectrum of disabilities and their different related phonopathies, most approaches available are targeted to a specific pathology. This may improve their accuracy for some users, but makes them unsuitable for others. In this paper, we present a cross-lingual approach to adapt a general-purpose modular speech recognizer for learning disabled people. The main advantage of this approach is that it allows rapid and cost-effective development by taking the already built speech recognition engine and its modules, and utilizing existing resources for standard speech in different languages for the recognition of the users’ atypical voices. Although the recognizers built with the proposed technique obtain lower accuracy rates than those trained for specific pathologies, they can be used by a wide population and developed more rapidly, which makes it possible to design various types of speech-based applications accessible to learning disabled users.

Highlights

Millions of individuals suffer from learning disabilities that affect their speech production
3 Proposed method As the development of speech recognition technologies starting from scratch is a very time- and resourceconsuming process, we propose to avoid these costs by means of cross-lingual adaptation
TP means true positives, FP means false positives, FN means false negatives, Nkw denotes the number of keywords in the vocabulary, Dur stands for the total duration of recordings, and Nrec is the number of words in the reference transcription that really appear in each audio recording

Summary

Introduction

Millions of individuals suffer from learning disabilities that affect their speech production. These conditions result in atypical voices that are very difficult to understand even for human listeners, as they may affect one or more of the major language subsystems, including phonology, morphology, syntax and semantics. Focusing on phonology, impaired speech can affect voice timing, pitch, volume, fluency and articulation [1]. Different studies have focused on the nature of such mispronunciations and their impact in intelligibility. In [3], the authors focus on how to measure the intelligibility of atypical voices objectively along different perceptual dimensions

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Audio, Speech, and Music Processing	Publication Date: Oct 18, 2014
Citations: 19	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

A cross-lingual adaptation approach for rapid development of speech recognizers for learning disabled users

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing

Lead the way for us

Similar Papers

Understanding Intervention for (C)APD: As Easy as A-B-C
Jeanane M Ferre
The ASHA Leader | VOL. 12
Jeanane M FerreJeanane M Ferre
01 Aug 2007
The ASHA Leader | VOL. 12

Rapid and cost-effective technology development using TCAD: a case study
Adil Shafi ... Jim Mcginty
-
Adil Shafi, et. al.Adil Shafi ... Jim Mcginty
27 Apr 1999
27 Apr 1999

MHealth App to Facilitate Remote Care for Patients With COVID-19: Rapid Development of the DrCovid+ App.
Jamaica Pei Ying Tan ... Chen Ee Lee
JMIR Formative Research | VOL. 7
Jamaica Pei Ying Tan, et. al.Jamaica Pei Ying Tan ... Chen Ee Lee
07 Feb 2023
JMIR Formative Research | VOL. 7

RAPADAPTE for rapid guideline development: high-quality clinical guidelines can be rapidly developed with limited resources.
Brian S Alper ... Anggie Ramirez-Morera
International Journal for Quality in Health Care | VOL. 28
Brian S Alper, et. al.Brian S Alper ... Anggie Ramirez-Morera
19 Apr 2016
International Journal for Quality in Health Care | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A cross-lingual adaptation approach for rapid development of speech recognizers for learning disabled users

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing