ALT Speech Recognition System using F0 Improvement and Spectral Tilt Method

Krishna Kumar E,Inbanila K*

doi:10.35940/ijeat.f9348.088619

Abstract

Human Beings use voice as the medium for communication. Human Speech is a very complex signal with multiple frequencies, amplitudes and intensities that mix up to convey specific information. In international terminology, voice disorders are described as dysphonia. Various dysphonia’s are clearly organic origin due to nervous, muscular, neuro or cellular degenerative disease affecting the body or it is from local laryngeal changes. Other dysphonia’s having no visible laryngeal causes are grouped as non organic involving habitual dysphonia’s that arise from faulty speaking habits or the psycho genic dysphonia’s that stem from emotional causes. This paper looks at a speech recognition system for disordered speech generated by Physically Disabled people using Artificial Larynx Transducer (ALT) device from the perspective of Speech Signal Processing. From the ALT speech features like formant, pitch and spectral tilt is estimated. For formant frequency estimation RNN technique is used. Before training the system pitch frequency improvement is accomplished. Now the features and homomorphic based coefficients are used for training the system. The same operation is performed during the test phase and compared with the training set. Comparison and decision making is accomplished using distance estimator.

Full Text