Abstract

Automatic Speech Recognition schemes are the alternating modes in which individuals interrelate with various mobile application applications. The user interactivity needs are a huge vocabulary identification system, high accuracy, energy-efficient solution and time. Although the automatic Speech Recognition system requires power budget and huge memory bandwidth, it is not applicable for many tiny forms of battery-controlled devices. Hence, an effective method is developed using the proposed Multi-kernel Perceptual linear predictive and Stochastic Biogeography-based whale optimization algorithm optimization to adapt non-audible mutter to regular speech. First, the input speech signal is initially given to the pre-processed module. Then, the features, such as spectral centroid, pitch chroma, Taylor amplitude modulation spectrogram (AMS), spectral skewness, and the developed Multi-kernel Perceptual linear predictive, are extracted to determine the appropriate features. After extracting features, the speech recognition is performed based on Deep Convolutional Neural Network, which is trained by the proposed Stochastic Biogeography whale optimization algorithm. The Stochastic Biogeography whale optimization algorithm combines the stochastic gradient descent method, whale optimization algorithm, and biogeography-based optimization. The developed model showed improved results with maximum accuracy of 0.985, minimal FPR of 0.001, maximal TPR of 1, respectively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call