Abstract

Deep Neural Networks (DNNs) have recently been widely used for pattern recognition and classification because of their high accuracy. In this paper, we propose four DNN architectures and compare them in terms of accuracy and training time. The proposed models are evaluated on a speech recognition task using the TIDIGITS corpus, with the Mel-Frequency Cepstral Coefficients (MFCC) technique used to extract feature vectors from the speech data. The modified triangular architecture gave the highest accuracy of the four, 99.31%, while the triangular architecture gave the shortest training time, 49.72 s. Furthermore, when compared with the existing Hidden Markov Model based speech recognition, the proposed DNN provides an accuracy increase of 2.33%.
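For readers unfamiliar with MFCC, the following is a minimal sketch of the mel-scale mapping that underlies it. It is not the feature-extraction pipeline used in this work, and the abstract gives no configuration details. The conversion formula is one commonly used variant of the mel scale, and the function names, filter count, and frequency range are assumptions chosen for illustration only.

```python
import math


def hz_to_mel(f_hz):
    """Convert frequency in Hz to mels (common 2595*log10(1 + f/700) variant)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel: convert mels back to Hz."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, f_min_hz, f_max_hz):
    """Center frequencies (Hz) of n_filters filters equally spaced on the mel scale.

    The band edges f_min_hz and f_max_hz are excluded, so the centers lie
    strictly inside the band.
    """
    m_lo = hz_to_mel(f_min_hz)
    m_hi = hz_to_mel(f_max_hz)
    step = (m_hi - m_lo) / (n_filters + 1)
    return [mel_to_hz(m_lo + step * (i + 1)) for i in range(n_filters)]


# Hypothetical settings: 26 filters over 0-4000 Hz (not taken from the paper).
centers = mel_center_frequencies(26, 0.0, 4000.0)
```

Because the filters are equally spaced in mels, their centers are packed densely at low frequencies and spread out at high frequencies, which roughly mirrors how human hearing resolves pitch. In a full MFCC pipeline, the log energies of such filters are then passed through a discrete cosine transform to produce the coefficients.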
