Abstract

Voice recognition systems mostly suffer from environmental effects and accent differences. Therefore, studies on speech recognition have begun to be examined using deep learning which is a method known to be successful in speech recognition and classification. In this study, 12 different voice commands are defined using convolutional neural network, which is a deep learning structure. In this study, the effect of dataset size on test and recognition accuracy was investigated. In addition, a different dataset which was prepared from the records of people whose main language is Turkish to investigate the effect of different accents on both test and recognition accuracy. In the experiments when the test dataset including native-speaker voice records is used, the test accuracy was obtained as 94.64% for large dataset and 64.81% for small dataset. On the other hand when the test dataset including foreigner's voice records the test accuracy reduced to 63.29% for large and 33.18% for small dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.