Abstract

Recurrent neural networks, as an effective method to study the analysis and prediction of large-scale time series, have existed in various applications on time series data, but the traditional RNN neural network has the problem of gradient disappearance and the problem of poor prediction accuracy has not been solved. This paper proposes a vocal recognition system based on LSTM neural network, whose core structure can be divided into four parts: forgetting gate, input gate, cell The core structure can be divided into four parts: forgetting gate, input gate, state and output gate, which are more suitable to analyze the characteristics of different human voice patterns, and can obtain higher recognition accuracy after the process of feature extraction, data enhancement, model training and voice pattern recognition. Finally, the performance of the LSTM neural network is tested on 300 speech data, and the test results prove that the LSTM neural network can obtain high recognition rate with less iterations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.