Abstract

Recurrent neural networks, as an effective method to study the analysis and prediction of large-scale time series, have existed in various applications on time series data, but the traditional RNN neural network has the problem of gradient disappearance and the problem of poor prediction accuracy has not been solved. This paper proposes a vocal recognition system based on LSTM neural network, whose core structure can be divided into four parts: forgetting gate, input gate, cell The core structure can be divided into four parts: forgetting gate, input gate, state and output gate, which are more suitable to analyze the characteristics of different human voice patterns, and can obtain higher recognition accuracy after the process of feature extraction, data enhancement, model training and voice pattern recognition. Finally, the performance of the LSTM neural network is tested on 300 speech data, and the test results prove that the LSTM neural network can obtain high recognition rate with less iterations.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call