Abstract

: Nowadays, speech recognition has become one of the important technologies for human- computer interaction. Speech recognition is essentially a process of speech training and pattern recognition, which makes feature extraction technology particularly essential. The quality of feature extraction is directly related to the accuracy of speech recognition. Dynamic feature parameters can effectively improve the accuracy of speech recognition. These parameters make the speech dynamic feature extraction to have a higher research value. The traditional dynamic feature extraction method is easier to generate more redundant information, resulting in low recognition accuracy. Therefore, based on a new speech feature extraction method, which is based on deep learning for speech feature extraction, is proposed in the present study. Firstly, the speech signal is preprocessed by pre-emphasis, windowing, filtering, and endpoint detection. Then, the sliding differential cepstral feature (SDC) is extracted, which contains the voice information of the front and back frames. Finally, the feature is used as input to extract the dynamic features that represent the depth essence of speech information through the deep self-encoding neural network. The simulation results show that the dynamic features extracted by in-depth learning have better recognition performance than the original features, and have a good effect on speech recognition.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.