Oral pronunciation refers to the way words are spoken aloud, encompassing aspects like accent, intonation, and clarity. It plays a crucial role in effective communication, influencing listeners to understand the speaker. The purpose of this research is to construct an English oral pronunciation evaluation model based on a deep learning algorithm. In this research, we propose a novel Dung Beetle Optimization-driven Customized Deep Belief Network (DBO-CDBN) for accurate detection and evaluation of English oral pronunciation. We obtain a dataset that comprises audio recordings of English speech, including both correct and mispronounced samples to train and evaluate the suggested pronunciation evaluation model. Data normalization is used to pre-process the gathered raw audio data, it enhances the overall quality of the data. The Gammatone Frequency Cepstral Coefficients (GFCC) algorithm is employed to extract the crucial features from the obtained audio data. In our proposed model, the DBO optimization algorithm iteratively finetunes the CDBN architecture to enhance the evaluation accuracy. The approach, which is executed in python, is shown to be more successful in pronunciation evaluation and to provide prospective advances in the area when its performance is examined and compared to other approaches in terms of precision (96.78%), recall (93.65%), and [Formula: see text]-measure (94.53%). The established model is implemented using Python software. During the outcome analysis phase, we evaluate our model’s performance across various parameters. In addition, we also performed a comparative analysis using diverse existing methodologies. The findings demonstrate the excellence and effectiveness of the suggested future economic forecasting model.
Read full abstract