Arabic dialect identification (ADI) is a specific task of natural language processing (NLP) that intends to forecast the Arabic language dialect of the input text automatically. ADI is the preliminary step toward establishing many NLP applications, including cross-language text generation, multilingual text-to-speech synthesis, and machine translation. The automatic classification of the Arabic dialect is the first step in various dialect-sensitive Arabic NLP tasks. ADI includes predicting the dialects related to the textual input and classifying them on their respective labels. As a result, increased interest has been gained in the last few decades to address the problems of ADI through deep learning (DL) and machine learning (ML) algorithms. The study develops an Arabic multi-class dialect recognition using fast random opposition-based fractals learning aquila optimizer with DL (FROBLAO-DL) technique. The FROBLAO-DL technique utilizes the optimal DL model to identify distinct types of Arabic dialects. In the FROBLAO-DL technique, data preprocessing is involved in cleaning the input Arabic dialect dataset. In addition, the ROBERTa word embedding process is used to generate word embedding. The FROBLAO-DL technique uses attention bidirectional long short-term memory (ABiLSTM) network to identify distinct Arabic dialects. Moreover, the ABiLSTM model’s hyperparameter tuning is implemented using the FROBLOA method. The performance evaluation of the FROBLAO-DL method is tested under the Arabic dialect dataset. The empirical analysis implies the supremacy of the FROBLAO-DL technique over recent approaches under various measures.
Read full abstract