Abstract

This paper aims to automatically design optimal LSTM topologies using the clonal selection algorithm (CSA) to solve text classification tasks such as sentiment analysis and SMS spam classification. Designing optimal topologies involves determining the best configuration of hyperparameters that will give the best performance. The current state-of-the-art LSTM topologies are often designed using trial and error approaches which are incredibly time-consuming and require domain experts. Our proposed method, referred to as CSA-LSTM, is evaluated using the Large Movie Review Dataset (IMDB). Furthermore, to verify the robustness of the hyperparameters discovered by CSA for the IMDB dataset, we have used them for the other datasets, viz. the Twitter US Airline Sentiment and the SMS Spam Collection. Additionally, the discovered hyperparameters for the LSTM are combined with pre-determined convolutional neural network (CNN) layers to achieve the same or better results to fast the training time and fewer trainable parameters. For further verification and evaluation of the generalization ability and effectiveness of the proposed approach, it is compared with four machine learning algorithms widely used for text classification tasks: (1) random forest (RF), (2) logistic regression (LR), (3) support vector machine (SVM), and (4) multinomial naive Bayes (NB). The results of our experiments show that the LSTM topologies automatically designed by our CSA method are less expensive, reusable and outperform the machine learning algorithms and other models in the literature evaluated on the same three datasets. Through our proposed method, LSTM's best topology can be self-determined without any human intervention, making CSA-based algorithms a promising approach to automatically design optimal LSTM topologies that provide the best performance for a given task.

Highlights

  • Today, deep learning is one of the most up-and-coming technologies that are mainly driving the modern rise of artificial intelligence (AI) and machine learning [1], [2]

  • It shows that the performance of clonal selection algorithm (CSA)-convolutional neural network (CNN)-LSTM outperforms all other models in all evaluation metrics followed by the CSA-LSTM

  • Due to the shortcomings of current methods and the limited computing resources available to experimenters, optimizing LSTM topologies are often performed by domain experts who adopt innovative theoretical insights and intuitions gained from experience

Read more

Summary

Introduction

Deep learning is one of the most up-and-coming technologies that are mainly driving the modern rise of artificial intelligence (AI) and machine learning [1], [2]. With the significant advances in technology and algorithms in the last few years, deep learning has paved the way for a new generation of AI applications [3]. In many of these applications, i.e., classification from text, sound, or images, the performance of deep learning algorithms matched and sometimes exceeding human-level performance [4], [5]. Each LSTM cell has the same inputs and outputs as a regular RNN but has more parameters and a gating system to regulate the information flow.

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.