Abstract

This experimental study addresses the problem of predicting the direction of stocks and the movement of stock price indices for three major stocks and stock indices. The proposed approach for processing input data involves the computation of ten technical indicators using stock trading data. The dataset used for the evaluation of all the prediction models consists of 11 years of historical data from January 2007 to December 2017. The study comprises four prediction models which are Long Short-Term Memory, XGBoost, Support Vector Machine ( and Random forests. Accuracy scores and F1 scores for each of the prediction models have been evaluated using this input approach. Experimental results reveal that a continuous data approach using ten technical indicators gives the best performance in the case of the Random Forest classifier model with the highest accuracy of 84.89% (average wise 83.74%) and highest F1 score of 89.33% (average wise 83.74%). The experiments also give us an insight into why a Naïve Bayes Classification model is not a suitable prediction model for the above task.

Highlights

  • Everyone wants to earn money in the shortest possible time; stock market can be one of the instruments to fulfill such dreams

  • Experimental results reveal that a continuous data approach using 10 technical indicators gives the best performance in the case of the random forest classifier model with the highest accuracy of 84.89% and highest F1 score of 89.33%

  • The findings indicate that, by forecasting a 10-day inventory flow, other classical ML algorithms (Decision Tree (DT), Random Forest (RF) and Neural Network (NN)) were outperformed in the proposed novel “homogenous” ensemble classification

Read more

Summary

INTRODUCTION

Everyone wants to earn money in the shortest possible time; stock market can be one of the instruments to fulfill such dreams. One significant reason for it was that the BNN model does not take into account past trends of the stock market while making predictions To overcome this shortcoming, the presented study makes use of an improvement in ANN models that is LSTM. The presented study makes use of an improvement in ANN models that is LSTM This is a type of Recurrent Neural Network which has a memory element that helps in taking into account previous trends while making future predictions, improving the accuracy score. It improves on the basic gradient boosting method Machine learning techniques such as Support Vector Machine (SVM) and Random Forest have proven to be extremely effective in time series forecasting, in stock market prediction.

LITERATURE REVIEW
RESEARCH DATA
PREDICTION MODELS
Random Forest
Splitting Criteria
Kernel
Regularization and Gamma
Feasibility of Naive Bayes
Why the Naive Bayes Algorithm is Naive
RESULT
Performance Metrics
Experimental Results
Results
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call