Abstract Forecasting cryptocurrencies as a financial issue is crucial as it provides investors with possible financial benefits. A slight improvement in forecasting performance can lead to increased profitability; Therefore, obtaining a realistic forecast is very important for investors. Bitcoin, frequently mentioned in recent due to its volatility and chaotic behavior, has become an investment tool, especially during and after the COVID-19 pandemic. In this study, selected ML techniques were investigated for predicting cryptocurrency movements by using technical indicator-based data sets and measuring the applicability of the techniques to cryptocurrencies that do not have sufficient historical data. In order to measure the effect of data size, Bitcoin’s last 1 year and 7 years of data were used. Following the related literature, Google trends and the number of tweets were used as input features, in addition to the most commonly used twelve technical indicators. Random Forest, K-Nearest Neighbors, Extreme Gradient Boosting (XGBoost-XGB), Support Vector Machine (SVM), Naive Bayes (NB), Artificial Neural Networks (ANN), and Long-Short-Term Memory (LSTM) network were optimized for best results. Accuracy, F1, and area under the ROC curve values were used to compare the model performance. For continuous data, ANN and SVM performed the best with the highest accuracy and outperformed the other ML models for complete and reduced sets. LSTM reached the best accuracy for trend data, but SVM, NB, and XGB models showed similar performance. The research shows that some indicators significantly affect prediction performance, and the data discretization process also improved the model’s accuracy. While the number of samples affects the results of many ML models, correctly optimized and fine-tuned models may also give excellent results even with less data.
Read full abstract