Using company-specific headlines and convolutional neural networks to predict stock fluctuations

Jonathan Readshaw, Stefano Giani

doi:10.1007/s00521-021-06324-9

Open Access

https://doi.org/10.1007/s00521-021-06324-9

Journal: Neural Computing and Applications
Publication Date: Jul 29, 2021
Citations: 5
License type: open-access

Affiliation: Durham University

Abstract

This work presents a convolutional neural network for the prediction of next-day stock fluctuations using company-specific news headlines. Experiments to evaluate model performance using various configurations of word embeddings and convolutional filter widths are reported. The total number of convolutional filters used is far fewer than is common, reducing the dimensionality of the task without loss of accuracy. Furthermore, multiple hidden layers with decreasing dimensionality are employed. A classification accuracy of 61.7% is achieved using pre-learned embeddings that are fine-tuned during training to represent the specific context of this task. Multiple filter widths are also implemented to detect phrases of different lengths that are key for classification. Trading simulations are conducted using the presented classification results. Initial investments are more than tripled over an 838-day testing period using the optimal classification configuration and a simple trading strategy. Two novel methods are presented to reduce the risk of the trading simulations: adjustment of the sigmoid class threshold and re-labelling of headlines using multiple classes. A combination of these approaches is found to more than double the Average Trade Profit achieved during baseline simulations.
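
The feature-extraction idea in the abstract (filters of several widths sliding over a headline's word embeddings, followed by max-pooling) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names `conv_max_pool` and `extract_features`, the ReLU activation, the toy embeddings and the filter weights are all assumptions made for illustration.

```python
def conv_max_pool(embedded, kernel, bias=0.0):
    """Slide one filter over a headline and keep its largest activation.

    embedded: list of word vectors, shape (n_words, emb_dim)
    kernel:   list of weight vectors, shape (width, emb_dim)
    """
    width = len(kernel)
    activations = []
    for start in range(len(embedded) - width + 1):
        window = embedded[start:start + width]
        z = bias + sum(
            w * x
            for k_row, x_row in zip(kernel, window)
            for w, x in zip(k_row, x_row)
        )
        activations.append(max(0.0, z))  # ReLU is an assumed activation
    # Max-over-time pooling; headlines shorter than the filter yield 0.0
    return max(activations, default=0.0)


def extract_features(embedded, filters_by_width):
    """Concatenate the pooled output of every filter across all widths."""
    return [
        conv_max_pool(embedded, kernel)
        for width in sorted(filters_by_width)
        for kernel in filters_by_width[width]
    ]


# Toy 3-word headline with 2-dimensional embeddings (made-up values)
headline = [[1.0, 0.0], [0.0, 2.0], [3.0, 1.0]]
filters = {
    1: [[[1.0, 1.0]]],                    # one filter of width 1
    2: [[[1.0, 0.0], [0.0, 1.0]],         # two filters of width 2
        [[-1.0, -1.0], [-1.0, -1.0]]],
}
print(extract_features(headline, filters))  # [4.0, 3.0, 0.0]
```

Each filter contributes a single pooled value regardless of headline length, so the feature vector passed to the hidden layers has one entry per filter. This is why using fewer filters directly reduces the dimensionality of the task.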

Highlights

Despite suggestions that the stock market is not predictable [1], many investors and researchers seek methods that can provide market fluctuation predictions to aid investment strategy
A much larger training set would be required for general context relationships to be represented in self-learnt embeddings. These observations suggest that non-static embeddings provide the best configuration, both because they can be fine-tuned to the task in question and because they retain a more general context of words, allowing for better application to unseen headlines and new tasks
The largest single-day loss from an investment is 11.3%; the model predicts σ(z)_mean = 0.93 based on the previous day's headlines. The effect of these incorrect predictions with high σ(z)_mean, coupled with a significantly reduced number of trades, leads to lower performance metrics than in the baseline case at values of t > 0.75. These results demonstrate the shortcomings of making predictions based solely on company headlines: the network can make a positive prediction with high certainty based on a collection of headlines and a significant loss can still be made
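
The buy-threshold rule referred to in the last highlight can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' trading simulation: the helper names `should_buy` and `average_trade_profit` are hypothetical, Average Trade Profit is assumed here to be the mean fractional return over the days on which a trade is made, and all sample data are invented except the 0.93 mean score and 11.3% loss quoted above.

```python
from statistics import mean


def should_buy(headline_scores, threshold=0.5):
    """Buy only if the mean sigmoid output over the previous day's
    headlines exceeds the threshold t."""
    if not headline_scores:
        return False  # no headlines, no trade
    return mean(headline_scores) > threshold


def average_trade_profit(days, threshold):
    """Mean next-day return over the days on which a trade is made.

    days: list of (headline_scores, next_day_return) pairs.
    Returns None if no trade is made at this threshold.
    """
    returns = [ret for scores, ret in days if should_buy(scores, threshold)]
    return mean(returns) if returns else None


# Hypothetical data: per-headline sigmoid outputs and next-day returns.
days = [
    ([0.90, 0.96], -0.113),  # mean 0.93, yet an 11.3% loss
    ([0.60, 0.70], 0.02),
    ([0.30, 0.40], 0.01),
    ([0.80, 0.90], 0.03),
]

for t in (0.5, 0.75):
    print(t, average_trade_profit(days, t))
```

Raising t removes low-confidence trades, but a confident wrong prediction such as the first day still passes the filter and then weighs more heavily on the average because fewer trades remain.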

Summary

Introduction

Despite suggestions that the stock market is not predictable [1], many investors and researchers seek methods that can provide market fluctuation predictions to aid investment strategy. Advances in machine learning (ML) and natural language processing (NLP) have led to a shift in focus from technical to fundamental analysis. This new approach uses data such as news articles and historical stock prices and is based upon the efficient market hypothesis, which states that an asset price reflects all available information [2]. Mittermayer [23] focuses on intra-day predictions, whereas long-term trends are briefly considered in the work of Ding [24]. Methods such as support vector machines [22] and complex decision trees [17] remain popular for predictive tasks of this nature.

Preprocessing

Embedding

Convolution

Max-pooling

Fully connected hidden layers

Output node

Training

Network architectures

Dataset

Experimental procedure

Optimum model configuration

Effect of filter width

Word embeddings

Overall optimal model

Trading simulations

Buy threshold

Modification to multi-class labelling

Findings

Conclusion
