This paper presents a novel approach to representing the Limit Order Book data at a given timestamp using the Ordered Fuzzy Numbers concept. The limit order book contains all buy and sell orders placed by investors, updated in real-time, for the most liquid securities, even several hundred times a minute. Due to its irregular nature (different and dynamic changes in the number of buy and sell orders), direct calculations on the order book data are not feasible without transforming it into feature vectors. Currently, most studies use a price level-based data representation scheme when applying deep learning models on limit order book data. However, this scheme has limitations, particularly its sensitivity to subtle perturbations that can negatively impact model performance. On the other hand, the ordered fuzzy number is a mathematical object (a pair of two functions) used to process imprecise and uncertain data. Ordered Fuzzy Numbers possess well-defined arithmetic properties. Converting the limit order book data to ordered fuzzy numbers allows the creation of a time series of ordered fuzzy numbers (order books) and use them for further calculations, e.g., to represent input data for deep learning models or employing the concept of fuzzy time series in various domains, such as defining liquidity measures based on limit order book data. In this paper, the proposed approach is tested using one-year market data from the Polish Stock Exchange for the five biggest companies. The DeepLOB model is employed to predict mid-price movement using different input data representations. The proposed representation of Limit Order Book data demonstrated remarkably stable out-of-sample prediction accuracy, even when subjected to data perturbation.
Read full abstract