Financial Machine Learning Research Articles

Conventionally, random forests are built from “greedy” decision trees which each consider only one split at a time during their construction. The sub-optimality of greedy implementation has been well-known, yet mainstream adoption of more sophisticated tree building algorithms has been lacking. We examine under what circumstances an implementation of less greedy decision trees actually yields outperformance. To this end, a “stepwise lookahead” variation of the random forest algorithm is presented for its ability to better uncover binary feature interdependencies. In contrast to the greedy approach, the decision trees included in this random forest algorithm, each simultaneously consider three split nodes in tiers of depth two. It is demonstrated on synthetic data and financial price time series that the lookahead version significantly outperforms the greedy one when (a) certain non-linear relationships between feature-pairs are present and (b) if the signal-to-noise ratio is particularly low. A long-short trading strategy for copper futures is then backtested by training both greedy and stepwise lookahead random forests to predict the signs of daily price returns. The resulting superior performance of the lookahead algorithm is at least partially explained by the presence of “XOR-like” relationships between long-term and short-term technical indicators. More generally, across all examined datasets, when no such relationships between features are present, performance across random forests is similar. Given its enhanced ability to understand the feature-interdependencies present in complex systems, this lookahead variation is a useful extension to the toolkit of data scientists, in particular for financial machine learning, where conditions (a) and (b) are typically met.

Read full abstract

Most statistical arbitrage strategies in the academic literature solely rely on price time series. By contrast, alternative data sources are of growing importance for professional investors. We contribute to bridging this gap by assessing the price-predictive value of millions of tweets on intraday returns of the S&P 500 constituents from 2014 and 2015. For this purpose, we design a machine learning system addressing specific challenges inherent to this task. At first, building on the literature of financial dictionaries, we engineer domain-specific features along three categories, i.e., directional indicators, relevance indicators and meta features. Next, we leverage a random forest to extract the relationship between these features and subsequent stock returns in a low signal-to-noise setting. For performance evaluation, we run a rigorous event-based backtesting study across all tweets and stocks. We find annualized returns of 6.4 percent and a Sharpe ratio of 2.2 after transaction costs. Finally, we illuminate the machine learning black box and unveil sources of profitability: First, results are both driven and limited by the temporal clustering of tweets, i.e., the majority of profits stem from tweets clustered closely together in time, corresponding to high-event situations. Second, the importance of included features follows an economic rationale, e.g., tweets with positive sentiment tend to yield positive returns and vice versa. Third, we find that stocks of medium market capitalization and from the consumer and technology sectors contribute most to our results, which we interpret as a trade-off between tweet coverage and tweet relevance.

Read full abstract

Financial Machine Learning Research Articles

Related Topics

Articles published on Financial Machine Learning

Reinforcement prompting for financial synthetic data generation

Portfolio construction using explainable reinforcement learning

Predicting Cryptocurrency Returns Using Classification and Regression Machine Learning Model

Study and Analysis of Deep Learning Techniques for Solving Financial Problems

Credit Risk Modeling with Graph Machine Learning

Option Volatility Investment Strategy: The Combination of Neural Network and Classical Volatility Prediction Model

Feature Scaling for Financial Machine Learning

Uncovering feature interdependencies in high-noise environments with stepwise lookahead decision forests

Explainable fintech lending

A CLOSED-FORM SOLUTION FOR OPTIMAL ORNSTEIN–UHLENBECK DRIVEN TRADING STRATEGIES

Can Machines 'Learn' Finance?

Valuation Ratios, Surprises, Uncertainty or Sentiment: How Does Financial Machine Learning Predict Returns From Earnings Announcements?

Separating the signal from the noise – Financial machine learning for Twitter

Information leakage in financial machine learning research

Advances in Financial Machine Learning: Numerai's Tournament (Presentation Slides)

Dissecting Momentum: We Need to Go Deeper

Financial Machine Learning Regulation

Ten Applications of Financial Machine Learning

Advances in Financial Machine Learning: Lecture 5/10

Advances in Financial Machine Learning: Lecture 4/10

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Financial Machine Learning Research Articles

Related Topics

Articles published on Financial Machine Learning

Reinforcement prompting for financial synthetic data generation

Portfolio construction using explainable reinforcement learning

Predicting Cryptocurrency Returns Using Classification and Regression Machine Learning Model

Study and Analysis of Deep Learning Techniques for Solving Financial Problems

Credit Risk Modeling with Graph Machine Learning

Option Volatility Investment Strategy: The Combination of Neural Network and Classical Volatility Prediction Model

Feature Scaling for Financial Machine Learning

Uncovering feature interdependencies in high-noise environments with stepwise lookahead decision forests

Explainable fintech lending

A CLOSED-FORM SOLUTION FOR OPTIMAL ORNSTEIN–UHLENBECK DRIVEN TRADING STRATEGIES

Can Machines 'Learn' Finance?

Valuation Ratios, Surprises, Uncertainty or Sentiment: How Does Financial Machine Learning Predict Returns From Earnings Announcements?

Separating the signal from the noise – Financial machine learning for Twitter

Information leakage in financial machine learning research

Advances in Financial Machine Learning: Numerai's Tournament (Presentation Slides)

Dissecting Momentum: We Need to Go Deeper

Financial Machine Learning Regulation

Ten Applications of Financial Machine Learning

Advances in Financial Machine Learning: Lecture 5/10

Advances in Financial Machine Learning: Lecture 4/10