Abstract

The project constructs a stock selection model by machine learning methods to enhance the performance of the benchmark index for individual investors. Stock returns prediction is a highly researched topic. However, it is a difficult problem because the stock prices are complex, non-linear, and chaotic. Moreover, overfitting is always an important issue in machine learning field. In this article, it shows that how to solve these problems by dealing with time series data, feature engineering, and model construction. We apply the stock selection model on S&P 500 index and FTSE 100 index. The result shows that the portfolios with stock selection model outperform the benchmarks, and 2% of the number of constitution stocks is the best choice for the stock selection model. Besides, feature importance analysis shows that the stock selection model can measure import features appropriately, which means it has the ability to adapt to different economic environments. In addition, the portfolios with fewer stocks usually outperform the portfolios with more stocks shows the good prediction of the stock selection model. The results imply that machine learning techniques have a good application in stock markets.

Highlights

  • Passive investment is popular and popular in recent years, since it can gain the reasonable returns from the markets

  • Most portfolios with stock selection model preformed much better than the benchmarks

  • Portfolios with fewer stocks usually performed better than portfolios with more stocks, which implied the accuracy of the prediction of stock selection model

Read more

Summary

Introduction

Passive investment is popular and popular in recent years, since it can gain the reasonable returns from the markets. William SCHWERT, 1983, showed the empirical results of Size factor [12] Momentum factors contains both price momentum [2] and trading volume [8, 9], some researches generate other method to measure the momentum [6]. Some investors care about the dividends [2] Technical index such as RSI and MACD are relative to the stock returns [14]. It is difficult to rely on the same factors to get excess return all the time In this project, random forest which is based on decision tree model is the main method of the stock selecting model. The target variable with discrete set variables is called classification tree, and the target variable with continuous set variables is called regression tree In this project, we use regression tree to predict the returns of stocks

Objectives
Methods
Findings
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.