Abstract
There is tremendous value in the ability to predict stock market trends and outcomes. The public sentiment surrounding a stock is unquestionably a vital factor contributing to the rise or fall of a stock price. This paper aims to detail how data from public sentiment can be integrated into traditional stock analyses and how these analyses can then be used to make predictions of stock price trends. Headlines from seven news publications and conversations from Yahoo! Finance's conversations forum were processed by the Valence Aware Dictionary and sEntiment Reasoner (VADER) natural language processing package to determine numerical polarities which represent a positive, negative, or neutral public sentiment around a stock ticker. The resulting polarities were paired with popular stock-table metrics (PEG Ratio, Forward EPS, etc.) to create a dataset for a Logistic Regression machine learning model. The model was trained on approximately 4400 major stocks to determine a binary “Buy” (1) or “Not Buy” (0) recommendation for each stock. The model achieved an F1 accuracy of 82.5% and for most major stocks, the model's recommendations were aligned with the stock analysts' ratings from the NASDAQ website. The logistic regression model would improve from leveraging a historical compass of data, given the hive-mind behavior that online discussion forums exhibit.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.