Opinion polls play an important role in modern democratic processes: they are known to not only affect the outcomes of elections, but also have a significant influence on government policy after elections. Recent years have seen large discrepancies between polls and outcomes at several major elections and referendums, stemming from decreased participation in polls and an increasingly volatile electorate. This calls for new ways to measure public support for political parties. In this paper, we propose a method for measuring the popularity of election candidates on social media using Machine Learning-based Natural Language Processing techniques. The method is based on detecting voting intentions in the data. This is a considerable advance upon earlier work using automatic sentiment analysis. We evaluate the method both intrinsically on a set of hand-labelled social media posts, and extrinsically – by forecasting daily election polls. In the extrinsic evaluation, we analyze data from the 2016 US presidential election, and find that voting intentions measured from social media provide significant additional predictive value for forecasting daily polls. Thus, we demonstrate that the proposed method can be used to interpolate polls both spatially and temporally, thus providing reliable, continuous and fine-grained information about public opinion on current political issues.
Read full abstract