Abstract

The huge volume and velocity of media content published on the Web presents a substantial challenge to human analysts. In prior work, we developed a system (network event detection, NED) to assist analysts by detecting events within high-volume news streams in real time. NED can process a heterogeneous stream of news articles or social media user posts, combining text mining and network analysis to detect breaking news stories and generate an easy-to-understand event summary. In this paper, we expand the NED event detection and summarisation approach in two ways. First, we introduce a new approach to named entity disambiguation for tweets, which contain minimal information due to brevity. Second, we apply sentiment analysis techniques to documents associated with a detected event to characterise the event as either broadly ‘positive’ or ‘negative’ based on media portrayal. Our expansion focuses on Twitter streams since Twitter has become an important news dissemination platform and is often the site where emerging events are first seen. To test the extended methodology, we apply it here to three data sets related to political elections in the UK and the USA. The addition of sentiment analysis to the NED event detection methodology improves the insight gained by the user by allowing quick evaluation of the perceived impact of an event. This approach may have potential applications in domains where public sentiment is relevant to decision-making around events, such as financial markets and politics.

Highlights

  • The growth and spread of the World Wide Web has radically changed the way news content is published and disseminated to the public

  • This creates the need for automated systems that can assist human analysts by processing large volumes of content from heterogeneous news streams in real time, detect and track breaking news events, and provide informative summarisations of complex media data sets

  • We demonstrate the expanded network event detection (NED) method using three Twitter data sets from three different political elections: (i) the US Presidential election of 2012, using data from Aiello et al (2013), in which President Barack Obama was elected; (ii) the US Presidential election of 2016, using data from Littman et al (2016), in which President Donald Trump was elected; and (iii) the UK General Elections of 2019, won by the Conservative Party led by Prime Minister Boris Johnson

Read more

Summary

Introduction

The growth and spread of the World Wide Web has radically changed the way news content is published and disseminated to the public. Traditional news platforms have moved online, while user-generated content such as blogs and social media have expanded the variety and availability of media. P. Williams have contributed to this work. In prior work Moutidis and Williams (2019), we introduced a method labelled the network event detection (NED) system (Fig. 1). NED is able to process heterogeneous streams of textual news documents, such as news articles, social media posts (Twitter, Reddit) and blog posts, identify important named entities (People, Organisations) within the evolving news stream, detect breaking news stories and generate an informative summary of each news event.

64 Page 2 of 12
Topic detection and tracking
Sentiment analysis for Twitter data
64 Page 4 of 12
Knowledge graph creation
Graph time series analysis
Summarising the detected events
64 Page 6 of 12
Extensions of the NED system
Extension 1: entity disambiguation
Extension 2: sentiment analysis
64 Page 8 of 12
Evaluation data sets
Results
64 Page 10 of 12
Discussion
Compliance with ethical standards
64 Page 12 of 12
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call