Early detection of heterogeneous disaster events using social media

Viktor Pekar,Chris Hale,Vincent Schmidt,Jane Binner,Hossein Najafi

doi:10.1002/asi.24208

Abstract

This article addresses the problem of detecting crisis‐related messages on social media, in order to improve the situational awareness of emergency services. Previous work focused on developing machine‐learning classifiers restricted to specific disasters, such as storms or wildfires. We investigate for the first time methods to detect such messages where the type of the crisis is not known in advance, that is, the data are highly heterogeneous. Data heterogeneity causes significant difficulties for learning algorithms to generalize and accurately label incoming data. Our main contributions are as follows. First, we evaluate the extent of this problem in the context of disaster management, finding that the performance of traditional learners drops by up to 40% when trained and tested on heterogeneous data vis‐á‐vis homogeneous data. Then, in order to overcome data heterogeneity, we propose a new ensemble learning method, and found this to perform on a par with the Gradient Boosting and AdaBoost ensemble learners. The methods are studied on a benchmark data set comprising 26 disaster events and four classification problems: detection of relevant messages, informative messages, eyewitness reports, and topical classification of messages. Finally, in a case study, we evaluate the proposed methods on a real‐world data set to assess its practical value.

Highlights

The precision rates obtained in this experiment are lower, but generally consistent with those obtained in experiments with the CrisisLex dataset, where the Maximum Entropy (MaxEnt) classifier reached 86.2% for Informativeness task, and the Gradient Boosting classifier achieved 41.8% for Eyewitnesses
To understand reasons for errors made by the classifiers, we looked at cases where both judges believed the classifiers assigned the wrong label and identified common error types: (1) news reports on accidents that are irrelevant to any rescue operations, (2) errors due to ambiguous words, (3) disaster events that took place far in the past, (4) fictional events, (5) general chatter
Our experiments show that this method clearly outperforms base classifiers and performs on par with several other popular ensemble classifiers (AdaBoost, Random Forests, Gradient Boosting)

Summary

Objectives

The problem of detecting disaster-related messages independently of the disaster type that we aim to solve is an example of such a situation: messages relating to different types of disasters tend to

Methods

Results

Discussion

Conclusion