Detection of housing and utility problems in districts through social media texts

Alexandr Zamiralov,Maria Khodorchenko,Denis Nasonov

doi:10.1016/j.procs.2020.11.023

Alexandr Zamiralov, Maria Khodorchenko + Show 1 more

Open Access

https://doi.org/10.1016/j.procs.2020.11.023

Copy DOI

Journal: Procedia Computer Science	Publication Date: Jan 1, 2020
Citations: 3	License type: cc-by-nc-nd

Affiliation: ITMO University

Abstract

Abstract Social media stores a significant amount of information which can be used for extraction of specific knowledge. A variety of topics that arise there concerns a lot of everyday life aspects, including urban-related problems. In this work, we demonstrate the way of using the texts from social media on the topic of housing and utility problems, such as litter on the streets, graffiti on a public building or noisy neighbours. Our aim is to develop an approach based on machine learning to automatically filter such citizen messages and classify them into several categories. To achieve this, we solve the classification problem with an almost unlimited number of negative categories using the One-Class approach and combine data from several resources to construct proper text embedding by combining results from the guided topic model and deep neural pretrained BERT method. Comparison with statistics taken from the official site indicates that the distributions of posts on each problem category are similar for districts of Saint-Petersburg

Full Text