Face off: Travel Habits, Road Conditions and Traffic City Characteristics Bared Using Twitter

Amit Agarwal,Durga Toshniwal

doi:10.1109/access.2019.2917159

Abstract

The adequacy of traditional transport related issues detection is often limited by physical sparse sensor coverage and reporting incident/issues to the emergency response system is labor intensive. The social media tweet text have been mined so as to identify the complaints regarding various road transportation issues of traffic, accident, and potholes. In order to identify and segregate tweets related to different issues, keyword-based approaches have been used previously, but these methods are solely dependent on seed keywords which are manually given and these set of keywords are not sufficient to cover all tweets posts. So, to overcome this issue, a novel approach has been proposed that captures the semantic context through dense word embedding by employing word2vec model. However, the process of tweet segregation on the basis of semantic similar keywords may suffer from the problem of pragmatic ambiguity. To handle this, Word2Vec model has been applied to match the semantically similar tweets with respect to each category. Furthermore, the hotspots have been identified corresponding to each category. However, due to the scarcity of geo-tagged tweets, we have proposed a hybrid method which amalgamates Named Entity Recognition (NER), Part of speech (POS), and Regular Expression (RE) to extract the location information from the tweet textual content. Due to the lack of availability of the ground truth dataset, model feasibility has been validated from the existing data records (i.e., published by government official accounts and reported on news media) and the evaluation results signify that the stated approach identifies few additional hotspots as compared to the existing reports while analyzing the tweets.

Highlights

In India, four major tier-1 cities (Mumbai, Delhi, Kolkata, and Bengaluru) annually losses 22 billion dollar due to congestion
In this paper, we introduced a framework that identifies incidents caused by non-recurrent events from the social media platform
The proposed framework can be divided into five major components which include collecting data from multiple sources, data preprocessing, identification of similar semantic keywords corresponding to the different categories, removing the pragmatic ambiguity and content based location identification for finding the vulnerable areas

Summary

INTRODUCTION

In India, four major tier-1 cities (Mumbai, Delhi, Kolkata, and Bengaluru) annually losses 22 billion dollar due to congestion It mainly induced from non-recurrent events such as accident, adverse road conditions, construction on roads, potholes, adverse weather condition, and inadequate drainage. It might be due to the restriction imposed by Twitter over tweet post length, i.e. 140 character limits It makes text classification and information extraction a challenging problem. This paper presents a methodology to crawl, pre-process and filter freely available tweets These tweets post analyzed to extract non-recurrent events information by using deep learning and Natural Language processing (NLP) techniques. The main contribution of this work can be summarized as follows: 1) Semantic Similar keywords:We have proposed and applying an adaptive semi-supervised method for tweets, by leveraging dense word embedding to identify semantic similar keywords for non-recurrent event’s.

RELATED WORK

METHODOLOGY

EXPERIMENTS AND RESULTS

MODEL FEASIBILITY

Findings

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 20	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Face off: Travel Habits, Road Conditions and Traffic City Characteristics Bared Using Twitter

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Named entity recognition for extracting concept in ontology building on Indonesian language using end-to-end bidirectional long short term memory
Joan Santoso ... Mauridhi Hery Purnomo
Expert Systems with Applications | VOL. 176
Joan Santoso, et. al.Joan Santoso ... Mauridhi Hery Purnomo
13 Mar 2021
Expert Systems with Applications | VOL. 176

POS Tagging and NER System for Kannada Using Conditional Random Fields
Arpitha Swamy ... Srinath S
International Journal of Information Retrieval Research | VOL. 11
Arpitha Swamy, et. al.Arpitha Swamy ... Srinath S
01 Oct 2021
International Journal of Information Retrieval Research | VOL. 11

InaNLP: Indonesia natural language processing toolkit, case study: Complaint tweet classification
Ayu Purwarianti ... Irfan Afif
-
Ayu Purwarianti, et. al.Ayu Purwarianti ... Irfan Afif
01 Aug 2016
01 Aug 2016

End to End Parts of Speech Tagging and Named Entity Recognition in Bangla Language
Jillur Rahman Saurav ... Farida Chowdhury
-
Jillur Rahman Saurav, et. al.Jillur Rahman Saurav ... Farida Chowdhury
01 Sep 2019
01 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Face off: Travel Habits, Road Conditions and Traffic City Characteristics Bared Using Twitter

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access