Abstract

Anomaly detection for large scale cellular networks can be used by network operators to optimize network performance and enhance mobile user experience. This paper aims at detecting user anomalies from spatio-temporal cell phone activity data. We design an approach combining time series analysis and machine learning to extract the traffic patterns of areal units. This approach can cluster areal units with similar traffic patterns and segment a city into distinct groups. Then, in grouped-areas, we use a clustering technique to detect anomalous behaviors of the cellular network and verify the accuracy of the results using ground truth information collected from online sources. The results indicate that anomalies are associated with abruptly high or unexpected traffic demand at a specific location and time. In addition, we obtain anomaly-free data by removing anomalous data and train a decomposed traffic prediction model. It is observed that the prediction model trained with anomaly-free data can achieve lower normalized mean square error (NMSE), i.e., higher prediction accuracy, than the model trained with anomalous data.

Highlights

  • In the past decade, billions of mobile devices have led to a dramatic growth of global mobile data that are generated on a large scale, with an expected size of 77 exabytes (1018) per month by 2022 [1]

  • By using the data with removed anomalies, we can improve the accuracy of traffic prediction

  • EXPERIMENTAL RESULTS AND DISCUSSIONS the performance of the proposed approach is evaluated by using the Call Detail Records (CDRs) dataset from real network Telecom Italia

Read more

Summary

Introduction

Billions of mobile devices have led to a dramatic growth of global mobile data that are generated on a large scale, with an expected size of 77 exabytes (1018) per month by 2022 [1]. The large-scale mobile data traffic provides us with an opportunity to better understand user specific activity and mobile traffic demand [2]. A significant growth in traffic demand in a particular hotspot area is detrimental to user experience and may eventually give rise to a service outage. Detecting user anomalies is valuable for both network operators and users. It allows network operators to possess more information corresponding to regions of interest in the network to optimize mobile network resource allocation, balance the traffic load, and propose intelligent fault diagnostic solutions.

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call