The unfolding usage of mobile users inclusive of both 4G and 5G creates huge accumulation of data in cellular network. The network service providers need to ensure proper management of resource in terms of uninterrupted service with cost-effectiveness. The detection of cellular traffic and Short Message Service (SMS) spammers is very challenging. In this paper a novel method is proposed to analyse and detect the traffic anomalies and SMS spammers. To achieve this, Call Detail Record (CDR) issued by service provider is used. The CDR is pre-processed to convert into machine understandable format using mean-normalization technique. K-means clustering elbow method proves to be the best tool in identifying the traffic clusters in the network that detects both high and low traffic in the network. The novelty of the proposed work is the detection of low traffic clusters which usually is misled as sleeping cell or cell outage. The paper also presents a model designed to predict whether the message is spam SMS or ham SMS. The proposed model is suitable to run different classifiers like Logistic Regression, Multi nominal Naive Bayes, Support Vector Machine (SVM), Random Forest Classifier. The model gives the highest accuracy rate of 98.277% with SVM in detecting SMS spam.
Read full abstract