Abstract

Energy consumption is increasing exponentially with the increase in electronic gadgets. Losses occur during generation, transmission, and distribution. The energy demand leads to increase in electricity theft (ET) in distribution side. Data analysis is the process of assessing the data using different analytical and statistical tools to extract useful information. Fluctuation in energy consumption patterns indicates electricity theft. Utilities bear losses of millions of dollar every year. Hardware-based solutions are considered to be the best; however, the deployment cost of these solutions is high. Software-based solutions are data-driven and cost-effective. We need big data for analysis and artificial intelligence and machine learning techniques. Several solutions have been proposed in existing studies; however, low detection performance and high false positive rate are the major issues. In this paper, we first time employ bidirectional Gated Recurrent Unit for ET detection for classification using real time-series data. We also propose a new scheme, which is a combination of oversampling technique Synthetic Minority Oversampling TEchnique (SMOTE) and undersampling technique Tomek Link: “Smote Over Sampling Tomik Link (SOSTLink) sampling technique”. The Kernel Principal Component Analysis is used for feature extraction. In order to evaluate the proposed model’s performance, five performance metrics are used, including precision, recall, F1-score, Root Mean Square Error (RMSE), and receiver operating characteristic curve. Experiments show that our proposed model outperforms the state-of-the-art techniques: logistic regression, decision tree, random forest, support vector machine, convolutional neural network, long short-term memory, hybrid of multilayer perceptron and convolutional neural network.

Highlights

  • In the modern world, electricity utilization is increasing day by day

  • Many machine learning techniques have been applied for Electricity Theft Detection (ETD) including the Long Short-Term Memory (LSTM) method proposed in [24]

  • Buzau et al [36] have proposed a hybrid model which consists of LSTM and MLP to secure smart grid from electricity theft

Read more

Summary

Introduction

Electricity utilization is increasing day by day. It is broadly categorized into six main areas. Many authors proposed different approaches to solve these issues, which are broadly classified into three main categories: Artificial Intelligence (AI) and Machine Learning (ML)-based, State-based, and game theory-based systems. In game theory-based approaches, there is a game between utility and electricity theft [15] These approaches have high False Positive Rate (FPR) and low detection rate. The main focus of machine learning and artificial intelligence based systems is to analyze the patterns of real time series data. These systems extract useful information from a dataset by analyzing electricity consumption patterns [16]. Receiver Operating Characteristic (ROC) curve, F1-score, precision, and recall

Literature Review
Limitations
Problem Statement
Proposed Model
Data Preprocessing
Handling Imbalance Data
Feature Extraction using KPCA
Bidirectional Gated Recurrent Unit for Classification
Study of Hyperparameters Used for Experiments
Experimental Results
Performance Metrics
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.