Abstract

In practical engineering, electricity theft detection is usually performed on highly imbalanced datasets (i.e., the number of fraudulent samples is much smaller than the benign ones), which limits the accuracy of the classifier. To alleviate the data imbalance problem, this paper proposes simple data augmentation tricks (SDAT) to boost performance on electricity theft detection tasks. SDAT includes five simple but powerful operations: adding noises to electricity consumption readings, drifting values of electricity consumption readings, quantizing electricity consumption readings to a level set, adding a fixed value to electricity consumption readings, and adding changeable values to electricity consumption readings. In addition, eight potential tricks are also mentioned. Numerical simulations are conducted on a real-world dataset. The simulation results show that SDAT can significantly boost the performance of different classifiers, especially for small datasets. Besides, specific suggestions on how to select parameters of SDAT are provided for its migration use to other datasets.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call