Probabilistic Forecasting Based Joint Detection and Imputation of Clustered Bad Data in Residential Electricity Loads

Soyeong Park, Seungwook Yoon, Euiseok Hwang, Seokkap Ko, Byungtak Lee

doi:10.3390/en14010165

Abstract

Residential electricity load data can include numerous types of bad data, even clustered bad data, because they are typically captured by simple measurement instruments. For example, in a run of consecutive Not-a-Number (NaN) errors, the value just before or just after the run may appear as the sum of the actual values over the duration of the NaN series. To use load data that includes such erroneous data for prediction or data mining analysis, customized detection and imputation should be conducted. This study proposes a new joint detection and imputation method for handling clustered bad data in residential electricity loads. Examples of such data are known invalid data points, such as consecutive NaN or zero values, that are followed or preceded by an outlier. The proposed joint detection and imputation scheme first investigates the neighbors of the invalid data points using probabilistic forecasting techniques, which are applied to the next valid neighbors to determine whether they are anomalous. Then, adaptive imputation is applied on the basis of the detection result, which decides whether the candidate point should be imputed together with the invalid points. To assess the potential of the proposed scheme to characterize clustered bad data, we analyzed the electricity loads of 354 households. Moreover, joint detection and imputation were tested on randomly injected synthetic clustered bad data, consisting of NaN runs of various lengths followed by the sum of the actual values hidden by the NaNs. The proposed scheme detected clustered bad data with an accuracy of 95.5% and a false alarm rate of 3.6% across all households in the dataset. Outlier detection-assisted imputation schemes were evaluated for NaNs with optional outliers. The results demonstrate that these schemes significantly improve overall accuracy compared with schemes without outlier detection.
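The synthetic test data described in the abstract (a NaN run followed by a point that carries the accumulated energy) can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `inject_clustered_bad_data` and its parameters are hypothetical, and it assumes that the trailing point reports its own reading plus everything missed during the NaN run.

```python
import numpy as np

def inject_clustered_bad_data(load, start, length):
    """Return a copy of `load` with a NaN run of `length` points beginning at
    `start`, followed by one point holding the accumulated energy.

    Hypothetical helper for illustration only.
    """
    corrupted = np.asarray(load, dtype=float).copy()
    end = start + length  # index of the point right after the NaN run
    if start < 0 or length < 1 or end >= corrupted.size:
        raise ValueError("NaN run plus the trailing point must fit in the series")
    # Assumption: the trailing point holds its own value plus the values
    # that were lost during the NaN run.
    corrupted[end] = corrupted[start:end + 1].sum()
    corrupted[start:end] = np.nan
    return corrupted

load = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
corrupted = inject_clustered_bad_data(load, start=1, length=3)
# Indices 1..3 are now NaN and index 4 holds 2 + 3 + 4 + 5 = 14.
```

Keeping the original series alongside the corrupted copy makes it possible to score both the detection of the trailing outlier and the accuracy of any imputation against ground truth.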

Highlights

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
With growing concerns about energy and environmental sustainability, a considerable research effort has been made to achieve smart and efficient energy management that decreases the carbon footprint [1,2].
True negatives (TN) and false positives (FP) improved with the proposed method: TN increased by 6.8%, and FP decreased by 63.0%.
We implemented two cases to compare the performance of the accumulated outlier detection aware imputation (AOD-AI): one without AOD-AI and one with AOD-AI.
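The accuracy and false alarm rate quoted for the detector are derived from confusion-matrix counts. The sketch below uses the standard definitions of both metrics; whether the paper computes them in exactly this way is an assumption, and the counts in the example are made up for illustration rather than taken from the study.

```python
def detection_metrics(tp, tn, fp, fn):
    """Compute accuracy and false alarm rate from confusion-matrix counts.

    accuracy         = (TP + TN) / (TP + TN + FP + FN)
    false alarm rate = FP / (FP + TN)
    """
    total = tp + tn + fp + fn
    negatives = fp + tn
    if total == 0 or negatives == 0:
        raise ValueError("need at least one sample and one actual negative")
    accuracy = (tp + tn) / total
    false_alarm_rate = fp / negatives
    return accuracy, false_alarm_rate

# Hypothetical counts, not results from the paper.
accuracy, false_alarm_rate = detection_metrics(tp=90, tn=80, fp=20, fn=10)
# accuracy is 170 / 200 = 0.85 and false_alarm_rate is 20 / 100 = 0.2
```

Because the false alarm rate is normalized by the actual negatives only, a large relative drop in FP can coincide with a much smaller relative rise in TN, which matches the pattern in the highlight above.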

Summary

Introduction

Real residential energy consumption data tend to include bad data: Not-a-Number (NaN) or zero points, which are found scattered in several cases and sometimes clustered, and anomalies whose value is the sum of the actual values over the clustered bad data points. These outliers can significantly affect the performance of data-driven methods when handling clustered bad data. When the detection result suggests that the point preceding a cluster of bad data is an outlier, the imputation range should be extended to include that point.
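The idea of widening the imputation range when a neighbor is flagged can be sketched as below. This is a simplified stand-in, not the method of the paper: an empirical upper quantile of historical loads replaces the probabilistic forecast, and plain linear interpolation replaces the forward-backward joint imputation. The function name, the `upper_q` parameter, and the example data are all hypothetical.

```python
import numpy as np

def joint_detect_and_impute(series, start, length, history, upper_q=0.99):
    """Detect an accumulated outlier next to a NaN run and impute jointly.

    The NaN run occupies series[start:start + length]. A neighbor is flagged
    when it exceeds the `upper_q` quantile of `history` (a stand-in for a
    probabilistic forecast interval). Flagged neighbors are added to the
    imputation range, which is then filled by linear interpolation.
    """
    x = np.asarray(series, dtype=float).copy()
    threshold = np.quantile(np.asarray(history, dtype=float), upper_q)
    lo, hi = start, start + length
    flagged = []
    # Check the valid neighbor just before the run.
    if lo - 1 >= 0 and x[lo - 1] > threshold:
        lo -= 1
        flagged.append(lo)
    # Check the valid neighbor just after the run.
    if hi < x.size and x[hi] > threshold:
        flagged.append(hi)
        hi += 1
    # Treat flagged neighbors as missing so they are imputed with the run.
    x[lo:hi] = np.nan
    valid = ~np.isnan(x)
    idx = np.arange(x.size)
    x[~valid] = np.interp(idx[~valid], idx[valid], x[valid])
    return x, flagged

history = np.linspace(0.5, 2.0, 100)  # hypothetical past loads
series = [1.0, 1.2, np.nan, np.nan, np.nan, 5.0, 1.1, 1.0]
filled, flagged = joint_detect_and_impute(series, start=2, length=3, history=history)
# flagged == [5]: the point after the run is imputed together with the NaNs.
```

Without the detection step, the value 5.0 would be kept as a valid endpoint and would pull the interpolated values upward, which is the failure mode that outlier-aware imputation is meant to avoid.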

Methodologies

Probabilistic Forecasting Based Anomaly Detection

Forward-Backward Joint Imputation

Numerical Evaluation

Data Analysis

Detection Results for Residential Data

Imputation Results of the Residential Data

Conclusions

Journal: Energies
Publication Date: Dec 30, 2020
Citations: 3
License type: CC BY 4.0
