In an era of increasing digitalization, the integration of diverse sensors has become commonplace. Within the transportation sector, state Departments of Transportation have extensively deployed traffic sensors to support various engineering endeavors. However, these sensors are prone to faults such as data loss, random noise, bias, and drift due to sensor aging, defects, or environmental influences. Therefore, detecting these faults is imperative to maintain data integrity. This paper presents a novel approach for detecting faults in traffic data based on a cluster-guided data reconstruction process. First, a traffic sensor clustering module is designed using a dual-encoding attention graph auto-encoder (DA-GAE) to identify clusters of traffic sensors. This module leverages a joint embedding of node and edge features in a low-dimensional vector space. Subsequently, utilizing the identified clusters, a cluster-guided denoising graph auto-encoder (CG-DGAE) is devised and trained for data reconstruction. The CG-DGAE employs a diffusion graph convolutional network (DGCN) and is trained with a cluster-wise sampling strategy. Extensive experiments were conducted using traffic data obtained from a real-world large sensor network, demonstrating superior performance of the CG-DGAE model in data reconstruction compared to various baseline methods. For fault detection, a score function is devised to discern potential faults by contrasting the sensor data sequence and the reconstructed data sequence. The fault detection results show that our proposed CG-DGAE model achieved an impressive overall accuracy of 99.09%, a precision of 99.13%, a recall of 99.53%, and a F1 score of 99.53%. The superior capability of CG-DGAE in data reconstruction and fault detection is attributable to the cluster-wise spatial–temporal context leveraged by the model.
Read full abstract