The topic of this paper is fault detection for district heating substations, which is an important enabler for the transition towards fourth-generation district heating systems. Classical fault detection approaches are often based on anomaly detection, commonly making the implicit assumption that the errors between the measurements and the predictions made by the baseline model are i.i.d. and following an underlying Gaussian distribution. Our analysis shows that this does not hold up in the field, showing clear seasonality in the error over time. We propose to replace the Gaussian error model by a quantile regression model in order to provide a more nuanced fault threshold, conditioned on time and other input variables. Additionally, we observed that properly training the baseline model comes with its own challenges due to this time dependency, which we propose to resolve by employing an ensemble of models, trained on different periods of time. We demonstrate our method on unlabelled operational data obtained from a Swedish district heating operator to illustrate its use in the field. In addition, we validate it on labelled data from our residential lab setup, testing a variety of common faults.
Read full abstract