Sensor readings play a critical role in the prediction performance of AI-based data-driven fault-detection and diagnostics for borehole heat exchangers in heating and cooling systems. The accuracy and algorithm selection of individual deep neural networks are well-investigated research topics but the impact of quality of sensors data on algorithm selection and its further impact on prediction performance has not been investigated and quantified for geothermal heating/cooling systems. The objective of the current work is to investigate the impact of data quality, available as measured time-series data from the borehole field, on the prediction accuracy of the deep neural networks-based models. In the current work four different data sets (D-1, D-2, D-3, D-4) are used for data-driven modeling using four different deep machine learning approaches, i.e., LSTM, BD-LSTM, GRU and CNN. The developed AI models have been trained, validated, and tested using the real-field data sets with same number of independent input parameters i.e., inlet temperature and mass flow rate to predict the fluid outlet temperature. The validated AI models are used to predict the performance of borehole heat exchanger to meet the heating and cooling loads. The prediction results show that the BD-LSTM outperforms other deep neural networks in terms of accuracy. With the data sets D-1, D-2, D-3 and D-4 it was observed that BD-LSTM had lowest MAPE of 2.99 %, 0.22 %, 6.39 % and 2.21 % respectively. Several experiments show that the BD-LSTM and LSTM models are better than the other used approaches in terms of prediction accuracy, but larger number of trainable parameters results in more computation time than CNN and GRU respectively. More data points with higher sampling frequency, results in more accurate prediction as depicted by the results from D-2 and D-4. Although, the results showed that the data collection interval is crucial, but different test runs indicate that avoiding the loss of time-series data during the cleaning process is equally important to accurately capture the Seasonality and Trend of the heating/cooling data. This is realized in case of D-1 where almost 58 % of the values were removed as outliers and these missing values were needed to be approximated which also affected the model prediction accuracy. The presented analysis highlights that the LSTM based DNN model has the capability to predict borehole heat exchanger performance accurately and the predictions are highly dependent on data frequency, the quality as well as data quantity from geothermal installations used to train the model.
Read full abstract