Utilizing domain knowledge: Robust machine learning for building energy performance prediction with small, inconsistent datasets

Xia Chen,Manav Mahan Singh,Philipp Geyer

doi:10.1016/j.knosys.2024.111774

Abstract

Machine learning (ML) applications often require large datasets, a requirement that can pose a major challenge in fields where data is sparse or inconsistent. To address this issue, we propose a novel approach that combines prior knowledge with data-driven methods to significantly reduce data dependency. This study represents a disentangled system compositionality knowledge by the method of Component-Based Machine Learning (CBML) in the context of energy-efficient building engineering. In this way, CBML incorporates semantic domain knowledge within the structure of a data-driven model. To understand the advantage of CBML, we conducted a case experiment to assess the effectiveness of this knowledge-encoded ML approach in scenarios with sparse data input (1 % - 0.0125 % sampling rate) and several typical ML methods. Our findings reveal three key advantages of this approach over traditional ML methods: 1) It significantly improves the robustness of ML models when dealing with extremely small and inconsistent datasets; 2) It allows for efficient utilization of data from diverse record collections; 3) It can handle incomplete data while maintaining high interpretability and reducing training time. These features offer a promising solution to the challenges associated with deploying data-intensive methods and contribute to more efficient real-world data usage. Additionally, we outline four essential prerequisites to ensure the successful integration of prior knowledge and ML generalization in target scenarios and open-sourced the code and dataset for community reproduction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Knowledge-Based Systems	Publication Date: Apr 5, 2024
Citations: 2	License type: cc-by-nc

R Discovery Prime

R Discovery Prime

Utilizing domain knowledge: Robust machine learning for building energy performance prediction with small, inconsistent datasets

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems

Lead the way for us

Similar Papers

Towards out of distribution generalization for problems in mechanics
Lingxiao Yuan ... Emma Lejeune
Computer Methods in Applied Mechanics and Engineering | VOL. 400
Lingxiao Yuan, et. al.Lingxiao Yuan ... Emma Lejeune
14 Sep 2022
Computer Methods in Applied Mechanics and Engineering | VOL. 400

Disease Prediction Using Graph Machine Learning Based on Electronic Health Data: A Review of Approaches and Trends
Haohui Lu ... Shahadat Uddin
Healthcare | VOL. 11
Haohui Lu, et. al.Haohui Lu ... Shahadat Uddin
04 Apr 2023
Healthcare | VOL. 11

An intercomparison of weather normalization of PM2.5 concentration using traditional statistical methods, machine learning, and chemistry transport models
Huang Zheng ... Roy M Harrison
npj Climate and Atmospheric Science | VOL. 6
Huang Zheng, et. al.Huang Zheng ... Roy M Harrison
20 Dec 2023
npj Climate and Atmospheric Science | VOL. 6

Comparison of Deep Learning and Traditional Machine Learning Classification Performance in a SSVEP Based Brain Computer Interface
Zafer İşcan
Balkan Journal of Electrical and Computer Engineering | VOL. 10
Zafer İşcanZafer İşcan
30 Jul 2022
Balkan Journal of Electrical and Computer Engineering | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Utilizing domain knowledge: Robust machine learning for building energy performance prediction with small, inconsistent datasets

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems