Abstract

Knowledge on water-use patterns in residential settings can help policymakers formulate well-targeted water conservation measures and evaluate the efficacy of such measures. Prior studies have applied machine learning techniques to disaggregate household-level water consumption data, collected by smart meters, into specific end-use categories, such as showering, basin use, kitchen use, and use by washing machine. However, the analysis of the effect of the sampling interval on the classification accuracy level remains an underinvestigated issue. This article seeks to fill this knowledge gap by identifying an optimal sampling interval that can achieve a high level of classification accuracy while overcoming constraints of on-device data storage, data transmission, and energy consumption. To understand the benefits of collecting fine-grained data for machine learning, we have built a high-resolution tap-based Internet of Things (IoT) system comprising a set of Wi-Fi-based tap sensors, gateway infrastructure, and a secure data processing pipeline. Based on empirical tap-based data collected over an eight-month period, we concluded that when the sampling interval decreases slightly from 5 to 1 s, the accuracy level of the end-use classification model increases significantly from 66.6% to 76.1 %. This article also highlights the challenges of deploying IoT sensors to collect water consumption data in a domestic setting. In order to collect sufficient ground-truth data for the training and verification of a generalizable water end-use disaggregation model, it is necessary to sophisticate the flow data collection system by adopting low-power wide-area network technologies and reducing the level of energy consumption of the flow sensing components.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call