Abstract

With the emergence of Low-Cost Sensor (LCS) devices, measuring real-time data on a large scale has become a feasible alternative approach to more costly devices. Over the years, sensor technologies have evolved which has provided the opportunity to have diversity in LCS selection for the same task. However, this diversity in sensor types adds complexity to appropriate sensor selection for monitoring tasks. In addition, LCS devices are often associated with low confidence in terms of sensing accuracy because of the complexities in sensing principles and the interpretation of monitored data. From the data analytics point of view, data quality is a major concern as low-quality data more often leads to low confidence in the monitoring systems. Therefore, any applications on building monitoring systems using LCS devices need to focus on two main techniques: sensor selection and calibration to improve data quality. In this paper, data-driven techniques were presented for sensor calibration techniques. To validate our methodology and techniques, an air quality monitoring case study from the Bradford district, UK, as part of two European Union (EU) funded projects was used. For this case study, the candidate sensors were selected based on the literature and market availability. The candidate sensors were narrowed down into the selected sensors after analysing their consistency. To address data quality issues, four different calibration methods were compared to derive the best-suited calibration method for the LCS devices in our use case system. In the calibration, meteorological parameters temperature and humidity were used in addition to the observed readings. Moreover, we uniquely considered Absolute Humidity (AH) and Relative Humidity (RH) as part of the calibration process. To validate the result of experimentation, the Coefficient of Determination (R2), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE) were compared for both AH and RH. The experimental results showed that calibration with AH has better performance as compared with RH. The experimental results showed the selection and calibration techniques that can be used in designing similar LCS based monitoring systems.

Highlights

  • Multi-layer perceptron (MLP) is a forward-structured Artificial Neural Network that operates on sets of input vectors to give output with a set of output vectors

  • The Random Forest (RF) model is a machine learning technique based on a combination of classification or regression trees which was first introduced by Breiman in 2001

  • Low-Cost Sensor (LCS) give an alternative solution against the high-cost sensors used for various measurement and monitoring purposes as they are compact in size and low-cost

Read more

Summary

Introduction

There has not been a universal single type of LCS implementation as the LCSs have different working principles such as electrochemical, optical particle counters (OPC), non-dispersive infra-red (NDIR), metaloxide-semiconductor, or solid-state microsensors designed to monitor air pollutants [3,4,5] This diversity in working principles of sensors adds complexity to the process of LCS selection while building the monitoring systems using LCS-based devices. There have been challenges in terms of how to select, calibrate, and build LCS-based devices considering twin factors of sensors’ availability and data quality. A data-driven LCS calibration methodology was presented to address the challenges of LCS-based device building for an application.

Literature Review
Our Methodology
Selection
Narrowing down the Selection of Low-Cost Air Quality Monitoring Sensors
Multivariate Linear Regression
Statistical Approaches
Convolution Neural Network
Random Forest
Findings
Conclusions and Future Work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call