Evaluating and improving the reliability of gas-phase sensor system calibrations across new locations for ambient measurements and personal exposure monitoring

Sharad Vikram,Tajana Rosing,Michael Hannigan,Camron Chermak,Michael H Ostertag,Massimiliano Menarini,Ashley Collier-Oxandale,William G Griswold,Sanjoy Dasgupta

doi:10.5194/amt-12-4211-2019

Sharad Vikram, Tajana Rosing + Show 7 more

Open Access

https://doi.org/10.5194/amt-12-4211-2019

Copy DOI

Abstract

Abstract. Advances in ambient environmental monitoring technologies are enabling concerned communities and citizens to collect data to better understand their local environment and potential exposures. These mobile, low-cost tools make it possible to collect data with increased temporal and spatial resolution, providing data on a large scale with unprecedented levels of detail. This type of data has the potential to empower people to make personal decisions about their exposure and support the development of local strategies for reducing pollution and improving health outcomes. However, calibration of these low-cost instruments has been a challenge. Often, a sensor package is calibrated via field calibration. This involves colocating the sensor package with a high-quality reference instrument for an extended period and then applying machine learning or other model fitting technique such as multiple linear regression to develop a calibration model for converting raw sensor signals to pollutant concentrations. Although this method helps to correct for the effects of ambient conditions (e.g., temperature) and cross sensitivities with nontarget pollutants, there is a growing body of evidence that calibration models can overfit to a given location or set of environmental conditions on account of the incidental correlation between pollutant levels and environmental conditions, including diurnal cycles. As a result, a sensor package trained at a field site may provide less reliable data when moved, or transferred, to a different location. This is a potential concern for applications seeking to perform monitoring away from regulatory monitoring sites, such as personal mobile monitoring or high-resolution monitoring of a neighborhood. We performed experiments confirming that transferability is indeed a problem and show that it can be improved by collecting data from multiple regulatory sites and building a calibration model that leverages data from a more diverse data set. We deployed three sensor packages to each of three sites with reference monitors (nine packages total) and then rotated the sensor packages through the sites over time. Two sites were in San Diego, CA, with a third outside of Bakersfield, CA, offering varying environmental conditions, general air quality composition, and pollutant concentrations. When compared to prior single-site calibration, the multisite approach exhibits better model transferability for a range of modeling approaches. Our experiments also reveal that random forest is especially prone to overfitting and confirm prior results that transfer is a significant source of both bias and standard error. Linear regression, on the other hand, although it exhibits relatively high error, does not degrade much in transfer. Bias dominated in our experiments, suggesting that transferability might be easily increased by detecting and correcting for bias. Also, given that many monitoring applications involve the deployment of many sensor packages based on the same sensing technology, there is an opportunity to leverage the availability of multiple sensors at multiple sites during calibration to lower the cost of training and better tolerate transfer. We contribute a new neural network architecture model termed split-NN that splits the model into two stages, in which the first stage corrects for sensor-to-sensor variation and the second stage uses the combined data of all the sensors to build a model for a single sensor package. The split-NN modeling approach outperforms multiple linear regression, traditional two- and four-layer neural networks, and random forest models. Depending on the training configuration, compared to random forest the split-NN method reduced error 0 %–11 % for NO2 and 6 %–13 % for O3.

Highlights

As the use of low-cost sensor systems for citizen science and community-based research expands, improving the robustness of calibration for low-cost sensors will support these efforts by ensuring more reliable data and enabling a more effective use of the often-limited resources of these groups
Based on the hypothesis that the increased errors under transfer are due to overfitting, we propose that training a calibration model on multiple sites will improve transfer
We investigated the performance of three calibration models: multiple linear regression, neural networks, and random forest

Summary

Introduction

As the use of low-cost sensor systems for citizen science and community-based research expands, improving the robustness of calibration for low-cost sensors will support these efforts by ensuring more reliable data and enabling a more effective use of the often-limited resources of these groups These next-generation technologies have the potential to reduce the cost of air quality monitoring instruments by orders of magnitude, enabling the collection of data at higher spatial and temporal resolution, providing new options for both personal exposure monitoring and communities concerned about their air quality (Snyder et al, 2013). One group using low-cost sensors to provide more detailed and locally specific air quality information is the Imperial County Community Air Monitoring Network (English et al, 2017) The goal of this network of particulate monitors is to help inform local action (e.g., keeping kids with asthma inside) or open the door to conversations with regulators (English et al, 2017). Researchers are investigating the potential for wearable monitors to improve personal exposure estimates (Jerrett et al, 2017)

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Atmospheric Measurement Techniques	Publication Date: Aug 6, 2019
Citations: 20	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Evaluating and improving the reliability of gas-phase sensor system calibrations across new locations for ambient measurements and personal exposure monitoring

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Atmospheric Measurement Techniques

Lead the way for us

Similar Papers

A machine learning calibration model using random forests to improve sensor performance for lower-cost air quality monitoring
Naomi Zimmerman ... Albert A Presto
Atmospheric Measurement Techniques | VOL. 11
Naomi Zimmerman, et. al.Naomi Zimmerman ... Albert A Presto
15 Jan 2018
Atmospheric Measurement Techniques | VOL. 11

Application of Machine Learning and Process-Based Models for Rainfall-Runoff Simulation in DuPage River Basin, Illinois
Amrit Bhusal ... Ajay Kalra
Hydrology | VOL. 9
Amrit Bhusal, et. al.Amrit Bhusal ... Ajay Kalra
27 Jun 2022
Hydrology | VOL. 9

Seeing the Forest for the Trees: Random Forest Models for Predicting Survival in Kidney Transplant Recipients.
Ruth Sapir-Pichhadze ... Bruce Kaplan
Transplantation | VOL. 104
Ruth Sapir-Pichhadze, et. al.Ruth Sapir-Pichhadze ... Bruce Kaplan
01 May 2020
Transplantation | VOL. 104

Prediksi Daya Listrik Pada Pembangkit Listrik Siklus Gabungan Berdasarkan Kondisi Lingkungan Menggunakan Metode Machine Learning
Hendra Hendra
Elektron : Jurnal Ilmiah | VOL. -
Hendra HendraHendra Hendra
28 Dec 2023
Elektron : Jurnal Ilmiah | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluating and improving the reliability of gas-phase sensor system calibrations across new locations for ambient measurements and personal exposure monitoring

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Atmospheric Measurement Techniques