Machine learning models accurately predict ozone exposure during wildfire events

Gregory L Watson,Donatello Telesca,Colleen E Reid,Gabriele G Pfister,Michael Jerrett

doi:10.1016/j.envpol.2019.06.088

Abstract

Epidemiologists use prediction models to downscale (i.e., interpolate) air pollution exposure where monitoring data is insufficient. This study compares machine learning prediction models for ground-level ozone during wildfires, evaluating the predictive accuracy of ten algorithms on the daily 8-hour maximum average ozone during a 2008 wildfire event in northern California. Models were evaluated using a leave-one-location-out cross-validation (LOLO CV) procedure to account for the spatial and temporal dependence of the data and produce more realistic estimates of prediction error. LOLO CV avoids both the well-known overly optimistic bias of k-fold cross-validation on dependent data and the conservative bias of evaluating prediction error over a coarser spatial resolution via leave-k-locations-out CV. Gradient boosting was the most accurate of the ten machine learning algorithms with the lowest LOLO CV estimated root mean square error (0.228) and the highest LOLO CV Rˆ2 (0.677). Random forest was the second best performing algorithm with an LOLO CV Rˆ2 of 0.661. The LOLO CV estimates of predictive accuracy were less optimistic than 10-fold CV estimates for all ten models. The difference in estimated accuracy between the 10-fold CV and LOLO CV was greater for more flexible models like gradient boosting and random forest. The order of estimated model accuracy depended on the choice of evaluation metric, indicating that 10-fold CV and LOLO CV may select different models or sets of covariates as optimal, which calls into question the reliability of 10-fold CV for model (or variable) selection. These prediction models are designed for interpolating ozone exposure, and are not suited to inferring the effect of wildfires on ozone or extrapolating to predict ozone in other spatial or temporal domains. This is demonstrated by the inability of the best performing models to accurately predict ozone during 2007 southern California wildfires.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Machine learning models accurately predict ozone exposure during wildfire events

Abstract

Talk to us

Similar Papers

More From: Environmental Pollution

Lead the way for us

Journal: Environmental Pollution	Publication Date: Jul 5, 2019
Citations: 79

Similar Papers

American Academy of Sports Physical Therapy Platform Presentation Abstracts SPL1–SPL82
-
Journal of Orthopaedic & Sports Physical Therapy | VOL. 51
--
01 Jan 2020
Journal of Orthopaedic & Sports Physical Therapy | VOL. 51

Paper 42: A Novel Machine Learning (ML) Algorithm to Predict Outcomes after Revision ACLR (rACLR) in the Multicenter Anterior Cruciate Ligament Reconstruction Study (MARS) Cohort
Kinjal Vasavada
Orthopaedic Journal of Sports Medicine | VOL. 11
Kinjal VasavadaKinjal Vasavada
01 Jul 2023
Orthopaedic Journal of Sports Medicine | VOL. 11

Evaluating the performance of machine learning methods and variable selection methods for predicting difficult-to-measure traits in Holstein dairy cattle using milk infrared spectral data
Lucio F.M Mota ... Alessio Cecchinato
Journal of Dairy Science | VOL. 104
Lucio F.M Mota, et. al.Lucio F.M Mota ... Alessio Cecchinato
15 Apr 2021
Journal of Dairy Science | VOL. 104

Machine learning approaches for formation matrix volume prediction from well logs: Insights and lessons learned
Pamidi Venkata Durga Kannaiah ... Neetish Kumar Maurya
Geoenergy Science and Engineering | VOL. 229
Pamidi Venkata Durga Kannaiah, et. al.Pamidi Venkata Durga Kannaiah ... Neetish Kumar Maurya
08 Jul 2023
Geoenergy Science and Engineering | VOL. 229

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Machine learning models accurately predict ozone exposure during wildfire events

Abstract

Talk to us

Similar Papers

More From: Environmental Pollution