Abstract

Abstract. A growing body of research on statistical applications for characterization of atmospheric aerosol Fourier transform infrared (FT-IR) samples collected on polytetrafluoroethylene (PTFE) filters (e.g., Russell et al., 2011; Ruthenburg et al., 2014) and a rising interest in analyzing FT-IR samples collected by air quality monitoring networks call for an automated PTFE baseline correction solution. The existing polynomial technique (Takahama et al., 2013) is not scalable to a project with a large number of aerosol samples because it contains many parameters and requires expert intervention. Therefore, the question of how to develop an automated method for baseline correcting hundreds to thousands of ambient aerosol spectra given the variability in both environmental mixture composition and PTFE baselines remains. This study approaches the question by detailing the statistical protocol, which allows for the precise definition of analyte and background subregions, applies nonparametric smoothing splines to reproduce sample-specific PTFE variations, and integrates performance metrics from atmospheric aerosol and blank samples alike in the smoothing parameter selection. Referencing 794 atmospheric aerosol samples from seven Interagency Monitoring of PROtected Visual Environment (IMPROVE) sites collected during 2011, we start by identifying key FT-IR signal characteristics, such as non-negative absorbance or analyte segment transformation, to capture sample-specific transitions between background and analyte. While referring to qualitative properties of PTFE background, the goal of smoothing splines interpolation is to learn the baseline structure in the background region to predict the baseline structure in the analyte region. We then validate the model by comparing smoothing splines baseline-corrected spectra with uncorrected and polynomial baseline (PB)-corrected equivalents via three statistical applications: (1) clustering analysis, (2) functional group quantification, and (3) thermal optical reflectance (TOR) organic carbon (OC) and elemental carbon (EC) predictions. The discrepancy rate for a four-cluster solution is 10 %. For all functional groups but carboxylic COH the discrepancy is ≤ 10 %. Performance metrics obtained from TOR OC and EC predictions (R2 ≥ 0.94 %, bias ≤ 0.01 µg m−3, and error ≤ 0.04 µg m−3) are on a par with those obtained from uncorrected and PB-corrected spectra. The proposed protocol leads to visually and analytically similar estimates as those generated by the polynomial method. More importantly, the automated solution allows us and future users to evaluate its analytical reproducibility while minimizing reducible user bias. We anticipate the protocol will enable FT-IR researchers and data analysts to quickly and reliably analyze a large amount of data and connect them to a variety of available statistical learning methods to be applied to analyte absorbances isolated in atmospheric aerosol samples.

Highlights

  • Measurement and quantification of atmospheric aerosol composition and abundance provide a basis from which we can monitor regional air quality, predict potential impacts on health and climate, and deduce formation mechanisms to reduce uncertainties in climate models for simulating al-Published by Copernicus Publications on behalf of the European Geosciences Union.A

  • This study approaches the question by detailing the statistical protocol, which allows for the precise definition of analyte and background subregions, applies nonparametric smoothing splines to model sample-specific PTFE variations, and integrates performance metrics from particulate matter (PM) and blank samples alike in the smoothing parameter selection

  • Our results demonstrate that similar estimates can be obtained using a new, automated baseline correction protocol

Read more

Summary

Introduction

Polytetrafluoroethylene (PTFE) filters have been extensively used in both measurement campaigns (Maria et al, 2002, 2003; Takahama et al, 2011; Frossard et al, 2014; Russell, 2003) and routine monitoring networks, such as the IMPROVE network in pristine and rural areas or the Chemical Speciation Network/Speciation Trends Network in urban and suburban areas in the United States (Dillner and Takahama, 2015a). Advantages of PTFE substrates include their stability, hydrophobicity, and negligible carbon gas adsorption (Turpin et al, 1994; Gilardoni et al, 2007; Ruthenburg et al, 2014). As such, they are amenable to gravimetric mass, elemental analysis, and detailed chemical speciation analysis (e.g., Surratt et al, 2007)

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call