Abstract
Particle exposure affects more humans globally than any other air pollutant. However, due to expensive instruments and infrastructural deficiency, a high spatiotemporal network of monitoring stations is not possible, leading to data-scarce regions. Satellite and reanalysis datasets can be implemented to estimate particulate matter, but they do not provide surface concentration and needs to be reconstructed from the components. In this study, a machine learning (ML) framework is implemented to reconstruct PM2.5 from MERRA-2 data components, namely black carbon (BC), organic carbon (OC), dust (DUST), sea salt (SS), and sulfate (SO4) mass concentration. The ground-level data were collected from India's 335 continuous ambient air quality monitoring stations (CAAQMS) and respective MERRA-2 data for 2017–2021 at hourly resolution. Random forest (RF) performs better with train and test scores (R2) of 0.84 and 0.73, respectively, while the empirical equation provides an R2 of only 0.26 on test data. The estimated PM2.5 for Indian states from 1980 to 2021 indicates a significant increase in most cases. However, states in the Indo-Gangetic plain such as Delhi, Punjab, Haryana, and Uttar Pradesh, are the most polluted regions of India. The major shift in concentration is from 2000 onwards, which can be seen as a direct result of the economic liberalization policies implemented in 1991. The results provide evidence for the limitations of the broad application of the empirical equation and the feasibility of ML algorithms as a potential reconstruction technique for developing robust and accurate region-specific models from MERRA-2 data.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.