Abstract

This paper is aimed at developing an air quality monitoring system using machine learning (ML), Internet of Things (IoT), and other elements to predict the level of particulate matter and gases in the air based on the air quality index (AQI). It is an air quality assessor and therefore a means of achieving the Sustainable Development Goals (SDGs), in particular, SDG 3.9 (substantial reduction of the health impacts of hazardous substances) and SDG 11.6 (reduction of negative impacts on cities and populations). AQI quantifies and informs the public about air pollutants and their adverse effects on public health. The proposed air quality monitoring device is low-cost and operates in real-time. It consists of a hardware unit that detects various pollutants to assess air quality as well as other airborne particles such as carbon dioxide (CO2), methane (CH4), volatile organic compounds (VOCs), nitrogen dioxide (NO2), carbon monoxide (CO), and particulate matter with an aerodynamic diameter of 2.5 microns or less (PM2.5). To predict air quality, the device was deployed from November 1, 2022, to February 4, 2023, in certain bauxite-rich areas of Adamawa and certain volcanic sites in western Cameroon. Therefore, machine learning algorithm models, namely, multiple linear regression (MLR), support vector regression (SVR), random forest regression (RFR), XGBoost (XGB), and K-nearest neighbors (KNN) were applied to analyze the collected concentrations and predict the future state of air quality. The performance of these models was evaluated using mean absolute error (MAE), coefficient of determination (R-square), and root mean square error (RMSE). The obtained data in this study show that these pollutants are present in selected localities albeit to different extents. Moreover, the AQI values obtained range from 10 to 530, with a mean of 132.380 ± 63.705, corresponding to moderate air quality state but may induce an adverse effect on sensitive members of the population. This study revealed that XGB regression performed better in air quality forecasting with the highest R-squared (test score of 0.9991 and train score of 0.9999) and lowest RMSE (test score of 1.5748 and train score of 0. 0073) and MAE (test score of 0.0872 and train score of 0.0020), while the KNN model had the worst prediction (lowest R-squared and highest RMSE and MAE). This embryonic work is a prototype for projects in Cameroon as measurements are underway for a national spread over a longer period of time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.