Evaluation of the Effectiveness of Feature Selection Methods Combined with Regression Algorithms to Predict Particulate Matter (PM10) in Gandhinagar, Gujarat, India

Zalak L Thakker,Sanjay H Buch

doi:10.32628/cseit2390641

Abstract

Feature selection is one of the important data pre-processing techniques that are used to increase the performance of machine learning models, to build faster and more cost-effective algorithms, and to make it easier to interpret the predictions made by the models. The main objective of this research work is to investigate the influence features to predict particulate matter (PM10). This research uses 24-hour average pollutant concentration data of 36 air quality monitoring stations provided by Gandhinagar Smart City Development Limited (GSCDL), Gandhinagar, Gujarat. Important features were identified using five feature selection techniques (correlation, forward selection, backward elimination, Exhaustive Feature Selection (EFS), and feature importance derived using Random Forest Regressor). With selected features six regression algorithms (Multiple Linear Regression, Random Forest, Decision Tree, K-nearest Neighbour, XGBoost, and Support Vector Regressor) were trained to predict PM10. Further, the models were compared based on the Root Mean Square Error (RMSE) and Coefficient of determination (R2) parameters to identify the model with good performance. This proposed model can be utilized as an early warning system, providing air quality information to local authorities to develop air-quality improvement initiatives.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluation of the Effectiveness of Feature Selection Methods Combined with Regression Algorithms to Predict Particulate Matter (PM10) in Gandhinagar, Gujarat, India

Abstract

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology

Lead the way for us

Journal: International Journal of Scientific Research in Computer Science, Engineering and Information Technology	Publication Date: Mar 14, 2024
License type: CC BY 4.0

Similar Papers

Effect of feature optimization on performance of machine learning models for predicting traffic incident duration
Lubna Obaid ... Ali Bou Nassif
Engineering Applications of Artificial Intelligence | VOL. 131
Lubna Obaid, et. al.Lubna Obaid ... Ali Bou Nassif
10 Jan 2024
Engineering Applications of Artificial Intelligence | VOL. 131

Predicting PM2.5 Concentrations Across USA Using Machine Learning
P Preetham Vignesh ... Jonathan H Jiang
Earth and Space Science | VOL. 10
P Preetham Vignesh, et. al.P Preetham Vignesh ... Jonathan H Jiang
01 Oct 2023
Earth and Space Science | VOL. 10

Leaf Area Index Estimation Algorithm for GF-5 Hyperspectral Data Based on Different Feature Selection and Machine Learning Methods
Zhulin Chen ... Yuan Sun
Remote Sensing | VOL. 12
Zhulin Chen, et. al.Zhulin Chen ... Yuan Sun
01 Jul 2020
Remote Sensing | VOL. 12

A machine learning method based on stacking heterogeneous ensemble learning for prediction of indoor humidity of greenhouse
Sepehr Rezaei Melal ... Seyed Mohammadhossein Shekarian
Journal of Agriculture and Food Research | VOL. 16
Sepehr Rezaei Melal, et. al.Sepehr Rezaei Melal ... Seyed Mohammadhossein Shekarian
16 Mar 2024
Journal of Agriculture and Food Research | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of the Effectiveness of Feature Selection Methods Combined with Regression Algorithms to Predict Particulate Matter (PM10) in Gandhinagar, Gujarat, India

Abstract

Talk to us

Similar Papers

More From: International Journal of Scientific Research in Computer Science, Engineering and Information Technology