An effective implementation and assessment of a random forest classifier as a soil spatial predictive model

Gaurav Shukla,Rahul Dev Garg,Hari Shanker Srivastava,Pradeep Kumar Garg

doi:10.1080/01431161.2018.1430399

Abstract

ABSTRACTMapping the spatial distribution of soil classes is important for informing soil use and management decisions. This study aimed to effectively implement Random Forest (RF) model and to evaluate the behaviour and performance of the model for soil classification of Indian districts. Soil-forming factors, known as ‘scorpan,’ are selected as environmental covariates to tune RF model to classify 11 different soil categories. Thirty-five digital layers are prepared using different satellite data [ALOS (Advanced Land Observing Satellite) digital elevation model, Landsat-8, Moderate Resolution Imaging Spectroradiometer normalized difference vegetation index product, RISAT-1 (Radar Imaging Satellite-1), Sentinel-1A] and climatic data (precipitation and temperature) to represent scorpan environmental covariates in the study area. The RF parameters corresponding to highest Cohen’s kappa coefficient (κ) value and lowest number of random split variables are considered optimum values for RF model. Model behaviour evaluation is based on mapping accuracy, sensitivity to data set size, and noise. Two other machine-learning methods, CART (Classification and Regression Tree) decision tree (CDT) and CART ensemble bagger (CEB), are used to provide the comparative study. To access behaviour of models to the false data set, noise in training set is produced by assigning a false class to the training set in 5% increment. Comparative performance of RF model is based on quality assessment measures. To evaluate the performance of models, marginal rates, F-measure, and Jaccard’s coefficient of the community, classification success index and agreement coefficients are selected under quality assessment measures. The score is calculated to rank the algorithm. RF model shows high stability against data set reduction in comparison to other methods. The results show that the abrupt change in accuracy is only observed after 60% training data reduction in RF model; however, significant decrease in accuracy can be noted after 45% and 25% data reduction in CEB and CDT, respectively. The RF model shows comparatively the greater resistance to noise. Overall, RF model has performed better than CDT and CEB to classify soil categories in the study area. The results of this research provide new insights into the performance of RF in the context of soil class mapping.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An effective implementation and assessment of a random forest classifier as a soil spatial predictive model

Abstract

Talk to us

Similar Papers

More From: International Journal of Remote Sensing

Lead the way for us

Journal: International Journal of Remote Sensing	Publication Date: Jan 25, 2018
Citations: 27

Similar Papers

Seeing the Forest for the Trees: Random Forest Models for Predicting Survival in Kidney Transplant Recipients.
Ruth Sapir-Pichhadze ... Bruce Kaplan
Transplantation | VOL. 104
Ruth Sapir-Pichhadze, et. al.Ruth Sapir-Pichhadze ... Bruce Kaplan
01 May 2020
Transplantation | VOL. 104

Magnetic resonance imaging-based prediction models for differentiating intraspinal schwannomas from meningiomas: classification and regression tree and random forest analysis.
Zhen Xu ... Jin-Shao Ye
Quantitative Imaging in Medicine and Surgery | VOL. 14
Zhen Xu, et. al.Zhen Xu ... Jin-Shao Ye
01 May 2024
Quantitative Imaging in Medicine and Surgery | VOL. 14

Comparison of the performance of decision tree (DT) algorithms and extreme learning machine (ELM) model in the prediction of water quality of the Upper Green River watershed.
Jagadeesh Anmala ... Venkateswarlu Turuganti
Water Environment Research | VOL. 93
Jagadeesh Anmala, et. al.Jagadeesh Anmala ... Venkateswarlu Turuganti
04 Oct 2021
Water Environment Research | VOL. 93

Developing machine learning models with multi-source environmental data to predict wheat yield in China
Linchao Li ... Puyu Feng
Computers and Electronics in Agriculture | VOL. 194
Linchao Li, et. al.Linchao Li ... Puyu Feng
01 Mar 2022
Computers and Electronics in Agriculture | VOL. 194

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An effective implementation and assessment of a random forest classifier as a soil spatial predictive model

Abstract

Talk to us

Similar Papers

More From: International Journal of Remote Sensing