Abstract

Visible-near-infrared spectrum (Vis-NIR) spectroscopy technology is one of the most important methods for non-destructive and rapid detection of soil total nitrogen (STN) content. In order to find a practical way to build STN content prediction model, three conventional machine learning methods and one deep learning approach are investigated and their predictive performances are compared and analyzed by using a public dataset called LUCAS Soil (19,019 samples). The three conventional machine learning methods include ordinary least square estimation (OLSE), random forest (RF), and extreme learning machine (ELM), while for the deep learning method, three different structures of convolutional neural network (CNN) incorporated Inception module are constructed and investigated. In order to clarify effectiveness of different pre-treatments on predicting STN content, the three conventional machine learning methods are combined with four pre-processing approaches (including baseline correction, smoothing, dimensional reduction, and feature selection) are investigated, compared, and analyzed. The results indicate that the baseline-corrected and smoothed ELM model reaches practical precision (coefficient of determination (R2) = 0.89, root mean square error of prediction (RMSEP) = 1.60 g/kg, and residual prediction deviation (RPD) = 2.34). While among three different structured CNN models, the one with more 1 × 1 convolutions preforms better (R2 = 0.93; RMSEP = 0.95 g/kg; and RPD = 3.85 in optimal case). In addition, in order to evaluate the influence of data set characteristics on the model, the LUCAS data set was divided into different data subsets according to dataset size, organic carbon (OC) content and countries, and the results show that the deep learning method is more effective and practical than conventional machine learning methods and, on the premise of enough data samples, it can be used to build a robust STN content prediction model with high accuracy for the same type of soil with similar agricultural treatment.

Highlights

  • Soil total nitrogen (STN) has a significant impact on plant growth [1,2], and predicting soil total nitrogen (STN) content is vital for crop production as well as income generation for farmers

  • The results shown the partial least square (PLS)-external parameter orthogonalization (EPO) transformation dramatically improved model performance relative to PLS alone, reducing root mean squared error of prediction (RMSEP) by 53% for TN

  • Support vector regression (SVR) to predict soil properties from texturally homogeneous samples, the results shown that Vis-NIRS is suitable for the prediction of properties of texturally homogeneous samples

Read more

Summary

Introduction

Soil total nitrogen (STN) has a significant impact on plant growth [1,2], and predicting STN content is vital for crop production as well as income generation for farmers. Traditional modeling approaches to evaluate STN content often involve complicated data preprocessing since there exist a nonlinear relationship between STN and soil spectra. Partial least square (PLS) and principal component regression (PCR) are the most common modelling approaches for quantitative spectroscopy analyses in soils [14,15]. Veum et al [16] predicted total nitrogen (TN) using a large, regional dataset of in situ profile DRS spectra and compare the performance of traditional PLS analysis, PLS on external parameter orthogonalization (EPO) transformed spectra (PLS-EPO), PLS-EPO with the Bayesian Lasso (PLS-EPO-BL), and covariate-assisted PLS-EPO-BL models. Debanen et al [17] selected

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call