Abstract

The choice of the method of raw data dividing into train and test subsets in the models based on the artificial neural networks (ANN) is one of the underexplored problems of continuous Spatio-temporal field interpolation. Selecting the best training subset for modelling the spatial distribution of elements in the topsoil is not a trivial task since the sampling points are not equivalent. Errors and outliers, which are present in the distribution, can be misleading. Most of the points, which contain utility information, should be involved in modelling. The raw data were Chromium (Cr) and Manganese (Mn) contents in the topsoil on the residential areas in Noyabrsk and Novy Urengoy (South Part) cities (Russian subarctic zone). A three-step algorithm for extraction of the raw data dividing into training and test subsets for modelling the spatial distribution of the feature presented. The spatial distributions of the element contents in the topsoil, which constructed by the multilayer perceptron (MLP), consider spatial heterogeneity and training rules. The MLP structure was chosen by the minimization of the root mean squared error (RMSE). For each territory, according to the number of hits in the training subset, the points divided into three classes: the «elite», «middle», and «useless». Considering this information at the stage of dividing the raw sample makes it possible to increase the accuracy of the predictive model.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.