Abstract

A method is proposed for selecting the blurriness coefficient of kernel functions for nonparametric estimation of the probability density of a one-dimensional random variable with large volumes of statistical data, for example, obtained by remote sensing of natural objects. In the proposed method for selecting the blurriness coefficient, a regression estimate of the probability density is used. A method for synthesizing a regression probability density estimate is presented. The synthesis of the estimate is based on compression of the initial sample by decomposition of the range of values of a random variable. To decompose the range of values of a random variable, the Heinhold-Gaede rule and the formula for optimal selection of the number of sampling intervals are applied. Two approaches to the selection of the blurriness coefficient of the regression estimation of probability density using the traditional and proposed by the authors optimization methods of nonparametric estimation of probability density are considered. The traditional method of optimizing nonparametric estimation of probability density is based on minimizing its mean square deviation. In the proposed method, the selection of the blurriness coefficients of the kernel functions is based on the conditions of the minimum error of approximation of the regression estimate of the desired probability density. The approximation properties of the regression estimation of probability density using two methods of its optimization are analyzed. The conditions of their competence in estimating the probability densities of random variables with a lognormal distribution law are established. The results obtained allow for development when optimizing a regression estimate of the probability density of a multidimensional random variable.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call