Abstract

In this chapter, we introduce Gaussian processes for machine learning and their application to designing digital communication receivers. Gaussian processes for machine learning are Bayesian nonlinear tools for solving regression and classification problems. Gaussian processes for regression (GPR) were introduced in the mid-nineties to solve nonparametric estimation problems from a Bayesian perspective. They place a Gaussian process (GP) prior over the possible regressors and use the available data to obtain a posterior regressor, which it is able to explain the observations without overfitting. The covariance matrix of the GP prior describes the different solutions that can be achieved, e.g. linear, polynomial, or universal regressors. The solution of GPR is analytical given its covariance function and, besides providing point estimates, it also assigns confidence intervals for the predictions. Furthermore, the covariance function can be optimized by maximum likelihood to better represent the data, which adds additional flexibility to our regression approximation. GPR can be generalized to solve classification problems, namely Gaussian processes for classification (GPC). GPC extends the idea of GPR for a classification likelihood model. For this likelihood, the GPC posterior is no longer analytically tractable and we need to approximate it. Expectation Propagation (EP), which matches the mean and covariance of the GP posterior to a Gaussian distribution, is the most widely used approximation. Unlike most state-of-the-art classifiers, GPC does not return point-wise decisions, but it provides an accurate posterior probability for each classification decision. This is a major advantage to be exploited by subsequent applications for reducing the base error produced by our nonlinear classifiers. Nonlinear regression and classification techniques have been widely used for designing digital communication receivers for nonlinear channels or whenever there is little information about the channel model or for nonlinear model. These nonlinear tools must use short training sequences to learn the channel and to adapt to a wide range of scenarios, from linear minimum phase to nonlinear and non-minimum phase and from single to multi-user 11

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call