Nonparametric Regression Estimation Research Articles

Background: Cortisol and dehydroepiandrosterone (DHEA) are considered to be valuable markers of the hypothalamus–pituitary–adrenal (HPA) axis, while salivary alpha-amylase (sAA) reflects the autonomic nervous system. Past studies have found certain diurnal patterns among these biomarkers, with some studies reporting results that differ from others. Also, some past studies have found an association among these three biomarkers while other studies have not. This study investigates these patterns and associations in older adults by taking advantage of modern statistical methods for dealing with non-normality, outliers and curvature. Basic characteristics of the data are reported as well, which are relevant to understanding the nature of any patterns and associations.

Methods: Boxplots were used to check on the skewness and presence of outliers, including the impact of using simple transformations for dealing with non-normality. Diurnal patterns were investigated using recent advances aimed at comparing medians. When studying associations, the initial step was to check for curvature using a non-parametric regression estimator. Based on the resulting fit, a robust regression estimator was used that is designed to deal with skewed distributions and outliers.

Results: Boxplots indicated highly skewed distributions with outliers. Simple transformations (such as taking logs) did not deal with this issue in an effective manner. Consequently, diurnal patterns were investigated using medians and found to be consistent with some previous studies but not others. A positive association between awakening cortisol levels and DHEA was found when DHEA is relatively low; otherwise no association was found. The nature of the association between cortisol and DHEA was found to change during the course of the day. Upon awakening, cortisol was found to have no association with sAA when DHEA levels are relatively low, but otherwise there is a negative association. DHEA was found to have a positive association with sAA upon awakening. Shortly after awakening and for the remainder of the day, no association was found between DHEA and sAA ignoring cortisol. For DHEA and cortisol (taken as the independent variables) versus sAA (the dependent variable), again an association is found only upon awakening.

Probability estimation for binary and multicategory outcomes using logistic and multinomial logistic regression has a long-standing tradition in biostatistics. However, biases may occur if the model is misspecified. In contrast, outcome probabilities for individuals can be estimated consistently with machine learning approaches, including k-nearest neighbors (k-NN), bagged nearest neighbors (b-NN), random forests (RF), and support vector machines (SVM). Because machine learning methods are rarely used by applied biostatisticians, the primary goal of this paper is to explain the concept of probability estimation with these methods and to summarize recent theoretical findings. Probability estimation in k-NN, b-NN, and RF can be embedded into the class of nonparametric regression learning machines; therefore, we start with the construction of nonparametric regression estimates and review results on consistency and rates of convergence. In SVMs, outcome probabilities for individuals are estimated consistently by repeatedly solving classification problems. For SVMs we first review the classification problem and then dichotomous probability estimation. Next we extend the algorithms for estimating probabilities using k-NN, b-NN, and RF to multicategory outcomes and discuss approaches for the multicategory probability estimation problem using SVM. In simulation studies for dichotomous and multicategory dependent variables we demonstrate the general validity of the machine learning methods and compare them with logistic regression. However, each method fails in at least one simulation scenario. We conclude with a discussion of the failures and give recommendations for selecting and tuning the methods. Applications to real data and example code are provided in a companion article (doi:10.1002/bimj.201300077).
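
The idea that k-NN probability estimation is a nonparametric regression on a 0/1 outcome can be illustrated with a minimal sketch. This is not the code from the paper or its companion article; the function name `knn_probability`, the Euclidean distance, and the toy data are assumptions made here for illustration only.

```python
import math

def knn_probability(train_x, train_y, query, k):
    """Estimate P(Y = 1 | X = query) as the mean 0/1 outcome
    among the k training points closest to the query.

    Hypothetical helper for illustration; Euclidean distance is an assumption.
    """
    if not 1 <= k <= len(train_x):
        raise ValueError("k must be between 1 and the number of training points")
    # Rank training indices by Euclidean distance to the query point.
    order = sorted(range(len(train_x)),
                   key=lambda i: math.dist(train_x[i], query))
    neighbours = order[:k]
    # Averaging the 0/1 outcomes is the k-NN regression estimate,
    # which here doubles as a probability estimate.
    return sum(train_y[i] for i in neighbours) / k

# Toy one-dimensional data (made up for this sketch).
train_x = [(0.0,), (0.1,), (0.2,), (0.8,), (0.9,), (1.0,)]
train_y = [0, 0, 1, 1, 1, 1]

p_low = knn_probability(train_x, train_y, (0.15,), k=3)
p_high = knn_probability(train_x, train_y, (0.95,), k=3)
print(p_low, p_high)
```

The estimate is a local average of the outcomes, so it makes no assumption about the functional form linking covariates to the probability. The choice of `k` controls the amount of smoothing.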

Articles published on Nonparametric Regression Estimation

Nonparametric regression estimation for functional stationary ergodic data with missing at random

Methodology for Non-Parametric Deconvolution When the Error Distribution is Unknown

Adaptive estimation in the functional nonparametric regression model

Optimal difference-based variance estimation in heteroscedastic nonparametric regression

Robust estimating equation-based sufficient dimension reduction

Additive kernel estimates of returns to schooling

UNIFORM CONSISTENCY FOR NONPARAMETRIC ESTIMATORS IN NULL RECURRENT TIME SERIES

Adaptive function estimation in nonparametric regression with one-sided errors

ADAPTIVE NONPARAMETRIC REGRESSION WITH CONDITIONAL HETEROSKEDASTICITY

THE INTEGRATED MEAN SQUARED ERROR OF SERIES REGRESSION AND A ROSENTHAL HILBERT-SPACE INEQUALITY

Block Thresholding on the Sphere

Semiparametric estimation of average treatment effect through a random coefficient dummy endogenous variable model

Empirical Comparison of Nonparametric Regression Estimates on Real Data

A plug-in the number of knots selector for polynomial spline regression

Generalized nonparametric smoothing with mixed discrete and continuous data

Performance criteria and discrimination of extreme undersmoothing in nonparametric regression

Nonparametric Kernel Methods with Errors‐in‐Variables: Constructing Estimators, Computing them, and Avoiding Common Mistakes

EFFICIENT NON‐PARAMETRIC ESTIMATION OF THE SPECTRAL DENSITY IN THE PRESENCE OF MISSING OBSERVATIONS

Diurnal patterns and associations among salivary cortisol, DHEA and alpha-amylase in older adults

Probability estimation with machine learning methods for dichotomous and multicategory outcome: Theory

