Abstract

Regularization techniques such as the lasso (Tibshirani 1996) and elastic net (Zou and Hastie 2005) can be used to improve regression model coefficient estimation and prediction accuracy, as well as to perform variable selection. Ordinal regression models are widely used in applications where the use of regularization could be beneficial; however, these models are not included in many popular software packages for regularized regression. We propose a coordinate descent algorithm to fit a broad class of ordinal regression models with an elastic net penalty. Furthermore, we demonstrate that each model in this class generalizes to a more flexible form, that can be used to model either ordered or unordered categorical response data. We call this the elementwise link multinomial-ordinal (ELMO) class, and it includes widely used models such as multinomial logistic regression (which also has an ordinal form) and ordinal logistic regression (which also has an unordered multinomial form). We introduce an elastic net penalty class that applies to either model form, and additionally, this penalty can be used to shrink a non-ordinal model toward its ordinal counterpart. Finally, we introduce the R package ordinalNet, which implements the algorithm for this model class.

Highlights

  • Ordinal regression models arise in contexts where the response variable belongs to one of several ordered categories

  • This paper introduced the elmentwise link, multinomial-ordinal (ELMO) model class, a rich class of multinomial regression models that includes commonly used categorical regression models

  • The parallel form is appropriate for ordinal data, while the nonparallel form is a more flexible model which can be used with an unordered categorical response

Read more

Summary

Introduction

Ordinal regression models arise in contexts where the response variable belongs to one of several ordered categories (such as 1=“poor”, 2=“fair”, 3=“good”, 4=“excellent”). One of the most common regression models for this type of data is the cumulative logit model (McCullagh, 1980), which is known as the proportional odds model or the ordinal logistic regression model. Other ordinal regression models include the stopping ratio model, the continuation ratio model, and the adjacent category model. The VGAM R package (Yee and Wild, 1996; Yee, 2010, 2015) fits all of the aforementioned models, but without regularization or variable selection. Popular CRAN packages for penalized regression, such as penalized (Goeman et al, 2014) and glmnet (Friedman et al, 2010), do not currently fit ordinal models

Objectives
Methods
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call