Sparse Poisson regression via mixed-integer optimization.

Hiroki Saishu,Kota Kudo,Yuichi Takano,Yossiri Adulyasak

doi:10.1371/journal.pone.0249916

Hiroki Saishu, Kota Kudo + Show 2 more

Open Access

https://doi.org/10.1371/journal.pone.0249916

Copy DOI

Journal: PloS one	Publication Date: Apr 22, 2021
Citations: 2	License type: CC BY 4.0

Affiliation: University of Tsukuba, Tsukuba University of Technology

Abstract

We present a mixed-integer optimization (MIO) approach to sparse Poisson regression. The MIO approach to sparse linear regression was first proposed in the 1970s, but has recently received renewed attention due to advances in optimization algorithms and computer hardware. In contrast to many sparse estimation algorithms, the MIO approach has the advantage of finding the best subset of explanatory variables with respect to various criterion functions. In this paper, we focus on a sparse Poisson regression that maximizes the weighted sum of the log-likelihood function and the L2-regularization term. For this problem, we derive a mixed-integer quadratic optimization (MIQO) formulation by applying a piecewise-linear approximation to the log-likelihood function. Optimization software can solve this MIQO problem to optimality. Moreover, we propose two methods for selecting a limited number of tangent lines effective for piecewise-linear approximations. We assess the efficacy of our method through computational experiments using synthetic and real-world datasets. Our methods provide better log-likelihood values than do conventional greedy algorithms in selecting tangent lines. In addition, our MIQO formulation delivers better out-of-sample prediction performance than do forward stepwise selection and L1-regularized estimation, especially in low-noise situations.

Highlights

A count variable, which takes only on nonnegative integer values, reflects the number of occurrences of an event during a fixed time period
We focus on the mixed-integer optimization (MIO) approach to sparse estimation
This paper aims at establishing an effective MIO approach to sparse Poisson regression based on piecewise-linear approximations

Summary

Introduction

A count variable, which takes only on nonnegative integer values, reflects the number of occurrences of an event during a fixed time period. Count regression models such as Poisson, overdispersed Poisson, and negative binomial regression are standard methods for predicting such count variables [1,2,3]. The aim of sparse estimation is to decrease the number of nonzero estimates of regression coefficients. This method is often used for selecting a significant subset of explanatory variables [9,10,11,12].

Objectives

Methods

Results

Conclusion