Abstract

The computational problem of finding the best-fitting subset of independent variables in least-squares regression with a fixed subset size is addressed, especially in the nonfull-rank case with more variables than observations. For the full-rank case, the most efficient widely used methods work by finding the complementary subset whose deletion causes the minimum reduction in the total regression sum of squares, a task that can usually be accomplished with far less computation than exhaustive evaluation of all subsets. Here, a method using Cholesky-type factorizations (Algorithm 2) has been developed that also takes advantage of the computational savings offered by the reduction approach, but that can be used in nonfull-rank cases where existing methods are not applicable. Algorithm 2 is derived by examining the asymptotic behavior of a full-rank procedure (Algorithm 1) applied to a ridge-type perturbation of the cross-product matrix. In the course of testing, it was discovered that Algorithm 1, with an appropriate ridge parameter, usually selected the best subset with less computation than Algorithm 2; however, if mathematical certainty is required, Algorithm 2 should be used. Also, some new approaches are proposed for developing efficient methods that identify the best subset directly, rather than as the complement of the minimum-reduction subset.
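
Neither Algorithm 1 nor Algorithm 2 is specified in the abstract, so the sketch below implements neither. It is a minimal exhaustive-search illustration in Python with NumPy (all function names, test data, and the ridge value are chosen here for exposition) of the two ideas the abstract relies on: (i) the best size-p subset is the complement of the size-(k-p) subset whose deletion causes the minimum reduction in the total regression sum of squares, and (ii) a small ridge perturbation of the cross-product matrix keeps Cholesky-type factorizations well defined when exact collinearity would otherwise make them break down.

```python
import itertools
import numpy as np

def reg_ss(XtX, Xty, cols):
    """Regression sum of squares b'A^{-1}b for the subset `cols`, via Cholesky."""
    A = XtX[np.ix_(cols, cols)]
    b = Xty[cols]
    L = np.linalg.cholesky(A)        # fails unless the subset matrix is positive definite
    z = np.linalg.solve(L, b)        # z = L^{-1} b
    return float(z @ z)              # ||z||^2 = b'A^{-1}b

def best_subset(X, y, p, ridge=0.0):
    """Best size-p subset by maximum regression SS, computed on X'X + ridge*I.

    A small ridge > 0 perturbs the cross-product matrix so every Cholesky
    factorization exists even when X'X is singular (the nonfull-rank case);
    letting ridge -> 0 recovers the unperturbed selection.
    """
    XtX = X.T @ X + ridge * np.eye(X.shape[1])
    Xty = X.T @ y
    return max(itertools.combinations(range(X.shape[1]), p),
               key=lambda cols: reg_ss(XtX, Xty, list(cols)))

rng = np.random.default_rng(0)
X = rng.standard_normal((30, 8))
y = X[:, [1, 4, 6]] @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.standard_normal(30)
XtX, Xty = X.T @ X, X.T @ y
total = reg_ss(XtX, Xty, list(range(8)))

# Equivalence exploited by the efficient full-rank methods: the best size-3
# subset is the complement of the size-5 subset whose deletion causes the
# minimum reduction in the total regression sum of squares.
direct = best_subset(X, y, 3)
deleted = min(itertools.combinations(range(8), 5),
              key=lambda c: total - reg_ss(XtX, Xty,
                                           [j for j in range(8) if j not in c]))
assert direct == tuple(j for j in range(8) if j not in deleted)

# Nonfull-rank illustration: an exactly collinear column makes some subset
# factorizations singular; a tiny ridge keeps every Cholesky well defined.
Xc = np.hstack([X, X[:, [1]]])       # column 8 duplicates column 1
print(best_subset(Xc, y, 3, ridge=1e-6))
```

The exhaustive search here is only to make the minimum-reduction equivalence easy to verify; the point of the paper's methods is precisely to find these subsets with far less computation than this brute-force pass over all combinations.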
