Abstract

Model selection is an important problem in mining information from large databases. For example, in selecting a regression model, there may be J independent variables from which to choose, giving 2^J possible models. Information criteria such as Akaike's (1973) Information Criterion (AIC) and Bozdogan's (1988, 1990, 1994, 2000, 2004) Information Measure of Complexity (ICOMP) criterion define the 'best' model by estimating the difference between a given model and the true model. In this paper, we introduce a new exact implicit enumeration (IE) algorithm to identify the subset of variables that minimises the information criterion. The IE algorithm uses efficient bounding strategies for the nonlinear objective function of the model selection problem. In computational tests, the IE algorithm outperforms the existing exact algorithms from the literature. It also has the advantage of being the only exact algorithm that can be used with all of the existing information criteria, including ICOMP. ICOMP explicitly takes into account the effect of the covariance of the variables on parameter estimation in the model selection process, and it also makes no assumption that the parameter estimates are unbiased.
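To make the search space concrete: the baseline that the IE algorithm improves on is brute-force enumeration of all 2^J variable subsets, scoring each with an information criterion. The sketch below is not the paper's IE algorithm (which prunes this search with bounding strategies); it is a minimal brute-force illustration using the standard OLS form of AIC, n*ln(RSS/n) + 2k, with hypothetical helper names `aic_ols` and `best_subset_aic`.

```python
from itertools import combinations

import numpy as np


def aic_ols(X, y):
    """AIC for an ordinary least-squares fit: n*ln(RSS/n) + 2k (additive constants dropped)."""
    n, k = X.shape
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    rss = np.sum((y - X @ beta) ** 2)
    return n * np.log(rss / n) + 2 * k


def best_subset_aic(X, y):
    """Exhaustively score every non-empty column subset (2^J - 1 models).

    Returns (best AIC value, tuple of selected column indices).
    """
    J = X.shape[1]
    best_score, best_subset = np.inf, None
    for r in range(1, J + 1):
        for subset in combinations(range(J), r):
            score = aic_ols(X[:, subset], y)
            if score < best_score:
                best_score, best_subset = score, subset
    return best_score, best_subset
```

Because the loop visits every subset, its cost grows exponentially in J; an exact method such as the paper's IE algorithm instead bounds the nonlinear objective to discard whole branches of subsets without scoring them.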
