Regression Procedures in SAS: Problems?

Bulent Uyar,Orhan Erdem

doi:10.1080/00031305.1990.10475746

Abstract

The Statistical Analysis System (SAS) is one of the more widely used statistical software packages. Two of the procedures available in SAS are the REGression PROCedure, and the General Linear Models PROCedure. These are executed by the statements PROC REG; and PROC respectively. Each procedure instructs SAS to estimate the regression equation specified in the MODEL statement following it. In addition to specifying the dependent and the independent variables of the model, users may utilize a MODEL statement to list certain options. Some of these options control the extent of detail shown in the printouts; others are specific to the estimation procedure. In either case, such options essentially override the SAS default conventions-namely, those that are operative when no options are specified in the MODEL statement. We contend that when SAS is executing a regression using these procedures, SAS does not necessarily cross-check all of the options listed in the MODEL statement against either the equation specified or the data set provided for the regression. As a result, computation by SAS of some regressionrelated (aggregate) statistics is carried out independently of the specification of the actual equation, with SAS relying on the options alone in determining which methods or formulas to use. It is therefore possible to obtain differing regression summary statistics (such as the coefficient of determination, R2, and the F statistic) for the same equation specified (by proper use of the SAS modeling options) in two equivalent ways. The problem for the unwary user is that the results may have misleading implications for the overall statistical significance of the model. Computer programs are not mind readers. No comprehensive system such as SAS can foresee all eventualities. Thus it is suggested that SAS users' manuals include a warning and alert users that these two procedures do not always cross-check for all the options listed in the MODEL statement. The objective of this article is to demonstrate this problem and, in doing so, establish where the problem arises. For this purpose, we run three regressions on the same data base and compare and analyze the estimation results. Since PROC REG; and PROC GLM; give the same results, the discussion is presented in terms of PROC REG;.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Regression Procedures in SAS: Problems?

Abstract

Talk to us

Similar Papers

More From: The American Statistician

Lead the way for us

Journal: The American Statistician	Publication Date: Nov 1, 1990
Citations: 4

Similar Papers

PROC QTL—A SAS Procedure for Mapping Quantitative Trait Loci
Zhiqiu Hu ... Shizhong Xu
International Journal of Plant Genomics | VOL. 2009
Zhiqiu Hu, et. al.Zhiqiu Hu ... Shizhong Xu
01 Jan 2009
International Journal of Plant Genomics | VOL. 2009

Efficient SAS Programs for Computing Path Coefficients and Index Weights for Selection Indices
Manjit S Kang
Journal of Crop Improvement | VOL. 29
Manjit S KangManjit S Kang
02 Jan 2015
Journal of Crop Improvement | VOL. 29

Short communication: Comparison of 2 methods of assessing calf birth weights in dairy calves
N.M Long ... J.F Smith
Journal of Dairy Science | VOL. 95
N.M Long, et. al.N.M Long ... J.F Smith
10 Oct 2012
Journal of Dairy Science | VOL. 95

Using Coding Interviews as an Organizational and Evaluative Framework for a Graduate Course in Programming
Gregory Samsa
Journal of Curriculum and Teaching | VOL. 9
Gregory SamsaGregory Samsa
20 Aug 2020
Journal of Curriculum and Teaching | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Regression Procedures in SAS: Problems?

Abstract

Talk to us

Similar Papers

More From: The American Statistician