Abstract

In data analysis, selecting a proper statistical model is a challenging issue. Upon the selection, there are other important factors impacting the results. In this article, two statistical models, a General Linear Model (GLM) and the Ratio Estimator will be compared. Where applicable, some issues such as heteroscedasticity, outliers, etc. and the role they play in data analysis will be studied. For reducing the severity of heteroscedasticity, Weighted Least Square (WLS), Generalized Least Square (GLS), and Feasible Generalized Least Square (FGLS) will be deployed. Also, a revised version of FGLS is introduced. Since these issues are data dependent, shrimp effort data collected in the Gulf of Mexico for the years 2005 through 2018 will be used and it is shown that the revised FGLS reduces the impact of heteroscedasticity significantly compared to that of FGLS. The data sets will also be checked for the outliers and corrections are made (where applicable). It is concluded that these issues play a significant role in data analysis and must be taken seriously. Further, the two statistical models, that is, the GLM and the Ratio Estimator are compared.

Highlights

  • Introduction and BackgroundSelecting a model which satisfactorily represents a given data set is very challenging

  • A revised version of Feasible Generalized Least Square (FGLS) is introduced. Since these issues are data dependent, shrimp effort data collected in the Gulf of Mexico for the years 2005 through 2018 will be used and it is shown that the revised FGLS reduces the impact of heteroscedasticity significantly compared to that of FGLS

  • Heteroscedasticity is a very important topic in data analysis where it refers to the circumstance in which the variability of a variable is unequal across the range of values of the variables that predict it

Read more

Summary

A Comparison of a General Linear Model and the Ratio Estimator

Morteza Marzjarani Correspondence: National Marine Fisheries Service, Southeast Fisheries Science Center, Galveston Laboratory, 4700AvenueU, Galveston, Texas 77551, USA. Received: March 8, 2020 Accepted: April 14, 2020 Online Published: April 15, 2020 doi:10.5539/ijsp.v9n3p54

Introduction and Background
Data Files
Model Selection
The Process of Testing and Selecting the Model Covariates
Collinearity
Heteroscedasticity
Residual Analysis
Outliers
Model Predictability Feature
Ratio Estimator Efforts and Its Comparison to the GLM Estimates
Findings
Alternative Models

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.