Accelerate Literature Icon
Want to do a literature review? Try our new Literature Review workflow

ON A WIDE CLASS OF GENERALIZED GEOMETRIC DISTRIBUTION FOR OVER AND UNDER DISPERSED DATA SETS

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

In this paper, a wide class of generalized geometric distribution is introduced and we name this class of distributions as “the alpha generalized geometric distribution (AGGD)". Several important distributions are obtained as a special cases of this proposed model. Important distributional properties such as generating functions, moments, recursive relations of the proposed distribution are examined. Parameter estimation using maximum likelihood is discussed. Three well-known data sets, having long tails, are analyzed and the results of fitting by various models are provided. Further, the generalized likelihood ratio test procedure is considered for testing the significance of the parameters of the GGD. Finally, performance of the different estimation methods are compared by means of a Monte Carlo simulation.

Similar Papers
  • Research Article
  • Cite Count Icon 3
  • 10.6339/jds.2014.12(1).1213
L-Moments Estimations for Mixture of Weibull Distributions
  • Mar 9, 2021
  • Journal of data science
  • Ülkü Erişoğlu

Mixture of Weibull distributions has wide application in modeling of heterogeneous data sets. The parameter estimation is one of the most important problems related to mixture of Weibull distributions. In this paper, we propose a L-moment estimation method for mixture of two Weibull distributions. The proposed method is compared with maximum likelihood estimation (MLE) method according to the bias, the mean absolute error, the mean total error and completion time of the algorithm (time) by simulation study. Also, applications to real data sets are given to show the flexibility and potentiality of the proposed estimation method. The comparison shows that, the proposed method is better than MLE method.

  • Book Chapter
  • Cite Count Icon 1
  • 10.1007/978-3-030-85254-2_8
A Robust Regression Method Based on Pearson Type VI Distribution
  • Jan 1, 2022
  • Yasin Büyükkör + 1 more

In classical regression analysis, the distribution of the error is assumed to be Gaussian, and Least Squares (LS) estimation method is used for parameter estimation. In practice, even if the distribution of errors is assumed to be Gaussian, residuals are not generally Gaussian. If the data set contains outlier (s) or there are observations that are suspected to be outlier, normality assumption is violated, and parameter estimates will be biased. Many statisticians used robust method, such as the M-Estimation Method, which is a generalized version of the Maximum Likelihood (ML) Estimation method, for parameter estimation when such problems occurred. However, if the data set has skewness and excess kurtosis, traditional M-Estimators cannot achieve a good solution. In this study, using the relationship between Pearson Differential Equation (PDE) and Influence Function (IF), M-Estimation method is proposed for data sets that follow Pearson Type VI (PVI) distribution. The advantage of this method takes into account the skewness and kurtosis values of the data set and generates dynamic solutions. Objective, influence, weight functions and tail properties of the PVI distribution are obtained by using the Probability Density Function (pdf) of the PVI distribution. For the regression parameter estimates, Iteratively Re-Weighted Least Squares Estimation Method (IRWLS) is used. In many simulation studies with different scenarios and applications with real data, if the data have skewness and excess kurtosis, the proposed method has achieved better results than other M-Estimation methods in terms of Total Absolute Deviation (TAB) and Mean Square Error (MSE).KeywordsM-Estimation methodRobust regressionPearson type VI distributionInfluence functionIteratively re-weighted least squares method

  • Research Article
  • Cite Count Icon 3
  • 10.1080/09720510.2020.1864940
The AGU-F distribution : Properties and applications on over and underdispersed data
  • Jun 8, 2021
  • Journal of Statistics and Management Systems
  • Friday Ikechukwu Agu + 2 more

In this paper, we introduced a one-parameter lifetime distribution called the Agu-F distribution (Agu-F-D) and explored some of its useful statistical properties. The numerical applications of the Agu-F-D was examined using over and under dispersed, low and high kurtosis, positively, and negatively skewed three lifetime data sets. The parameter of the distribution was obtained using the maximum likelihood estimation method. The results obtained indicated that the proposed Agu-F-D performed best to the high kurtosis, positively, skewed, over and under dispersed data sets under consideration than the Lindley, Exponential and Ishita distributions. Thus, it can be used as an alternative model.

  • Research Article
  • Cite Count Icon 354
  • 10.1080/10705511.2014.859510
Relative Performance of Categorical Diagonally Weighted Least Squares and Robust Maximum Likelihood Estimation
  • Jan 2, 2014
  • Structural Equation Modeling: A Multidisciplinary Journal
  • Deborah L Bandalos

Robust maximum likelihood (ML) and categorical diagonally weighted least squares (cat-DWLS) estimation have both been proposed for use with categorized and nonnormally distributed data. This study compares results from the 2 methods in terms of parameter estimate and standard error bias, power, and Type I error control, with unadjusted ML and WLS estimation methods included for purposes of comparison. Conditions manipulated include model misspecification, level of asymmetry, level and categorization, sample size, and type and size of the model. Results indicate that cat-DWLS estimation method results in the least parameter estimate and standard error bias under the majority of conditions studied. Cat-DWLS parameter estimates and standard errors were generally the least affected by model misspecification of the estimation methods studied. Robust ML also performed well, yielding relatively unbiased parameter estimates and standard errors. However, both cat-DWLS and robust ML resulted in low power under conditions of high data asymmetry, small sample sizes, and mild model misspecification. For more optimal conditions, power for these estimators was adequate.

  • Research Article
  • Cite Count Icon 22
  • 10.1088/1741-2552/acf1cc
Three novel methods for determining motor threshold with transcranial magnetic stimulation outperform conventional procedures
  • Sep 13, 2023
  • Journal of Neural Engineering
  • Boshuo Wang + 2 more

Objective. Thresholding of neural responses is central to many applications of transcranial magnetic stimulation (TMS), but the stochastic aspect of neuronal activity and motor evoked potentials (MEPs) challenges thresholding techniques. We analyzed existing methods for obtaining TMS motor threshold and their variations, introduced new methods from other fields, and compared their accuracy and speed. Approach. In addition to existing relative-frequency methods, such as the five-out-of-ten method, we examined adaptive methods based on a probabilistic motor threshold model using maximum-likelihood (ML) or maximum a-posteriori (MAP) estimation. To improve the performance of these adaptive estimation methods, we explored variations in the estimation procedure and inclusion of population-level prior information. We adapted a Bayesian estimation method which iteratively incorporated information of the TMS responses into the probability density function. A family of non-parametric stochastic root-finding methods with different convergence criteria and stepping rules were explored as well. The performance of the thresholding methods was evaluated with an independent stochastic MEP model. Main Results. The conventional relative-frequency methods required a large number of stimuli, were inherently biased on the population level, and had wide error distributions for individual subjects. The parametric estimation methods obtained the thresholds much faster and their accuracy depended on the estimation method, with performance significantly improved when population-level prior information was included. Stochastic root-finding methods were comparable to adaptive estimation methods but were much simpler to implement and did not rely on a potentially inaccurate underlying estimation model. Significance. Two-parameter MAP estimation, Bayesian estimation, and stochastic root-finding methods have better error convergence compared to conventional single-parameter ML estimation, and all these methods require significantly fewer TMS pulses for accurate estimation than conventional relative-frequency methods. Stochastic root-finding appears particularly attractive due to the low computational requirements, simplicity of the algorithmic implementation, and independence from potential model flaws in the parametric estimators.

  • Conference Article
  • Cite Count Icon 5
  • 10.1109/sam.2006.1706163
Parametric GLRT for Multichannel Adaptive Signal Detection
  • Jul 12, 2006
  • Kwang June Sohn + 2 more

We consider herein the problem of detecting a multichannel signal in the presence of spatially and temporally colored disturbance. A parametric generalized likelihood ratio test (GLRT) is developed by modeling the disturbance as a multichannel autoregressive (AR) process. The parametric GLRT differs from Kelly's widely known GLRT which does not utilize any parametric model for the disturbance signal. Maximum likelihood (ML) parameter estimation underlying the parametric GLRT is examined. It is shown that the ML estimator for the alternative hypothesis is non-linear and there exists no closed-form expression. An alternative asymptotic ML (AML) estimator is presented, which yields asymptotically optimum parameter estimates at a reduced complexity. The performance of the parametric GLRT is studied by considering challenging cases with limited or no training signals for parameter estimation. Such cases (especially when training is unavailable) are of great interest in detecting signals in heterogeneous, fast changing, or dense-target environments. Compared with the recently introduced parametric adaptive matched filter (PAMF) and parametric Rao detectors, the parametric GLRT achieves higher data efficiency, offering improved detection performance in general

  • Research Article
  • Cite Count Icon 6
  • 10.1016/j.cie.2021.107188
Estimating the failure rate of the log-logistic distribution by smooth adaptive and bias-correction methods
  • Mar 1, 2021
  • Computers & Industrial Engineering
  • Xi Zheng + 3 more

Estimating the failure rate of the log-logistic distribution by smooth adaptive and bias-correction methods

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 6
  • 10.4236/ojs.2016.66091
Comparison of Two Time Series Decomposition Methods: Least Squares and Buys-Ballot Methods
  • Jan 1, 2016
  • Open Journal of Statistics
  • I S Iwueze + 3 more

This paper discusses comparison of two time series decomposition methods: The Least Squares Estimation (LSE) and Buys-Ballot Estimation (BBE) methods. As noted by Iwueze and Nwogu (2014), there exists a research gap for the choice of appropriate model for decomposition and detection of presence of seasonal effect in a series model. Estimates of trend parameters and seasonal indices are all that are needed to fill the research gap. However, these estimates are obtainable through the Least Squares Estimation (LSE) and Buys-Ballot Estimation (BBE) methods. Hence, there is need to compare estimates of the two methods and recommend. The comparison of the two methods is done using the Accuracy Measures (Mean Error (ME)), Mean Square Error (MSE), the Mean Absolute Error (MAE), and the Mean Absolute Percentage Error (MAPE). The results from simulated series show that for the additive model; the summary statistics (ME, MSE and MAE) for the two estimation methods and for all the selected trending curves are equal in all the simulations both in magnitude and direction. For the multiplicative model, results show that when a series is dominated by trend, the estimates of the parameters by both methods become less precise and differ more widely from each other. However, if conditions for successful transformation (using the logarithmic transform in linearizing the multiplicative model to additive model) are met, both of them give similar results.

  • Research Article
  • Cite Count Icon 38
  • 10.1007/s12040-017-0863-6
Spatial hydrological drought characteristics in Karkheh River basin, southwest Iran using copulas
  • Aug 1, 2017
  • Journal of Earth System Science
  • Esmaeel Dodangeh + 3 more

Investigation on drought characteristics such as severity, duration, and frequency is crucial for water resources planning and management in a river basin. While the methodology for multivariate drought frequency analysis is well established by applying the copulas, the estimation on the associated parameters by various parameter estimation methods and the effects on the obtained results have not yet been investigated. This research aims at conducting a comparative analysis between the maximum likelihood parametric and non-parametric method of the Kendall \(\tau \) estimation method for copulas parameter estimation. The methods were employed to study joint severity–duration probability and recurrence intervals in Karkheh River basin (southwest Iran) which is facing severe water-deficit problems. Daily streamflow data at three hydrological gauging stations (Tang Sazbon, Huleilan and Polchehr) near the Karkheh dam were used to draw flow duration curves (FDC) of these three stations. The \(Q_{75}\) index extracted from the FDC were set as threshold level to abstract drought characteristics such as drought duration and severity on the basis of the run theory. Drought duration and severity were separately modeled using the univariate probabilistic distributions and gamma–GEV, LN2–exponential, and LN2–gamma were selected as the best paired drought severity–duration inputs for copulas according to the Akaike Information Criteria (AIC), Kolmogorov–Smirnov and chi-square tests. Archimedean Clayton, Frank, and extreme value Gumbel copulas were employed to construct joint cumulative distribution functions (JCDF) of droughts for each station. Frank copula at Tang Sazbon and Gumbel at Huleilan and Polchehr stations were identified as the best copulas based on the performance evaluation criteria including AIC, BIC, log-likelihood and root mean square error (RMSE) values. Based on the RMSE values, nonparametric Kendall-\(\tau \) is preferred to the parametric maximum likelihood estimation method. The results showed greater drought return periods by the parametric ML method in comparison to the nonparametric Kendall \(\tau \) estimation method. The results also showed that stations located in tributaries (Huleilan and Polchehr) have close return periods, while the station along the main river (Tang Sazbon) has the smaller return periods for the drought events with identical drought duration and severity.

  • Research Article
  • Cite Count Icon 102
  • 10.1375/1369052042663832
Effects of Censoring on Parameter Estimates and Power in Genetic Modeling
  • Dec 1, 2004
  • Twin Research
  • Eske M Derks + 2 more

Genetic and environmental influences on variance in phenotypic traits may be estimated with normal theory Maximum Likelihood (ML). However, when the assumption of multivariate normality is not met, this method may result in biased parameter estimates and incorrect likelihood ratio tests. We simulated multivariate normal distributed twin data under the assumption of three different genetic models. Genetic model fitting was performed in six data sets: multivariate normal data, discrete uncensored data, censored data, square root transformed censored data, normal scores of censored data, and categorical data. Estimates were obtained with normal theory ML (data sets 1-5) and with categorical data analysis (data set 6). Statistical power was examined by fitting reduced models to the data. When fitting an ACE model to censored data, an unbiased estimate of the additive genetic effect was obtained. However, the common environmental effect was underestimated and the unique environmental effect was overestimated. Transformations did not remove this bias. When fitting an ADE model, the additive genetic effect was underestimated while the dominant and unique environmental effects were overestimated. In all models, the correct parameter estimates were recovered with categorical data analysis. However, with categorical data analysis, the statistical power decreased. The analysis of L-shaped distributed data with normal theory ML results in biased parameter estimates. Unbiased parameter estimates are obtained with categorical data analysis, but the power decreases.

  • Research Article
  • Cite Count Icon 79
  • 10.1111/j.1365-246x.1988.tb03433.x
Parameter estimation in non-Gaussian noise
  • Jul 1, 1988
  • Geophysical Journal International
  • C G Constable

SUMMARY Least squares (LS) estimation of model parameters is widely used in geophysics. If the data errors are Gaussian and independent the LS estimators will be maximum likelihood (ML) estimators and will be unbiased and of minimum variance. However, if the noise is not Gaussian, e.g. if the data are contaminated by extreme outliers, LS fitting will result in parameter estimates which may be biased or grossly inaccurate. When the probability distribution of the errors is known it is possible, using the maximum likelihood method, to obtain consistent and efficient (minimum variance) estimates of parameters. In some cases the distribution of the noise may be determined empirically, and the resulting distribution used in the ML estimation. A procedure for doing this is described here. Hourly values of geomagnetic observatory data are used to illustrate the technique. These data sets contain a number of periodic components, whose amplitudes and phases are geophysically interesting. Geomagnetic storms and other phenomena in the record make the noise distribution long-tailed, asymmetric and variable with location. Using an iterative procedure, one can model the form of these distributions using smoothing splines. For these data ML estimation yields quite different results from standard robust and LS procedures. The technique has the potential for widespread application to other problems involving the recovery of a known form of signal from non-Gaussian noise.

  • Research Article
  • Cite Count Icon 29
  • 10.1515/97
Estimation of Parameters in 3-Parameter Weibull Probability Distribution Functions
  • Sep 1, 2005
  • Hungarian Journal of Industry and Chemistry
  • Rein Luus + 1 more

Three-parameter Weibull probability distribution function is used to represent the time-to-failure data. For parameter estimation, the errors-in-variables, maximum likelihood, and least squares methods are compared. The results obtained from five data sets show that maximum likelihood method gives the most reliable parameter estimates which are close to those obtained by errors-in-variables approach. The least squares method gave the poorest results in most cases. For parameter estimation, the Luus-Jaakola direct search optimization procedure yielded the optimal parameter values in negligible computation time, taking less than a second of computation time on a Pentium4/2.66MHz personal computer for 50 data points.

  • Research Article
  • Cite Count Icon 1
  • 10.1080/03610918.2025.2539171
Discrete Weibull geometric regression model-estimation and applications
  • Jul 29, 2025
  • Communications in Statistics - Simulation and Computation
  • P P Prinsha + 1 more

The primary objective of statistical modeling is to develop a mathematical structure that can capture the average effect of a given set of explanatory variables on the endogenous variable called regression models. Such models are used to quantify the uncertainty and study the heterogeneity in the populations. In this paper a new regression model for count data based on Discrete Weibull geometric (DWG) distribution is developed and the maximum-likelihood (ML) estimation of the parameters are illustrated. A modification of this regression model called zero-inflated DWG (ZIDWG) regression is also introduced, to model the count data sets with excess zeros, and discussed the EM algorithm for the numerical computation of the maximum-likelihood estimates of the model parameters. The performance of the models are numerically evaluated using different simulated datasets. The usefulness of the newly developed models are illustrated using different real datasets and models were compared using the AIC and BIC values.

  • Research Article
  • 10.1080/02664763.2015.1125866
Stochastic analysis of covariance when the error distribution is long-tailed symmetric
  • Dec 30, 2015
  • Journal of Applied Statistics
  • Pelin Kasap + 2 more

ABSTRACTIn this study, we consider stochastic one-way analysis of covariance model when the distribution of the error terms is long-tailed symmetric. Estimators of the unknown model parameters are obtained by using the maximum likelihood (ML) methodology. Iteratively reweighting algorithm is used to compute the ML estimates of the parameters. We also propose new test statistic based on ML estimators for testing the linear contrasts of the treatment effects. In the simulation study, we compare the efficiencies of the traditional least-squares (LS) estimators of the model parameters with the corresponding ML estimators. We also compare the power of the test statistics based on LS and ML estimators, respectively. A real-life example is given at the end of the study.

  • Research Article
  • Cite Count Icon 34
  • 10.1080/02664763.2021.1928018
A discrete analogue of odd Weibull-G family of distributions: properties, classical and Bayesian estimation with applications to count data
  • May 21, 2021
  • Journal of Applied Statistics
  • M El-Morshedy + 2 more

In the statistical literature, several discrete distributions have been developed so far. However, in this progressive technological era, the data generated from different fields is getting complicated day by day, making it difficult to analyze this real data through the various discrete distributions available in the existing literature. In this context, we have proposed a new flexible family of discrete models named discrete odd Weibull-G (DOW-G) family. Its several impressive distributional characteristics are derived. A key feature of the proposed family is its failure rate function that can take a variety of shapes for distinct values of the unknown parameters, like decreasing, increasing, constant, J-, and bathtub-shaped. Furthermore, the presented family not only adequately captures the skewed and symmetric data sets, but it can also provide a better fit to equi-, over-, under-dispersed data. After producing the general class, two particular distributions of the DOW-G family are extensively studied. The parameters estimation of the proposed family, are explored by the method of maximum likelihood and Bayesian approach. A compact Monte Carlo simulation study is performed to assess the behavior of the estimation methods. Finally, we have explained the usefulness of the proposed family by using two different real data sets.

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant