Large Number Of Zeros Research Articles

ABSTRACT Background Dependent variables in health psychology are often counts, for example, of a behaviour or number of engagements with an intervention. These counts can be very strongly skewed, and/or contain large numbers of zeros as well as extreme outliers. For example, ‘How many cigarettes do you smoke on an average day?’ The modal answer may be zero but may range from 0 to 40+. The same can be true for minutes of moderate-to-vigorous physical activity. For some people, this may be near zero, but take on extreme values for someone training for a marathon. Typical analytical strategies for this data involve explicit (or implied) transformations (smoker v. non-smoker, log transformations). However, these data types are ‘counts’ (i.e. non-negative whole numbers) or quasi-counts (time is ratio but discrete minutes of activity could be analysed as a count), and can be modelled using count distributions – including the Poisson and negative binomial distribution (and their zero-inflated and hurdle extensions, which alloweven more zeros). Methods In this tutorial paper I demonstrate (in R, Jamovi, and SPSS) the easy application of these models to health psychology data, and their advantages over alternative ways of analysing this type of data using two datasets – one highly dispersed dependent variable (number of views on YouTube, and another with a large number of zeros (number of days on which symptoms were reported over a month). Results The negative binomial distribution had the best fit for the overdispersed number of views on YouTube. Negative binomial, and zero-inflated negative binomial were both good fits for the symptom data with over-abundant zeros. Conclusions In both cases, count distributions provided not just a better fit but would lead to different conclusions compared to the poorly fitting traditional regression/linear models.

Read full abstract

The main aim of the article is to present a new forecasting technique, applicable in case of intermittent demand. To present properties of this new technique, the accuracy of the predictions generated by the Croston’s method and by the author’s method (based on stochastic simulation) was analyzed. For comparison, methods such as moving average and simple exponential smoothing are as well used as a reference. Also the SBA method, a modification of Croston’s method, is applied. Croston’s method is an extension of adaptive methods. It separates the interval between the (non‑zero) sales and the sales level. Its purpose is to better forecast intermittent (sporadic) demand. The second prognostic method is the author’s proposal which relies on two stages. In the first stage, based on stochastic simulation, it determines if an event (sale) occurs in a given period. In the second stage, the sales level is estimated (if the previous stage shows that the sales will occur). Due to the strong asymmetry of the sales, the sales level is determined on the basis of the corresponding quantiles. The basis for forecasting are weekly sales series of about fourteen thousand products (real data). The analyzed time series can be defined as atypical, which is manifested by a small number of non‑zero observations (high number of zeros), high volatility and randomness (randomness tests indicate white noise). Forecast error measures are used to characterize both the bias and the efficiency. The forecast error measures will be characterized so that they can be applied to a time series with a large number of zeros (including the author’s forecast error measure proposal). Forecasts were evaluated with respect to the distributions of four ex post errors, such as mean error (ME), mean absolute deviation (MAD), mean absolute scaled error (MASE) and the author’s proposal (error D). The proposed technique, based on stochastic simulation, seems to be the least biased and most efficient. The Croston’s method gives positively biased predictions with rather low efficiency. The proposed forecasting technique might support decisions in enterprises facing the problem of forecasting intermittent demand. The more accurate forecasts could increase the quality of customer service and optimize the inventory level.

Read full abstract

Large Number Of Zeros Research Articles

Articles published on Large Number Of Zeros

A zero-inflated Poisson integer-valued autoregressive model with time-varying coefficients covariates

Score Matching for Compositional Distributions

Generalized zero-inated Poisson regression mixture model for fitting health-related data

Accuracy of Zero Inflated Generalized Poisson Exponentially Moving Average Control Chart

Modeling Hospitalization Decision and Utilization for the Elderly in China

Models for Overdispersion Count Data with Generalized Distribution: An Application to Parasites Intensity

Log-ratio analysis of microbiome data with many zeroes is library size dependent.

Measuring temporal patterns in ecology: The case of mast seeding.

Comparison of zero replacement strategies for compositional data with large numbers of zeros

Too many zeros and/or highly skewed? A tutorial on modelling health behaviour as count data with Poisson and negative binomial regression

A review of models of forest fire occurrence prediction in China

A four-parameter negative binomial-Lindley distribution for modeling over and underdispersed count data with excess zeros

Quasi-binomial zero-inflated regression model suitable for variables with bounded support

Preserving the distribution function in surveys in case of imputation for zero inflated data

A latent allocation model for the analysis of microbial composition and disease

Fat-Tailed Regression Modeling with Spliced Distributions

Fat-Tailed Regression Modeling with Spliced Distributions

New Forecasting Technique for Intermittent Demand, Based on Stochastic Simulation. An Alternative to Croston’s Method

Modeling change trajectories with count and zero-inflated outcomes: Challenges and recommendations

An effective numerical method for solving fractional pantograph differential equations using modification of hat functions

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Large Number Of Zeros Research Articles

Articles published on Large Number Of Zeros

A zero-inflated Poisson integer-valued autoregressive model with time-varying coefficients covariates

Score Matching for Compositional Distributions

Generalized zero-inated Poisson regression mixture model for fitting health-related data

Accuracy of Zero Inflated Generalized Poisson Exponentially Moving Average Control Chart

Modeling Hospitalization Decision and Utilization for the Elderly in China

Models for Overdispersion Count Data with Generalized Distribution: An Application to Parasites Intensity

Log-ratio analysis of microbiome data with many zeroes is library size dependent.

Measuring temporal patterns in ecology: The case of mast seeding.

Comparison of zero replacement strategies for compositional data with large numbers of zeros

Too many zeros and/or highly skewed? A tutorial on modelling health behaviour as count data with Poisson and negative binomial regression

A review of models of forest fire occurrence prediction in China

A four-parameter negative binomial-Lindley distribution for modeling over and underdispersed count data with excess zeros

Quasi-binomial zero-inflated regression model suitable for variables with bounded support

Preserving the distribution function in surveys in case of imputation for zero inflated data

A latent allocation model for the analysis of microbial composition and disease

Fat-Tailed Regression Modeling with Spliced Distributions

Fat-Tailed Regression Modeling with Spliced Distributions

New Forecasting Technique for Intermittent Demand, Based on Stochastic Simulation. An Alternative to Croston’s Method

Modeling change trajectories with count and zero-inflated outcomes: Challenges and recommendations

An effective numerical method for solving fractional pantograph differential equations using modification of hat functions