Empirical Distribution Of Data Research Articles

Speed is one of the most important parameters describing the condition of the traffic flow. Many analytical models related to traffic flow either produce speed as a performance measure, or use speed to determine other measures such as travel time, delay, and the level of service. Mathematical models or distributions used to describe speed characteristics are very useful, especially when they are utilized in the context of simulation and theoretical derivations. Traditionally, normal, log–normal and composite distributions have been the usual mathematical distributions to characterize speed data. These traditional distributions, however, often fail to produce an adequate goodness-of-fit when the empirical distribution of speed data exhibits bimodality (or multimodality), skewness, or excess kurtosis (peakness). This often occurs when the speed data are generated from several different sub-populations, for example, mixed traffic flow conditions or mixed vehicle compositions. The traditional modeling approach also lacks the ability to explain the underlying factors that lead to different speed distribution curves. The objective of this paper is to explore the applicability of the finite mixture of normal (Gaussian) distributions to capture the heterogeneity in vehicle speed data, and thereby explaining the aforementioned special characteristics. For the parameter estimation, Bayesian estimation method via Markov Chain Monte Carlo (MCMC) sampling is adopted. The field data collected on IH-35 in Texas is used to evaluate the proposed models. The results of this study show that the finite mixture of normal distributions can very effectively describe the heterogeneous speed data, and provide richer information usually not available from the traditional models. The finite mixture modeling produces an excellent fit to the multimodal speed distribution curve. Moreover, the causes of different speed distributions can be identified through investigating the components.

Read full abstract

Exploratory data analysis (EDA) techniques based on the boxplot and robust-class selection were applied to the analysis of single-element stream sediment data in the Collo area (N-E Algeria). The area is characterised by many factors that affect data variability: variation of lithology, combined existence of permanent and ephemeral streams, flash floods, rugged terrain, and climate. The boxplot proved to be very useful in capturing the empirical data distribution, the skewness, and in defining outliers. No data transformation was needed prior to the analysis of single element distributions (Cr, Pb, Zn, Cu, As, and Fe) as is the case in classical statistics. Geochemical mapping of these elements was based on resistant class selection as defined by the boxplot. Results showed the close spatial correlation of outlier data for Cr with a plagioclase-lherzolite intrusion and known chromite pods. The geochemical maps of Pb, Zn, Cu and As concentrations showed an association of these elements coincident with known base-metal sulphides and arsenopyrite mineralisation, delineating a northeastern anomaly spreading well over known mineralisation. The choice of the robust classes based on the boxplot also showed the close spatial distribution of Fe with known hematite and magnetite mineralization. EDA techniques proved to be very useful in delineating known mineralisation in the Collo area where stream sediment data is subject to variability owing to many factors (geologic, physiographic and climatic). EDA proved to be a simple and very useful tool in analysing single-element geochemical data. EDA could be used in similar terrains as an alternative to the classical methods where a normality precondition is needed prior to analysis and where class selection may be affected by the presence of “wild” data.

Read full abstract

Empirical Distribution Of Data Research Articles

Related Topics

Articles published on Empirical Distribution Of Data

Differential analysis of Operating System indicators for anomaly detection in dependable systems: An experimental study

Statistical analysis of bistatic and monostatic sea clutter

Evolving Scale-Free Networks by Poisson Process: Modeling and Degree Distribution.

Time-Efficient Algorithms for Robust Estimators of Location, Scale, Symmetry, and Tail heaviness

Confidence interval of nonlinear regression of restoration time of network terminal devices

A Nash equilibrium simulation model for the competitiveness evaluation of the auction based day ahead electricity market

On the approximation of empirical data for service system simulations

Development of Method for Three-Point Data Estimation and SVR-QSAR Model to Screen Anti Cancer Leads

Optimization of the number of components in the mixed model using multi-criteria decision-making

Bayesian mixture modeling approach to account for heterogeneity in speed data

Financial Data and the Skewed Generalized T Distribution

Extending Lotkaian informetrics

Using Bayesian networks for bankruptcy prediction: Some methodological issues

Extended Correlation

Background and threshold: critical comparison of methods of determination

Quantification of Variability and Uncertainty for Air Toxic Emission Inventories with Censored Emission Factor Data

An out-of-equilibrium model of the distributions of wealth

MODELING FINANCIAL SERIES DISTRIBUTIONS: A VERSATILE DATA FITTING APPROACH

Parametric and non-parametric statistical analysis of DT-MRI data

An application of exploratory data analysis (EDA) as a robust non-parametric technique for geochemical mapping in a semi-arid climate

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Empirical Distribution Of Data Research Articles

Related Topics

Articles published on Empirical Distribution Of Data

Differential analysis of Operating System indicators for anomaly detection in dependable systems: An experimental study

Statistical analysis of bistatic and monostatic sea clutter

Evolving Scale-Free Networks by Poisson Process: Modeling and Degree Distribution.

Time-Efficient Algorithms for Robust Estimators of Location, Scale, Symmetry, and Tail heaviness

Confidence interval of nonlinear regression of restoration time of network terminal devices

A Nash equilibrium simulation model for the competitiveness evaluation of the auction based day ahead electricity market

On the approximation of empirical data for service system simulations

Development of Method for Three-Point Data Estimation and SVR-QSAR Model to Screen Anti Cancer Leads

Optimization of the number of components in the mixed model using multi-criteria decision-making

Bayesian mixture modeling approach to account for heterogeneity in speed data

Financial Data and the Skewed Generalized T Distribution

Extending Lotkaian informetrics

Using Bayesian networks for bankruptcy prediction: Some methodological issues

Extended Correlation

Background and threshold: critical comparison of methods of determination

Quantification of Variability and Uncertainty for Air Toxic Emission Inventories with Censored Emission Factor Data

An out-of-equilibrium model of the distributions of wealth

MODELING FINANCIAL SERIES DISTRIBUTIONS: A VERSATILE DATA FITTING APPROACH

Parametric and non-parametric statistical analysis of DT-MRI data

An application of exploratory data analysis (EDA) as a robust non-parametric technique for geochemical mapping in a semi-arid climate