Impact of data gaps on the accuracy of annual and monthly average daily bicycle volume calculation at permanent count stations

Mohamed El Esawey

doi:10.1016/j.compenvurbsys.2018.03.002

Abstract

This research explores the impact of the missing rate of cycling count data on the accuracy of monthly and annual average daily bicycle volume estimates (MADB and AADB). The study used a full year of daily bicycle counts at six count stations in Vancouver, Canada. Two missing data patterns were simulated: Missing Completely at Random (MCR) and Not Missing at Random (NMR), also known as the systematic pattern. In the first pattern, repeated random samples of daily bicycle counts at different missing rates were drawn from the full data set and used to calculate MADBs and AADB at each count station. In the second pattern, continuous data gaps of one week to four months were assumed, and MADBs and AADBs were calculated from the remaining data. The estimates calculated from incomplete data were compared to the values calculated from the full data set, and the errors for each scenario were determined. The results showed that the impact of missing counts on the estimation accuracy of the AADB is minimal: the errors did not exceed 5%, even for high missing rates, provided that the data are missing at random and at least a few samples cover each month of the year. On the other hand, the estimation errors of MADBs were found to be relatively high when the missing rates were high. These results indicated that even if half of the permanent counter data is missing at random, the maximum estimation error would not exceed 14%. The combined impact of AADB and MADB estimation was captured by comparing the monthly factors (MFs) calculated using full data with those calculated using incomplete data. The results showed maximum errors of 94% and 34% for missing rates of 90% and 70%, respectively. For the scenario of long data gaps, the maximum estimation error of AADB ranged between 1.5% and 21.1% when data was missing for one week to four months. Disaggregate error analysis showed that missing July data would have the most negative impact on the estimation accuracy of AADB.
Finally, a Multiple Imputation (MI) method was applied to fill in data gaps for high missing rates. The method led to a maximum AADB estimation error of <3% even if four months of data were continuously missing at one count station.
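
The two missing data patterns described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the daily counts are synthetic (they are not the Vancouver data), the function names are hypothetical, and AADB is taken as the simple mean of the available MADBs, which may differ from the formula used in the study. The sketch does not attempt the Multiple Imputation step.

```python
import math
import random
from datetime import date, timedelta
from statistics import mean


def synthetic_counts(year=2015, seed=0):
    """Hypothetical daily counts with a mid-July peak (NOT the study's data)."""
    rng = random.Random(seed)
    counts = {}
    day = date(year, 1, 1)
    while day.year == year:
        doy = day.timetuple().tm_yday
        seasonal = 1000 + 600 * math.sin(2 * math.pi * (doy - 105) / 365)
        counts[day] = seasonal * rng.uniform(0.8, 1.2)
        day += timedelta(days=1)
    return counts


def madb(counts):
    """Monthly average daily bicycle volume from whichever days are present."""
    by_month = {}
    for day, volume in counts.items():
        by_month.setdefault(day.month, []).append(volume)
    return {month: mean(values) for month, values in by_month.items()}


def aadb(counts):
    """AADB as the mean of the available MADBs (an assumed formula)."""
    return mean(madb(counts).values())


def drop_random(counts, missing_rate, rng):
    """Random pattern: keep a random subset of the days."""
    days = sorted(counts)
    n_keep = max(1, round(len(days) * (1 - missing_rate)))
    return {d: counts[d] for d in rng.sample(days, n_keep)}


def drop_gap(counts, start, n_days):
    """Systematic pattern: remove one continuous block of days."""
    gap = {start + timedelta(days=i) for i in range(n_days)}
    return {d: v for d, v in counts.items() if d not in gap}


def pct_error(estimate, actual):
    """Absolute percentage error of an estimate against the full-data value."""
    return abs(estimate - actual) / actual * 100


full = synthetic_counts()
true_aadb = aadb(full)
rng = random.Random(42)

# Repeated random samples at several missing rates; report the worst case.
for rate in (0.1, 0.5, 0.9):
    errors = [pct_error(aadb(drop_random(full, rate, rng)), true_aadb)
              for _ in range(200)]
    print(f"random, {rate:.0%} missing: max AADB error {max(errors):.1f}%")

# One continuous gap covering all of July.
july_gap = drop_gap(full, date(2015, 7, 1), 31)
gap_err = pct_error(aadb(july_gap), true_aadb)
print(f"July missing: AADB error {gap_err:.1f}%")
```

Because the synthetic series peaks in July, removing that month pulls the estimated AADB below the full-data value. The size of the error printed here depends entirely on the made-up seasonal curve and should not be compared with the figures reported in the abstract.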

Journal: Computers, Environment and Urban Systems
Publication Date: Mar 15, 2018
Citations: 5

