Abstract

The advent of big data has aided understanding of the driving forces of human mobility, which is beneficial for many fields, such as mobility prediction, urban planning, and traffic management. However, the data sources used in many studies, such as mobile phone location and geo-tagged social media data, are sparsely sampled in the temporal scale. An individual’s records can be distributed over a few hours a day, or a week, or over just a few hours a month. Thus, the representativeness of sparse mobile phone location data in characterizing human mobility requires analysis before using data to derive human mobility patterns. This paper investigates this important issue through an approach that uses subscriber mobile phone location data collected by a major carrier in Shenzhen, China. A dataset of over 5 million mobile phone subscribers that covers 24 h a day is used as a benchmark to test the representativeness of mobile phone location data on human mobility indicators, such as total travel distance, movement entropy, and radius of gyration. This study divides this dataset by hour, using 2- to 23-h segments to evaluate the representativeness due to the availability of mobile phone location data. The results show that different numbers of hourly segments affect estimations of human mobility indicators and can cause overestimations or underestimations from the individual perspective. On average, the total travel distance and movement entropy tend to be underestimated. The underestimation coefficient results for estimation of total travel distance are approximately linear, declining as the number of time segments increases, and the underestimation coefficient results for estimating movement entropy decline logarithmically as the time segments increase, whereas the radius of gyration tends to be more ambiguous due to the loss of isolated locations. This paper suggests that researchers should carefully interpret results derived from this type of sparse data in the era of big data.

Highlights

  • Understanding human mobility is of crucial importance [1,2], with potential benefits for various fields such as mobility prediction [3,4], urban planning [5,6,7], transportation research [8,9], and humanISPRS Int

  • The data of 5.8 million subscribers were included in this research, it could be used to investigate the effects of different time segments in characterizing human mobility patterns

  • We investigated the representativeness of sparse mobile phone location data in characterizing mobility indicators, which are used for measuring the range of activity space, the travel distance, and the heterogeneity of visitation patterns within activity space

Read more

Summary

Introduction

Understanding human mobility is of crucial importance [1,2], with potential benefits for various fields such as mobility prediction [3,4], urban planning [5,6,7], transportation research [8,9], and human. We qquantitatively analyze the representativeness of mobile phone location data on estimations of individual human mobility patterns. CDRs usually capture individual footprints during phone communicatioonn,, wheerreeaass the actively tracked mobile phone location data contains phone communicatioonn records and location records triggered by location update strategies such as periodic and regular updates and cellular handover. This paper is organized as follows: in the second section, we provide a review of studies related to this research. The last section summarizes our findings and discusses future research directions

Mobile Phone Location Data for Human Mobility Research
Representative Issues of Big Data
Study Area
EExxttrraaccttiinngg VVaalliidd SSuubbsscribbeers
Evaluating the Aggregated Underestimation Coefficient
Individual Perspective
Average Perspective
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call