Abstract

Cellular signaling data (CSD) is a new data source of intelligent transportation and has great potential in the study of human mobility. However, research recently has shown that the privacy of individual trajectories of travelers can be easily leaked due to their unique travel patterns, which will lead to the disclosure of a traveler’s trajectory privacy. To address this issue, the risk of privacy leakage in a CSD data set was quantitatively measured due to reidentification attacks. To reduce the risk of trajectory privacy leakage, a spatial generalization method based on an improved genetic algorithm (IGA) was proposed. We transformed the base station aggregation problem that meets the k-anonymity requirement into a shortest path problem. The data utility loss after privacy preservation was evaluated based on origin–destination (OD) flow analysis of traffic analysis zones (TAZs). Finally, the relationships between privacy preservation and data utility were revealed. In all of the cases, more privacy protection corresponded to less data utility. The discovered relationships provide useful guidance for data publishers on how to choose the right tradeoff between privacy protection and data utility when publishing or sharing such sensitive data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call