Abstract

Abstract The LibCity-Dataset represents a significant contribution to the field of urban spatial-temporal data mining. This dataset uniquely integrates macro traffic state data with micro trajectory data, providing researchers with comprehensive and diverse urban spatial-temporal data. Specifically, we begin by collecting and processing existing open-source spatial-temporal data. Subsequently, we independently collected Beijing taxi trajectory data through third-party interfaces. This data bridges the gap in the scarcity of current open-source vehicle trajectory data. The distinctive aspect of the LibCity-Dataset lies in its innovative approach of standardizing the storage format, achieved through the implementation of atomic files. By adopting this standardized format, diverse data sources are harmonized, enabling effortless application of spatial-temporal prediction models across various datasets. The uniform storage format not only simplifies experimentation but also expedites the advancement of spatial-temporal prediction research, acting as a catalyst for further innovation. This Data Note provides a comprehensive insight into the creation methodology of the LibCity-Dataset, including data collection and processing methodology, data description, data validation, and usage notes. By facilitating open-source collaboration and setting a benchmark for standardization within the spatial-temporal prediction domain, this dataset aims to foster increased research cooperation and knowledge sharing. The open-source link of our dataset is https://drive.google.com/drive/folders/1g5v2Gq1tkOq8XO0HDCZ9nOTtRpB6-gPe.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call