Abstract

Abstract The LibCity-Dataset represents a significant contribution to the field of urban spatial-temporal data mining. This dataset uniquely integrates macro traffic state data with micro trajectory data, providing researchers with comprehensive and diverse urban spatial-temporal data. Specifically, we begin by collecting and processing existing open-source spatial-temporal data. Subsequently, we independently collected Beijing taxi trajectory data through third-party interfaces. This data bridges the gap in the scarcity of current open-source vehicle trajectory data. The distinctive aspect of the LibCity-Dataset lies in its innovative approach of standardizing the storage format, achieved through the implementation of atomic files. By adopting this standardized format, diverse data sources are harmonized, enabling effortless application of spatial-temporal prediction models across various datasets. The uniform storage format not only simplifies experimentation but also expedites the advancement of spatial-temporal prediction research, acting as a catalyst for further innovation. This Data Note provides a comprehensive insight into the creation methodology of the LibCity-Dataset, including data collection and processing methodology, data description, data validation, and usage notes. By facilitating open-source collaboration and setting a benchmark for standardization within the spatial-temporal prediction domain, this dataset aims to foster increased research cooperation and knowledge sharing. The open-source link of our dataset is https://drive.google.com/drive/folders/1g5v2Gq1tkOq8XO0HDCZ9nOTtRpB6-gPe.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.