Abstract
Applications with mobile and sensing devices have already become ubiquitous. In most of these applications, trajectory data is continuously growing to huge volumes. Existing systems for big trajectory data organize trajectories at distributed block storage systems. Systems like DITA that use block storage (e.g., 128 MB each) are more efficient for analytical queries, but they cannot incrementally maintain the partitioned data and do not support delete operations, resulting in difficulties in trajectory analytics. In this paper, we propose an incremental trajectory partitioning framework Tinba that enables distributed block storage systems to efficiently maintain optimized partitions under incremental updates of trajectories. We employ a data flushing technique to bulk ingest trajectory data for random writing in distributed file system (DFS). We recast the incremental partitioning problem as an optimal partitioning problem and prove its NP-hardness. A cost–benefit model is proposed to address the optimal partitioning problem. Moreover, Tinba supports most of the existing similarity measures to quantify the similarity between trajectories. A heuristic is developed to instantiate the Tinba framework. Comprehensive experiments on real-world and synthetic datasets demonstrate the advancements in ingestion performance and partition quality, as opposed to other trajectory partition methods.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.