Abstract

The term Big Data has recently been used to describe large, highly varied, complex data sets that are created and updated at high speed and require faster processing, namely, a reduced time to filter and analyse relevant data. These data are also increasingly becoming Open Data (data that can be freely distributed), made public by governments, agencies, and private enterprises, among others. At least two issues can obstruct the availability and use of Open Big Datasets. Firstly, the gathering and geoprocessing of these datasets are very computationally intensive; hence, it is necessary to integrate high-performance solutions, preferably internet-based, to achieve the goals. Secondly, the problems of heterogeneity and inconsistency in geospatial data are well known and affect the data integration process, but they are particularly problematic for Big Geo Data. Therefore, Big Geo Data integration will be one of the most challenging issues to solve. With these applications, we demonstrate that it is possible to provide processed Big Geo Data to common users, using open geospatial standards and technologies. NoSQL databases like MongoDB and frameworks like RASDAMAN could offer different functionalities that facilitate working with larger volumes and more heterogeneous geospatial data sources.

Highlights

  • The term Big Data has recently been used to describe large, highly varied, complex data sets that are created and updated at high speed and require faster processing, namely, a reduced time to filter and analyse relevant data

  • According to the 2013 IBM Annual Report (IBM, 2013), 2.5 billion gigabytes of data are created every day, and 80 percent of these data comprise everything from images, video, and audio to social media, telecommunications data, and distributed devices, which are geo-referenced or can be geo-referenced

  • For the 2014 edition, Telecom Italia provided data for two Italian areas: the city of Milan and the Province of Trentino (Barlacchi et al., 2015). These data are available to the public under the Open Database License (ODbL)


Summary

INTRODUCTION

The term Big Data has recently been used to describe large, highly varied, complex data sets that are created and updated at high speed and require faster processing, namely, a reduced time to filter and analyse relevant data. Big Geo Data integration will be one of the most challenging issues to solve. This contribution presents two web applications of Big Geo Data management, which attempt to address these issues, using a freely available large telecommunications dataset from Telecom Italia. Both applications were implemented as demos to test the technologies. One stated objective is to create a data filtering and/or processing system that converts different data formats into one format. Sections three and four describe the Social Media Data Management with RASDAMAN and the Sensing the City, Calls, and Tweets applications, respectively, and their technical implementation. The last section concludes by pointing to future work.
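The objective of converting different data formats into one format can be illustrated with a minimal sketch. It is not the authors' implementation: the source does not say which target format was used, so GeoJSON is chosen here only as a familiar open format, and the column names (`lon`, `lat`, `calls`), the `position` key, and the sample values are all hypothetical.

```python
import csv
import io
import json


def feature(lon, lat, properties):
    """Build a GeoJSON Point feature from a coordinate pair and attributes."""
    return {
        "type": "Feature",
        "geometry": {"type": "Point", "coordinates": [float(lon), float(lat)]},
        "properties": properties,
    }


def from_csv(text):
    """Convert CSV rows with hypothetical 'lon' and 'lat' columns to features."""
    features = []
    for row in csv.DictReader(io.StringIO(text)):
        lon, lat = row.pop("lon"), row.pop("lat")
        features.append(feature(lon, lat, dict(row)))
    return features


def from_json_records(text):
    """Convert a JSON array whose records carry a hypothetical 'position' pair."""
    features = []
    for record in json.loads(text):
        lon, lat = record.pop("position")  # assumed order: [lon, lat]
        features.append(feature(lon, lat, record))
    return features


def merge(*feature_lists):
    """Combine features from all sources into one GeoJSON FeatureCollection."""
    merged = [f for features in feature_lists for f in features]
    return {"type": "FeatureCollection", "features": merged}


# Two made-up sources in different formats, merged into a single collection.
calls_csv = "lon,lat,calls\n9.19,45.46,12\n"
tweets_json = '[{"position": [9.18, 45.47], "text": "hello"}]'
collection = merge(from_csv(calls_csv), from_json_records(tweets_json))
```

Each source gets its own small adapter, and everything downstream works against the single merged collection. A real pipeline would also need to reconcile coordinate reference systems and attribute types, which this sketch leaves out.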

THE TELECOM OPEN DATA
SOCIAL MEDIA DATA MANAGEMENT WITH RASDAMAN
Technical Implementation
Client-side design
Client-side design
14 The importance value is assigned by Twitter
Findings
CONCLUSIONS