Abstract

IP-based geolocation is essential for various location-aware Internet applications, such as online advertisement, content delivery, and online fraud prevention. Achieving accurate geolocation enormously relies on the number of high-quality (i.e., the fine-grained and stable over time) landmarks. However, the previous efforts of garnering landmarks have been impeded by the limited visible landmarks on the Internet and manual time cost. In this paper, we leverage the availability of numerous online webcams used to monitor physical surroundings as a rich source of promising high-quality landmarks for serving IP-based geolocation. In particular, we present a new framework called GeoCAM, which is designed to automatically generate qualified landmarks from online webcams, providing an IP-based geolocation service with high accuracy and wide coverage. GeoCAM periodically monitors websites hosting live webcams and uses the natural language processing technique to extract the IP addresses and latitude/longitude of webcams for generating landmarks at a large-scale. Given latency and topology constraints among webcam landmarks, GeoCAM uses the maximum likelihood estimation to approximately pinpoint the geolocation of a target host. We develop a prototype of GeoCAM and conduct real-world experiments for validating its efficacy. Our results show that GeoCam can detect 282,902 live webcams hosted in webpages with 94.2% precision and 90.4% recall, and then generate 16,863 stable and fine-grained landmarks, which are two orders of magnitude more than the landmarks used in prior works. To demonstrate the superiority of using large-scale webcams as landmarks, we implement four different geolocation algorithms and compare their performance between webcam landmarks and open-source landmarks. The evaluation results show that all the algorithms can significantly improve geolocation accuracy by using webcam landmarks.

Highlights

  • I P-BASED geolocation is used to determine the real-world geographic location of an Internet-connected host [1], which is valuable for many Internet applications, including targeted advertising, content delivery, and online fraud detection

  • We proposed a framework GeoCAM to automatically generate webcam landmarks and provide IP-based geolocation services

  • We demonstrated the viability of different geolocation algorithms, including Constraint-Based Geolocation (CBG), Octant, Spotter, and our hybrid algorithms

Read more

Summary

Introduction

I P-BASED geolocation is used to determine the real-world geographic location of an Internet-connected host [1], which is valuable for many Internet applications, including targeted advertising, content delivery, and online fraud detection. Providing an accurate geolocation service is heavily dependent on the number of high-quality landmarks. Geolocation services would provide high accurate mappings between IP addresses and geographical locations. Efforts to gather such mapping information are impeded by the limitations of available landmarks on the Internet. Those landmarks are community-based [3]–[5], leading to a limited scale in terms of their number and coverage. PlanetLab [5] only includes 420 available landmarks, and most of them are located in Europe and North America. Prior works proposed to increase the number of landmarks by mining web services [6], [7]. Online webcams have great potential to be used as promising landmarks but have not yet been explored

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.