TBDClust: Time-based density clustering to enable free browsing of sites in pay-per-use mobile Internet providers

Luis Miguel Torres, Eduardo Magaña, Daniel Morató, Santiago Garcia-Jimenez, Mikel Izal

doi:10.1016/j.jnca.2017.10.007


Open Access

https://doi.org/10.1016/j.jnca.2017.10.007


Abstract

The World Wide Web has evolved rapidly, incorporating new content types and becoming more dynamic. The contents of a website can be distributed across several servers and, as a consequence, web traffic has become increasingly complex. From a network traffic perspective, it can be difficult to ascertain which websites a user is visiting, let alone which part of the user's traffic each website is responsible for. In this paper we present a method for identifying the TCP connections involved in the same full webpage download without the need for deep packet inspection. This identification is needed, for example, to enable free browsing of specific websites over pay-per-use mobile Internet access. Such free browsing could apply not only to third-party promoted websites but also to portals for governmental or medical emergency websites. The proposal is based on a modification of the DBSCAN clustering algorithm that works online and over one-dimensional sorted data. To validate our results we use both real traffic and packet captures from a controlled environment. The proposal achieves excellent results in consistency (99%) and completeness (92%), meaning that its error margin in identifying webpage downloads is minimal.

Highlights

The web is probably the Internet application that has grown and evolved the most during the past two decades.
In this paper we address this problem by presenting a method capable of identifying individual full webpage downloads by clustering related connections together in real time.
After performing a thorough characterization of these captures and testing different approaches to our problem, we present a method based on the DBSCAN (Ester et al., 1996) clustering algorithm, which was designed for density-based clustering in noisy databases.
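
The idea of clustering connections by time, online and over one-dimensional sorted data, can be illustrated with a minimal sketch. This is not the TBDClust algorithm itself, whose details are not given in this excerpt. It is a simplified, gap-based streaming variant in the spirit of DBSCAN: connection start times arrive in non-decreasing order, a gap larger than `eps` closes the current group, and groups with fewer than `min_pts` points are discarded as noise. The class name `OnlineTimeClusterer`, the helper `cluster_all`, and the parameter values are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Optional


@dataclass
class OnlineTimeClusterer:
    """Gap-based streaming clustering of sorted timestamps (illustrative only)."""
    eps: float        # assumed maximum gap (seconds) between neighbours in a cluster
    min_pts: int = 2  # assumed minimum cluster size; smaller groups count as noise
    _current: List[float] = field(default_factory=list)

    def add(self, t: float) -> Optional[List[float]]:
        """Feed one timestamp; return a finished cluster if one just closed."""
        if self._current and t < self._current[-1]:
            raise ValueError("timestamps must arrive in non-decreasing order")
        finished = None
        # A gap larger than eps means the previous group can no longer grow.
        if self._current and t - self._current[-1] > self.eps:
            finished = self._close()
        self._current.append(t)
        return finished

    def flush(self) -> Optional[List[float]]:
        """Close the pending group at end of input."""
        return self._close()

    def _close(self) -> Optional[List[float]]:
        group, self._current = self._current, []
        # Sparse groups are treated as noise and dropped.
        return group if len(group) >= self.min_pts else None


def cluster_all(times: Iterable[float], eps: float, min_pts: int = 2) -> List[List[float]]:
    """Run the streaming clusterer over a whole sorted sequence."""
    clusterer = OnlineTimeClusterer(eps=eps, min_pts=min_pts)
    clusters = []
    for t in times:
        done = clusterer.add(t)
        if done is not None:
            clusters.append(done)
    last = clusterer.flush()
    if last is not None:
        clusters.append(last)
    return clusters


if __name__ == "__main__":
    # Hypothetical TCP connection start times (seconds) for one user.
    starts = [0.0, 0.1, 0.3, 0.35, 5.0, 9.0, 9.2, 9.3]
    print(cluster_all(starts, eps=1.0, min_pts=3))
```

In this sketch each returned group stands for the connections of one presumed webpage download, and the isolated connection at 5.0 s is dropped as noise. Because the input is sorted, each point is examined once and a cluster can be emitted as soon as the gap that ends it is observed, which is what makes online operation possible.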

Summary

Introduction

The web is probably the Internet application that has grown and evolved the most during the past two decades. Services like e-mail, video streaming, on-line games or e-learning are, in many cases, provided through the web, taking advantage of the fact that web browsers are present in almost any network-enabled device and that web traffic usually faces few network restrictions. This ever-increasing popularity of the web has introduced new network requirements, which have pushed for improvements in the web application protocols and the development of new techniques, like content distribution networks (CDNs) (Fortino and Mastroianni, 2009). The web has achieved a remarkable flexibility that allows it to provide a huge range of different services, but at the cost of adding many layers of complexity.



Journal: Journal of Network and Computer Applications
Publication Date: Oct 5, 2017
Citations: 4
License type: cc-by
