Identifying legitimate Web users and bots with different traffic profiles — an Information Bottleneck approach

Grażyna Suchacka,Jacek Iwański

doi:10.1016/j.knosys.2020.105875

Grażyna Suchacka, Jacek Iwański

Open Access

https://doi.org/10.1016/j.knosys.2020.105875

Copy DOI

Journal: Knowledge-Based Systems	Publication Date: Apr 11, 2020
Citations: 21	License type: cc-by-nc-nd

Affiliation: Opole University

Abstract

Recent studies reported that about half of Web users nowadays are intelligent agents (Web bots). Many bots are impersonators operating at a very high sophistication level, trying to emulate navigational behaviors of legitimate users (humans). Moreover, bot technology continues to evolve which makes bot detection even harder. To deal with this problem, many advanced methods for differentiating bots from humans have been proposed, a large part of which relies on supervised machine learning techniques. In this paper, we propose a novel approach to identify various profiles of bots and humans which combines feature selection and unsupervised learning of HTTP-level traffic patterns to develop a user session classification model. Session clustering is performed with the agglomerative Information Bottleneck (aIB) algorithm, as well as with some other reference algorithms. The model is then used to classify new sessions to one of the profiles and to label the sessions as performed by bots or humans. An extensive experimental study, based on real server log data, demonstrates the ability of aIB clustering to distinguish user profiles and confirms high performance of the classification model in terms of accuracy, F1, recall, and precision.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Identifying legitimate Web users and bots with different traffic profiles — an Information Bottleneck approach

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems

Lead the way for us

Similar Papers

Feature characterization in fMRI data: the Information Bottleneck approach
Bertrand Thirion ... Olivier Faugeras
Medical Image Analysis | VOL. 8
Bertrand Thirion, et. al.Bertrand Thirion ... Olivier Faugeras
13 Oct 2004
Medical Image Analysis | VOL. 8

Applying the information bottleneck to statistical relational learning
Fabrizio Riguzzi ... Nicola Di Mauro
Machine Learning | VOL. 86
Fabrizio Riguzzi, et. al.Fabrizio Riguzzi ... Nicola Di Mauro
10 May 2011
Machine Learning | VOL. 86

Some Question to Monte-Carlo Simulation in AIB Algorithm
Sanming Song ... Yinwei Zhan
-
Sanming Song, et. al.Sanming Song ... Yinwei Zhan
27 May 2008
27 May 2008

The Density Connectivity Information Bottleneck
Yongli Ren ... Gang Li
-
Yongli Ren, et. al.Yongli Ren ... Gang Li
01 Nov 2008
01 Nov 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identifying legitimate Web users and bots with different traffic profiles — an Information Bottleneck approach

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems