Dynamic Web log session identification with statistical language models

Xiangji Huang,Aijun An,Fuchun Peng,Dale Schuurmans

doi:10.1002/asi.20084

Abstract

AbstractWe present a novel session identification method based on statistical language modeling. Unlike standard timeout methods, which use fixed time thresholds for session identification, we use an information theoretic approach that yields more robust results for identifying session boundaries. We evaluate our new approach by learning interesting association rules from the segmented session files. We then compare the performance of our approach to three standard session identification methods—the standard timeout method, the reference length method, and the maximal forward reference method—and find that our statistical language modeling approach generally yields superior results. However, as with every method, the performance of our technique varies with changing parameter settings. Therefore, we also analyze the influence of the two key factors in our language‐modeling–based approach: the choice of smoothing technique and the language model order. We find that all standard smoothing techniques, save one, perform well, and that performance is robust to language model order.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic Web log session identification with statistical language models

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science and Technology

Lead the way for us

Journal: Journal of the American Society for Information Science and Technology	Publication Date: Aug 13, 2004
Citations: 92

Similar Papers

Applying language modeling to session identification from database trace logs
Xiangji Huang ... Qingsong Yao
Knowledge and Information Systems | VOL. 10
Xiangji Huang, et. al.Xiangji Huang ... Qingsong Yao
24 Mar 2006
Knowledge and Information Systems | VOL. 10

Statistical feature language model
Salma Jamoussi ... Kamel Smaili
-
Salma Jamoussi, et. al.Salma Jamoussi ... Kamel Smaili
04 Oct 2004
04 Oct 2004

Exploring the language modeling toolkits for Arabic text
Fawaz S Al-Anzi ... Dia Abuzeina
-
Fawaz S Al-Anzi, et. al.Fawaz S Al-Anzi ... Dia Abuzeina
01 Nov 2017
01 Nov 2017

Modelo Acústico y de Lenguaje del Idioma Español para el dialecto Cucuteño, Orientado al Reconocimiento Automático del Habla
Juan David Celis Nuñez ... Rodrigo Andres Llanos Castro
Ingeniería | VOL. 22
Juan David Celis Nuñez, et. al.Juan David Celis Nuñez ... Rodrigo Andres Llanos Castro
12 Sep 2017
Ingeniería | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic Web log session identification with statistical language models

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science and Technology