Abstract

Clustering of the web sessions to identify the vis-itors' choices while browsing the web pages, is an important problem in web mining. The sequence of pages viewed by the user in a particular time-frame, i.e., the session, captures his/her interest in a specific topic. Clustering of these sessions is therefore needed to provide customized services to the users having similar interests. In this article, we propose a novel and accurate similarity measure, P <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">sim</sub> , between two web pages and a method of clustering the web sessions using a recently developed Fast Optimal Global Sequence Alignment Algorithm (FOGSAA). FOGSAA is an optimal global alignment algorithm which is used to align the pairs of sessions. It computes the pair-wise distances, which is used to cluster the sessions in similar groups. FOGSAA aligns the sessions in much less time and results in an average time gain of 35.84% over the conventional dynamic programming based Needleman-Wunsch's method, where both are generating the same optimal alignment. Therefore, application of FOGSAA to align the sessions makes the procedure faster and at the same time maintains the quality.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call