Preprocessing: A Prerequisite for Discovering Patterns inWeb Usage Mining Process

C Ramya

doi:10.7763/ijiee.2013.v3.297

Abstract

Web log data is usually diverse and voluminous. This data must be assembled into a consistent, integrated and comprehensive view, in order to be used for pattern discovery. Without properly cleaning, transforming and structuring the data prior to the analysis, one cannot expect to find meaningful patterns. As in most data mining applications, data preprocessing involves removing and filtering redundant and irrelevant data, removing noise, transforming and resolving any inconsistencies. In this paper, a complete preprocessing methodology having merging, data cleaning, user/session identification and data formatting and summarization activities to improve the quality of data by reducing the quantity of data has been proposed. To validate the efficiency of the proposed preprocessing methodology, several experiments are conducted and the results show that the proposed methodology reduces the size of Web access log files down to 73-82% of the initial size and offers richer logs that are structured for further stages of Web Usage Mining (WUM). So preprocessing of raw data in this WUM process is the central theme of this paper.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Preprocessing: A Prerequisite for Discovering Patterns inWeb Usage Mining Process

Abstract

Talk to us

Similar Papers

More From: International Journal of Information and Electronics Engineering

Lead the way for us

Journal: International Journal of Information and Electronics Engineering	Publication Date: Jan 1, 2013
Citations: 3

Similar Papers

A Fuzzy Set Theoretic approach to discover user sessions from web navigational data
Zahid Ansari ... Mohammad Fazle Azeemz
-
Zahid Ansari, et. al.Zahid Ansari ... Mohammad Fazle Azeemz
01 Sep 2011
01 Sep 2011

A community detection algorithm for Web Usage Mining systems
Yacine Slimani ... Abdelouahab Moussaoui
-
Yacine Slimani, et. al.Yacine Slimani ... Abdelouahab Moussaoui
01 Nov 2011
01 Nov 2011

Web Usage Mining Using Support Vector Machine
Sung-Hae Jun
-
Sung-Hae JunSung-Hae Jun
01 Jan 2004
01 Jan 2004

Review on modern Data Preprocessing techniques in Web usage mining (WUM)
P Sukumar ... S Yuvaraj
-
P Sukumar, et. al.P Sukumar ... S Yuvaraj
01 Oct 2016
01 Oct 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Preprocessing: A Prerequisite for Discovering Patterns inWeb Usage Mining Process

Abstract

Talk to us

Similar Papers

More From: International Journal of Information and Electronics Engineering