Abstract

The aim of this study is to improve web log mining and online navigation pattern prediction. Web mining is an active and wide area which incorporates several usages for the web site design, providing personalization server and other business making decisions etc. Efficient web log mining results and online navigational pattern prediction is a tough process due to vast development in web. It includes the process such as data cleaning, session identification and clustering of web logs generally. In this study initially the web log data is preprocessed and sessions are identified using refined time-out based heuristic for session identification. Then for pattern discovery a density based clustering algorithm is used. Finally for online navigation pattern prediction a new technique of SVM classification is used, which rectifies time complexity with increased prediction accuracy.

Highlights

  • World Wide Web is a huge repository of web pages and links

  • Web data mining is the application of data mining techniques in web data

  • Web Usage Mining applies mining techniques in log data to extract the behavior of users which is used in various applications like personalized services, adaptive web sites, customer profiling, prefetching, creating attractive web sites etc

Read more

Summary

Introduction

World Wide Web is a huge repository of web pages and links It provides abundance of information for the Internet users. Web data mining is the application of data mining techniques in web data It automatically discovers and extracts information from Web documents and services (Etzioni, 1996). Web Usage Mining applies mining techniques in log data to extract the behavior of users which is used in various applications like personalized services, adaptive web sites, customer profiling, prefetching, creating attractive web sites etc. It consists of main three categories, Web usage mining, Web structure mining and Web content mining. Web content mining aims to extract/mine useful information or knowledge from Web page contents

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call