Abstract

As the information available on the World Wide Web grows day by day, access to websites grows with it, producing huge amounts of web log data (also called web usage data). Discovering and analyzing useful information in these web logs has become a practical necessity. Frequent access patterns, the sequences of accesses that users pursue frequently, are one kind of interesting and useful knowledge in practice. Web access pattern tree (WAP-tree) mining is a frequent pattern mining technique for web log access sequences. It first stores the original web access sequence database in a prefix tree, similar to the frequent pattern tree (FP-tree) used for storing non-sequential data. The WAP-tree algorithm then mines the frequent sequences from the WAP-tree by recursively reconstructing intermediate trees, starting with suffix sequences and ending with prefix sequences. In this paper, we propose a more efficient algorithm named eWAP-mine (enhanced web access pattern mining algorithm), which is based directly on the initial conditional web access sequence base (1-CWASD) of each frequent event and eliminates the need to reconstruct intermediate conditional WAP-trees.
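To make the two structures named above more concrete, the following is a minimal sketch, not the eWAP-mine algorithm itself. It assumes that support is counted as the number of sequences containing an event, and it builds a count-annotated prefix tree over the frequent events, then collects the prefix paths that precede one event as a simplified conditional sequence base. The names Node, frequent_events, build_wap_tree and conditional_base, and the sample sequences, are illustrative and do not come from the paper. The sketch also omits the count corrections that WAP-tree mining applies when an event repeats within one path.

```python
from collections import Counter


class Node:
    """One node of the prefix tree: an event label plus an occurrence count."""

    def __init__(self, event, parent=None):
        self.event = event
        self.count = 0
        self.parent = parent
        self.children = {}


def frequent_events(sequences, min_support):
    """Return events that occur in at least min_support sequences."""
    support = Counter()
    for seq in sequences:
        support.update(set(seq))  # count each event once per sequence
    return {e for e, c in support.items() if c >= min_support}


def build_wap_tree(sequences, min_support):
    """Insert each sequence, restricted to frequent events, into a prefix tree."""
    frequent = frequent_events(sequences, min_support)
    root = Node(None)
    for seq in sequences:
        node = root
        for event in seq:
            if event not in frequent:
                continue  # infrequent events are dropped before insertion
            child = node.children.get(event)
            if child is None:
                child = Node(event, node)
                node.children[event] = child
            child.count += 1
            node = child
    return root


def conditional_base(root, event):
    """Map each prefix path that precedes `event` to its accumulated count.

    Simplification (assumption of this sketch): no negative corrections
    are added for paths in which `event` occurs more than once.
    """
    base = Counter()
    stack = [root]
    while stack:
        node = stack.pop()
        stack.extend(node.children.values())
        if node.event == event:
            prefix = []
            p = node.parent
            while p is not None and p.event is not None:
                prefix.append(p.event)
                p = p.parent
            base[tuple(reversed(prefix))] += node.count
    return base


# Illustrative access sequences; each letter stands for one page access.
sequences = ["abdac", "eaebcac", "babfaec", "afbacfc"]
root = build_wap_tree(sequences, min_support=3)
base_c = conditional_base(root, "c")
```

An approach that works from such a conditional base directly, as the abstract describes for eWAP-mine, would continue mining from base_c instead of rebuilding a new tree at each recursion step.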
