Abstract

In this research, we will talk about the characterization of web information and its order and taking about the records (logs) maintained by the server. Server log documents are fundamentally the ASCII content records which contain the log record of users. The research work is a comparative analysis between web based log formats pre-fetching using two main techniques, i.e. Apriori and FP Growth so that user's navigational behavior can be extracted easily and efficiently. To filter spam with conventional strategies as dark white records (url, IP addresses, mailing information) is practically unimaginable. Use of content mining strategies to a web logs can raise proficiency of a filtration of spam.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call