Abstract

Today web mining is a challenging task in organization. Every organization generated vast amount of data from various source. Web mining is the process of extracting useful knowledge from web resources. Log files are maintained by the web server. The challenging task for E-commerce companies is to know their customer behavior to improve the business by analyzing web log files. E-commerce website can generate tens of peta bytes of data in their web log files. This paper discuss about the importance of log files in E-commerce world. The analysis of log files is used for learning the user behavior in E-commerce system. The analysis of such large web log files need parallel processing and reliable data storage system. The Hadoop framework provides reliable storage by Hadoop Distributed File System and parallel processing system for large database using MapReduce programming model. These mechanisms help to process log data in parallel manner and computes results efficiently. This approach reduces the response time as well as load on the end system. This work proposes apredictive prefetching system based on preprocessing of web logs using HadoopMapReduce, which will provide accurate results in minimum response time for E-commerce business activities.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.