Abstract

Due to digitalization in today's modern era log file analysis has become a necessary task to track user behavior and get important knowledge based on it. Log files are generated at massive rate and to analyze them is tedious task. In order to analyze large dataset user need effective solution for integrating the data and also parallel processing for that spark's machine learning is used which will give power to run machine learning algorithm in distributed environment on large volume dataset (Big Data). Keywords— Analytical Engine, Hadoop, HDFS, Machine learning, Spark,

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call