Abstract

With the rapid growth of the industrial Internet, data acquisition and real-time computation for industrial equipment have become key concerns for many industrial manufacturing enterprises. This paper therefore studies the design and implementation of a web log analysis system. It constructs a real-time big data acquisition and computation system for industrial equipment based on Spark Streaming, HBase, and Hive. For equipment data acquisition, the k-means algorithm from data mining is used to classify and integrate the data. The k-means algorithm is parallelized on the Spark framework, which addresses the low efficiency of industrial big data analysis. Experiments show that the system effectively improves the efficiency of collecting and analyzing big data logs from industrial equipment, and further improves the timeliness and scalability of transmission.
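
The parallel k-means computation mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation and does not use Spark. It only mimics, in plain Python, the map and reduce decomposition that a data-parallel framework would typically apply to one k-means iteration. The names `map_partition`, `merge`, and `kmeans`, and the representation of partitions as lists of points, are assumptions made for illustration.

```python
from functools import reduce

def nearest(point, centroids):
    """Return the index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centroids[i])),
    )

def map_partition(partition, centroids):
    """Map step: compute per-cluster partial sums and counts for one partition."""
    partial = {}
    for point in partition:
        idx = nearest(point, centroids)
        total, count = partial.get(idx, ([0.0] * len(point), 0))
        partial[idx] = ([t + p for t, p in zip(total, point)], count + 1)
    return partial

def merge(a, b):
    """Reduce step: combine the partial sums and counts of two partitions."""
    merged = dict(a)
    for idx, (total, count) in b.items():
        if idx in merged:
            prev_total, prev_count = merged[idx]
            merged[idx] = ([x + y for x, y in zip(prev_total, total)], prev_count + count)
        else:
            merged[idx] = (total, count)
    return merged

def kmeans(partitions, centroids, iterations=10):
    """Run a fixed number of k-means iterations over partitioned data."""
    for _ in range(iterations):
        # In a real cluster each partition would be processed on a separate worker.
        partials = [map_partition(p, centroids) for p in partitions]
        totals = reduce(merge, partials, {})
        # A cluster that received no points keeps its previous centroid.
        centroids = [
            [t / totals[i][1] for t in totals[i][0]] if i in totals else list(centroids[i])
            for i in range(len(centroids))
        ]
    return centroids
```

Only the small per-cluster sums and counts move between partitions, not the raw points, which is why this step parallelizes well. For example, calling `kmeans([[(1.0, 1.0), (1.0, 2.0)], [(9.0, 9.0), (10.0, 9.0)]], [(0.0, 0.0), (10.0, 10.0)])` returns `[[1.0, 1.5], [9.5, 9.0]]`.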
