Abstract

Big data platform for equipment condition assessment is built for comprehensive analysis. The platform has various application demands. According to its response time, its application can be divided into offline, interactive and real-time types. For real-time application, its data processing efficiency is important. In general, data cache is one of the most efficient ways to improve query time. However, big data caching is different from the traditional data caching. In the paper we propose a distributed cache management framework of big data for equipment condition assessment. It consists of three parts: cache structure, cache replacement algorithm and cache placement algorithm. Cache structure is the basis of the latter two algorithms. Based on the framework and algorithms, we make full use of the characteristics of just accessing some valuable data during a period of time, and put relevant data on the neighborhood nodes, which largely reduce network transmission cost. We also validate the performance of our proposed approaches through extensive experiments. It demonstrates that the proposed cache replacement algorithm and cache management framework has higher hit rate or lower query time than LRU algorithm and round-robin algorithm.

Highlights

  • With the rapid development of power grid and the explosion of transmission and transformation equipment, massive and heterogeneous data are generated and separately collected into different business information systems during grid operation and equipment monitoring

  • We introduce the three part of proposed distributed cache management framework: cache structure, cache replacement algorithm and cache placement algorithm

  • In the section we propose a Cache Replacement algorithm based on periodic Value (CRV)

Read more

Summary

Introduction

With the rapid development of power grid and the explosion of transmission and transformation equipment, massive and heterogeneous data are generated and separately collected into different business information systems during grid operation and equipment monitoring. In order to make full use of various data from different systems, we build a big data platform for equipment condition assessment. It connects production management system (PMS), energy management system (EMS), condition monitoring system of transmission and transformation equipment, geographic information system (GIS) and meteorlogical information system, and is used to research dynamic evaluation of load capacity, fault diagnosis, condition evaluation and risk assessment [1]. We need to optimize big data processing performance to improve data analyzing efficiency. We could put important and frequently accessed data into cache. If data A and B are stored on one node, network transmission is avoided and processing efficiency is high. After we get the cached data, a method of placing them on proper nodes needs designing

Related work
Cache management framework of big data
Cache structure
Cache replacement algorithm
Cache placement algorithm
Performance evaluation
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call