Abstract
In the smart grid environment, huge volumes of data will be accumulated from the condition monitoring system of power equipment. Using the traditional centralized storage architecture and relational databases, the performance of data querying and processing is slow, and cannot meet real-time requirements of the power equipment condition monitoring system. Meanwhile, MapReduce is a desirable parallel programming platform that is widely applied in kinds of data process fields. In this paper, a case-study on distributed storage using HBase and parallel processing of insulator leakage current data is presented. We propose efficient MapReduce based algorithms for parallel join query, parallel characteristics extraction and analogous assessment of insulator contamination degree. We evaluate our work on real large scale datasets utilizing Hadoop platform. Results reveal that the speedup and scale-up of our work are competent.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.