Abstract

Power grid enterprises hold large volumes of defect texts that carry important reliability information. Mining precise information from these texts is inefficient, especially when they are written in Chinese. This paper therefore proposes a defect text mining technique based on a modified semantic framework. All texts are translated into English and processed with a text mining model built on the modified semantic framework, which maps each defect text to a fixed pattern so that the digital information can be extracted accurately. Taking the transformer as an example, the first step is to establish the ontology dictionary, segment the sentences, and extract the text features. The modified power semantic framework and its semantic slots are then defined, and the slot-filling method and the framework construction process are discussed; this process can automatically refine the ontology dictionary by merging word sequences. Finally, the mining results are applied to reliability statistics, and the results show that the proposed model and method are feasible and effective for the automatic classification and statistics of grid defects.
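As a rough illustration of the slot-filling idea summarized above, the following minimal Python sketch fills a small semantic frame from an English defect text using dictionary lookup and then tallies defects for simple statistics. The slot names, the dictionary entries, and the functions `fill_slots` and `defect_counts` are assumptions made for this example only; they are not the ontology dictionary or the algorithm of the paper, and the dictionary-merging step is not shown.

```python
import re
from collections import Counter

# Hypothetical ontology dictionary: slot name -> known terms (illustrative only).
ONTOLOGY = {
    "device": ["main transformer", "transformer"],
    "component": ["bushing", "oil tank", "cooler", "tap changer"],
    "defect": ["oil leakage", "overheating", "abnormal noise", "crack"],
}


def fill_slots(text):
    """Fill each slot with the longest dictionary term found in the text."""
    lowered = text.lower()
    frame = {}
    for slot, terms in ONTOLOGY.items():
        # Whole-word match so that short terms do not fire inside longer words.
        matches = [
            t for t in terms
            if re.search(r"\b" + re.escape(t) + r"\b", lowered)
        ]
        frame[slot] = max(matches, key=len) if matches else None
    return frame


def defect_counts(records):
    """Count (component, defect) pairs over all records that name a defect."""
    frames = [fill_slots(r) for r in records]
    return Counter(
        (f["component"], f["defect"]) for f in frames if f["defect"]
    )
```

Calling `fill_slots` on a single defect record returns one frame with a value, or `None`, for each slot, and `defect_counts` aggregates such frames over a list of records. Preferring the longest match is one simple way to keep a specific term such as "main transformer" from being shadowed by the more general "transformer".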
