In order to improve the analysis and processing speed of power system fault text in actual business scenarios, a power fault text information automatic extraction model based on pre-training and multi-task learning is proposed. The pre-training model is used to learn the context information of power text words, and the first-order and second-order fusion features of words are mined to enhance the representation ability of features. The multi-task learning framework is used to combine the learning of named entity recognition and relationship extraction tasks to achieve mutual complementation and mutual promotion between entity recognition and relationship extraction, thereby improving the performance of power fault text information extraction. Finally, the model is verified by the daily business data of a power grid data center. Compared with other models, the accuracy and recall rate of power fault text entity recognition and relationship extraction are improved.
Read full abstract