Abstract

Traffic accident case named entity recognition, which helps mine key information in traffic accident texts, plays a vital role in downstream tasks such as the construction of knowledge graphs in road traffic and intelligent policing. In this paper, we construct a named entity recognition model based on the EDE (Entity Data Enhancement)-ERNIE-Bidirectional Gated Recurrent Unit Network (BiGRU)-Conditional Random Field (CRF) to address the current situation of low traffic accident case data and poor recognition of long-text entities. First, the amount of accident case data is enhanced using the entity random substitution method. Next, the text data of traffic accident cases are characterized as a dynamic word vector using the ERNIE pretraining model. Then, the BiGRU network learns the long-distance dependency relationship in the text to enhance the effect of the model on long-text entity recognition. Finally, the result sequence is constrained by the CRF layer to realize the named entity recognition model. The experimental part uses data related to real traffic accident cases in a domestic area. The data enhancement method increases the data volume three times compared to the original data volume. Experimental results show that the EDE-ERNIE-BiGRU-CRF model achieves better F1 values, recall and precision achieved better performance than the entity recognition methods of BERT-BiGRU-CRF, ERNIE-BiGRU-CRF, ERNIE-BiLSTM-CRF, ERNIE-CRF, ERNIE, BiGRU-CRF, ROBERTA-wwm-ext-BiGRU-CRF and verify its effectiveness for entity recognition in traffic accident cases.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call