Abstract

Joint entity recognition and relation extraction are complex in natural language processing. It is essential in information extraction and can be applied to knowledge graph construction question-answering systems. Existing problems in the agricultural text processing field include low text utilization, harrowing entity recognition relationship extraction, and low accuracy. To improve the utilization rate of the agricultural text and implement joint entity recognition and relation extraction of agricultural texts, this study constructs the agricultural text entity-relationship dataset AgriRE by collecting existing agricultural texts from the Internet and defining rules for corpus annotation. The AgriRE dataset sets up the primary entity and six types of relationships: alias, damaged position, genus, family, distribution area, and damaged crops. The dataset contains 177454 data samples, includes 1798 agricultural entities, and 12789 agricultural relationships. Based on the AgriRE dataset, this study proposes a joint entity recognition and relation extraction model named RoBERT-Agr based on the combination of RoBERTa, WWM and CRF algorithms. The model is used to realize the mutual entity recognition and relation extraction. The experimental results show that the method based on the RoBERT-Agr model has the highest F1 score compared with the existing advanced models. The model’s classification accuracy can reach 96.18%, and the F1 score on the AgriRE test set is 95.72% by training, verifying, and testing the model.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.