Abstract

In this work, we propose a semi-supervised boot-strapping approach for relation extraction in domain specific texts, specifically focusing on agricultural domain. Our approach utilizes the BERT model with dependency parsing for relation extraction. The proposed model, focuses on identifying five inter subdomain relations viz., Soil_Location, Soil_Crop, Disease_Pathogen, Pathogen_Crop, and Chemical_Crop. We created a corpus of 30,000 sentences extracted from recognised agriculture sites to evaluate the model. The labeled relations were then manually checked to evaluate the prediction accuracy. We used a test corpus with 700 sentences that included 3500 triplets for the evaluation. The proposed approach presents an average macro F -Score of 86.4 %, which is quite promising for semi-supervised domain specific relation extraction systems. Experimental results show the efficacy of the proposed approach in classifying relational phrases in a semi-supervised set-up for the agricultural domain.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.