Software Entity Recognition Method Based on BERT Embedding

Chao Sun, Mingjing Tang, Li Liang, Wei Zou

doi:10.1007/978-3-030-62463-7_4

Abstract

The global open source software ecosystem contains rich information about software engineering. Existing methods for analyzing the text content of knowledge communities in this field focus mainly on structural relationships and on rule-based association and mining. This paper proposes a software entity recognition method based on BERT word embedding. First, a BiLSTM-CRF model is constructed and combined with word vector embeddings from the software engineering field to form the entity recognition model. Then, the word vectors in the input layer of the model are improved by introducing the BERT pre-trained language model. The pre-training data are constructed from the discussion content of the Stack Overflow software Q&A community, and these data are used to pre-train the BERT model so as to obtain word vector representations suited to the software engineering field. This improves entity recognition in the field and addresses the problem that traditional word vector embeddings are mostly trained on general-domain data, which is not fully suitable for software engineering and cannot represent contextual semantic information well. In addition, because little annotated data exists in the software field, this paper attempts to extend the data by means of model prediction and dictionary matching, and carries out experimental tests. Finally, this paper uses deep learning to realize entity recognition in the field of software engineering, so as to provide support for the extraction of software entities, the construction of software knowledge bases, and intelligent applications of software engineering.

Keywords: Entity recognition, BERT model, Stack Overflow
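The abstract mentions extending the annotated data by dictionary matching but does not describe the procedure. As a rough illustration only, the sketch below labels tokens with BIO tags by longest-match lookup against a small term dictionary. The function name `tag_with_dictionary`, the `SOFTWARE_TERMS` dictionary, and the entity types `LANG` and `TOOL` are all hypothetical and are not taken from the paper.

```python
def tag_with_dictionary(tokens, dictionary):
    """Assign BIO tags to tokens by longest-match dictionary lookup.

    `dictionary` maps a tuple of lowercased tokens to an entity type.
    This is an illustrative assumption, not the paper's procedure.
    """
    max_len = max((len(key) for key in dictionary), default=0)
    lowered = [t.lower() for t in tokens]
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate phrase first so that
        # "visual studio code" wins over "visual studio".
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            key = tuple(lowered[i:i + n])
            if key in dictionary:
                entity_type = dictionary[key]
                tags[i] = "B-" + entity_type
                for j in range(i + 1, i + n):
                    tags[j] = "I-" + entity_type
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return tags


# Hypothetical dictionary of software terms and entity types.
SOFTWARE_TERMS = {
    ("python",): "LANG",
    ("visual", "studio"): "TOOL",
    ("visual", "studio", "code"): "TOOL",
}

tokens = "I use Visual Studio Code with Python".split()
tags = tag_with_dictionary(tokens, SOFTWARE_TERMS)
for token, tag in zip(tokens, tags):
    print(token, tag)
```

Sentences labeled this way could be added to the training set alongside manually annotated ones. Matching is case-insensitive and context-free here, so ambiguous terms would be mislabeled, which may be one reason the abstract pairs dictionary matching with model prediction.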
