Abstract
According to the CHAOS report from Standish Group during 1992–2017, the degree of success of projects in the development of software intensive systems (Software Intensive Systems, SIS) has changed insignificantly, remaining at the level of 50% inconsistency with the initial requirements (finance, time and functionality) for medium-sized projects. The annual financial losses in the world due to the total failures are of the order of hundreds of billion dollars. The majority of information about software projects has textual representation. Analysis of this information is vital for project status understanding, revealing problems on the early stage. Nowadays the majority of tasks in NLP field are solved by means of neural network language models. These models already have shown state-of-the-art results in classification, translation, named entity recognition, and so on. Pre-trained models are accessible in the internet, but the real life problem domain could differ from the origin domain where the network was learned. In this paper an approach to vocabulary expansion for neural network language model by means of hierarchical clustering is presented. This technique allows one to adopt pre-trained language model to a different domain.
Full Text
Topics from this Paper
Pre-trained Model
Neural Network Language Model
Development Of Software Intensive Systems
Annual Financial Losses
Neural Language
+ Show 5 more
Create a personalized feed of these topics
Get StartedTalk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Similar Papers
Jan 9, 2023
Apr 19, 2021
Apr 19, 2021
Dec 16, 2021
Jan 1, 2021
IEEE Transactions on Audio, Speech, and Language Processing
Nov 1, 2017
Jan 1, 2022
BMC Bioinformatics
May 26, 2021
Applied Soft Computing
Dec 1, 2021
Jan 1, 2020
Jan 1, 2022
Jan 1, 2020
Jan 1, 2022
May 11, 2022