Exploring Natural Language Processing in Model-To-Model Transformations

Paulius Danenas,Tomas Skersys

doi:10.1109/access.2022.3219455

Paulius Danenas, Tomas Skersys

Open Access

https://doi.org/10.1109/access.2022.3219455

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 3	License type: CC BY 4.0

Affiliation: Kaunas University of Technology

Abstract

In this paper, we explore the possibility to apply natural language processing in visual model-to-model (M2M) transformations. Therefore, we present our research results on information extraction from text labels in process models modeled using Business Process Modeling Notation (BPMN) and use case models depicted in Unified Modeling Language (UML) using the most recent developments in natural language processing (NLP). In this paper, we focus on three relevant tasks, namely, the extraction of verb/noun phrases that would be used to form relations, parsing of conjunctive/disjunctive statements, and the detection of abbreviations and acronyms. Relation extraction was attempted to solve by implementing techniques that combine state-of-the-art NLP language models with formal regular expressions grammar-based structure detection. In this paper, we perform thorough testing of the most recent state-of-the-art NLP tools (CoreNLP, Stanford Stanza, Flair, Spacy, AllenNLP, BERT, ELECTRA), as well as custom BERT-BiLSTM-CRF and ELMo-BiLSTM-CRF implementations, trained with certain data augmentations to improve performance on the most ambiguous cases; these tools are used as a foundation for building tools to extract noun and verb phrases from short text labels generally used in UML and BPMN models. Furthermore, we describe our attempts to improve these extractors by solving the abbreviation/acronym detection problem using machine learning-based detection, as well as process conjunctive and disjunctive statements, due to their relevance to performing advanced text normalization. The obtained results show that the best phrase extraction and conjunctive phrase processing performance was obtained using Stanza based implementation, yet, our trained BERT-BiLSTM-CRF outperformed it for the verb phrase detection task. Our acronym detection approach resulted in the precision of 0.78 and F1-Score of 0.73 which may also be considered quite positive. While this work was inspired by our ongoing research on partial model-to-model transformations, we believe it to be applicable in other areas requiring similar text processing capabilities as well.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring Natural Language Processing in Model-To-Model Transformations

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

A business process re-engineering approach to transform business process simulation to BPMN model.
Reema Choudhary ... Nauman Riaz
PLOS ONE | VOL. 18
Reema Choudhary, et. al.Reema Choudhary ... Nauman Riaz
15 Mar 2023
PLOS ONE | VOL. 18

Research on Smart Contract Optimization Method on Blockchain
Wen Hu ... Zhipeng Fan
IT Professional | VOL. 21
Wen Hu, et. al.Wen Hu ... Zhipeng Fan
01 Sep 2019
IT Professional | VOL. 21

Business Process Models to Web Services Generation: A Systematic Literature Review
Iqra Zafar ... Muhammad Waseem Anwar
-
Iqra Zafar, et. al.Iqra Zafar ... Muhammad Waseem Anwar
01 Nov 2018
01 Nov 2018

From business process models to process-oriented software systems
Chun Ouyang ... Jan Mendling
ACM Transactions on Software Engineering and Methodology | VOL. 19
Chun Ouyang, et. al.Chun Ouyang ... Jan Mendling
01 Aug 2009
ACM Transactions on Software Engineering and Methodology | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring Natural Language Processing in Model-To-Model Transformations

Abstract

Talk to us

Similar Papers

More From: IEEE Access