Abstract

Processing natural language and extract relevant information in deep technical engineering domain remains an open challenge. 
 On the other side, manufacturers of high-value assets which often deliver product services through the equipment life, supporting maintenance, spare parts management and remote monitoring and diagnostics for issues resolution, have availability of a good amount of textual data containing technical cases with a certain engineering depth.
 This paper presents a case study in which various Artificial Intelligence algorithms were applied to historical technical cases to extract know-how useful to help technicians in approaching new cases.
 Initially the work process and available data are presented; the focus is on the outbound communication delivered from the technical team to the site operators, that is structured in 3 main paragraphs: event description, technical assessment, recommended actions. 
 The work proceeded in two parallel streams: the first concerned the analysis of event descriptions and technical assessments, aiming to detect recurring topics; the second concerned the analysis of recommended actions that technical support delivered trough years to site operators in order to create a library, which can help for enabling statistical data analysis, quality check review and being the starting point for further AI/NLP developments.
 A text preprocessing was applied to both streams, consisted in defining standard and domain entities / stopwords and identifying / removing them, creating acronyms and synonyms maps in order to make context disambiguation, sentence splitting for the recommended actions, and finally text lemmatization. For every text the output of the preprocess was a series of keywords. 
 Then, unsupervised learning algorithms were applied. For this purpose, firstly, we applied feature extraction, bag of words (TF-IDF) and word embeddings (W2V, D2V, BERT), in order to transform our data from language domain into points in a n-features domain. Afterwards, different combinations of unsupervised algorithms were applied to split data into homogeneous groups, such as: LDA, K-means, Spectral, Affinity Propagation and HDBSCAN. 
 The combinations between language modeling and clustering were evaluated using the Silhouette score and visual analysis. 
 To validate the effectiveness, the developed NLP algorithms have been implemented into the current SW application used by technical support to perform the service. Moreover, a dedicated app to show trending topics and retrieve insightful information has been developed. 
 An outlook of the open technical challenges and on the future perspective of NLP applications in the work process are finally delivered.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.