Semantic Enrichment of Taxonomy for BI Applications using Multifaceted data sources through NLP techniques

Muhammad Arslan,Christophe Cruz

doi:10.1016/j.procs.2022.09.533

Muhammad Arslan, Christophe Cruz

Open Access

https://doi.org/10.1016/j.procs.2022.09.533

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Taxonomies are crucial for executing Business Intelligence (BI) applications by preventing users from being overwhelmed with information. The business applications require knowledge of the key concepts and their organization to perform the classification of news articles. To ensure and maintain reliable information classification quality, it is crucial to keep the same definitions and organization of these concepts. This indicates the major significance of BI taxonomies in organizations. However, their development in business information systems follows an ad hoc process in most cases. Compared to many other domains, e.g. environmental and life sciences research, no mature and updated BI taxonomies are available in the literature. Existing studies cover BI taxonomies, but these are excessively generic and domain-specific. As a result, the BI domain suffers from many immature, incorrect, and incomplete notions of concepts. New BI-related concepts emerge rapidly, making it essential to include them in existing taxonomies during the enrichment process. The contribution of our research is the exploration of the possibilities of taxonomy enrichment using existing datasets. The expansion of the existing business taxonomy using multifaceted data sources to capture new concepts comprising 1) lexical datasets, 2) pre-trained word embeddings, 3) linked open data vocabulary, and 4) corpus-based relevant thematic extraction of features from news articles using Natural Language Processing (NLP) techniques. The highest semantic enrichment rate of a taxonomy got on a combination of these 4 methods. Eventually, enriched business taxonomy will contribute to the improved classification of news articles.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Procedia Computer Science	Publication Date: Jan 1, 2022
Citations: 3	License type: cc-by-nc-nd

R Discovery Prime

Semantic Enrichment of Taxonomy for BI Applications using Multifaceted data sources through NLP techniques

Abstract

Published Version

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Similar Papers

The Comprehension of Figurative Language: What Is the Influence of Irony and Sarcasm on NLP Techniques?
Leila Weitzel ... Ronaldo Cristiano Prati
-
Leila Weitzel, et. al.Leila Weitzel ... Ronaldo Cristiano Prati
01 Jan 2015
01 Jan 2015

A COMPARATIVE STUDY OF STATISTICAL AND NATURAL LANGUAGE PROCESSING TECHNIQUES FOR SENTIMENT ANALYSIS
Wai-Howe Khong ... Hui-Ngo Goh
Jurnal Teknologi | VOL. 77
Wai-Howe Khong, et. al.Wai-Howe Khong ... Hui-Ngo Goh
26 Nov 2015
Jurnal Teknologi | VOL. 77

A comprehensive investigation of natural language processing techniques and tools to generate automated test cases
Imran Ahsan ... Wasi Haider Butt
-
Imran Ahsan, et. al.Imran Ahsan ... Wasi Haider Butt
22 Mar 2017
22 Mar 2017

Natural Language Processing Utilisation in Healthcare
S Vani ... T Tangarasan
-
S Vani, et. al.S Vani ... T Tangarasan
04 Feb 2022
04 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Semantic Enrichment of Taxonomy for BI Applications using Multifaceted data sources through NLP techniques

Abstract

Published Version

Talk to us

Similar Papers

More From: Procedia Computer Science