Natural Language Processing Techniques for Document Classification in IT Benchmarking - Automated Identification of Domain Specific Terms

Helmut Krcmar, Matthias Pfaff

doi:10.5220/0005462303600366

Abstract

In the domain of IT benchmarking, collected data are often stored as natural language text and are therefore intrinsically unstructured. To ease data analysis and evaluation across different types of IT benchmarking approaches, a semantic representation of this information is crucial. Thus, identifying conceptual (semantic) similarities is the first step in developing integrative data management for this domain. Because an ontology is a specification of such a conceptualization, an association of terms, relations between terms, and related instances must be developed. Building on previous research, we present an approach for automated term extraction using natural language processing (NLP) techniques. Terms are automatically extracted from existing IT benchmarking documents, yielding a domain-specific dictionary. These extracted terms are representative of each document, describe the purpose and content of each file, and serve as a basis for the ontology development process in the domain of IT benchmarking.
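
As an illustration only, the idea of extracting terms that are representative of each document can be sketched with a simple TF-IDF ranking. The abstract does not state which NLP techniques the authors use, so TF-IDF is a stand-in chosen for this sketch; the function names, the stopword list, and the sample documents are all hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real pipeline would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "for", "to", "is", "are", "per", "by", "on", "with"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def extract_terms(documents, top_k=3):
    """Return the top_k highest-scoring TF-IDF terms for each document.

    documents maps a document name to its raw text. This is an assumed
    stand-in for the term extraction step, not the authors' method.
    """
    tokenized = {name: tokenize(text) for name, text in documents.items()}
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for tokens in tokenized.values():
        doc_freq.update(set(tokens))

    result = {}
    for name, tokens in tokenized.items():
        counts = Counter(tokens)
        scores = {
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
        # Sort by descending score, breaking ties alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        result[name] = ranked[:top_k]
    return result


if __name__ == "__main__":
    # Hypothetical sample documents, not taken from the paper.
    docs = {
        "infrastructure.txt": "server costs server uptime benchmark",
        "support.txt": "helpdesk tickets helpdesk benchmark",
    }
    for name, terms in extract_terms(docs, top_k=2).items():
        print(name, terms)
```

Terms that occur in every document receive a score of zero, so they drop out of the per-document lists, while the union of the remaining terms could serve as a first draft of a domain-specific dictionary.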

Publication Date: Jan 1, 2015
Citations: 6
License type: cc-by-nc-nd

Similar Papers

From Natural Language Text to Visual Models: A survey of Issues and Approaches
Cristina-Claudia OSMAN ... Paula-Georgiana ZALHAN
Informatica Economica | VOL. 20
30 Dec 2016

Language Learning Research at the Intersection of Experimental, Computational, and Corpus‐Based Approaches
Patrick Rebuschat ... Detmar Meurers
Language Learning | VOL. 67
01 Jun 2017

Increasing the accessibility of NLP techniques for Defence and Security using a web-based tool
19 Nov 2019
