Abstract

In the present paper, a framework for the continued supporting a systematic literature review (SLR) is proposed, which includes the application of text mining methods in order to automate the classification of scientific publications and the more in-depth analysis of their content. For this purpose, a dataset is created from the titles, abstracts and keywords of papers, included in a systematic literature review on the application of semantic technologies in bibliographic databases. Data analytics methods are applied - frequency analysis of words and word combinations; linear regression for trend exploration; text classification, where the categories are the applied semantic technologies or the researched problems in accordance with a pre-defined classification framework. The vector space model enriched with PMI (pointwise mutual information) measure is used for the classification. An assessment of the text classification performance in terms of various measures is made and the obtained results are summarized.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call