Assessment of Information Extraction Techniques, Models and Systems

Atta-Ur Rahman,Mehwash Farooqui,Mohammed Gollapalli,Dakheel Almoqbil,Mohammad Aftab Alam Khan,Dhiaa Musleh,Majed Nabil,Mohammed Imran Basheer Ahmed,Mohammed Salih Ahmed,Haya Alubaidan,Maqsood Mahmud,Gomathi Krishnasamy

doi:10.18280/mmep.090315

Atta-Ur Rahman, Mehwash Farooqui + Show 10 more

Open Access

PDF Available

https://doi.org/10.18280/mmep.090315

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

The present article aims to review and evaluate the practiced and classical techniques, tools, models, and systems concerning automatic information extraction (IE) from published scientific documents like research articles, patents, theses, technical reports, and case studies etc. IE is performed for various reasons such as better indexing, archiving, searching, and retrieving. That is mainly used by the search engines and the indexing services as well the digital libraries and semantic web. In this regard, several studies have been conducted targeting various nature of documents. The study pays special consideration to the successful IE models, algorithms and approaches applied to structural IE from published documents. To grasp this, the paper is classified into several segments and each segment covers a significant aspect of IE. Furthermore, to validate their benefits and drawbacks, a comparative study of all the approaches have been conducted in terms of various performance factors like precision, accuracy, recall and F-score. Potential areas of improvement have been emphasized as research gap for the scholars in the closely related areas. Ultimately, a comprehensive summary of the evaluation is presented in tabular form and review is concluded. It was observed that the hybrid methods outperform the other methods due to their versatile nature to address various document formats.

Full Text