IntelliExtract: An End-to-End Framework for Chinese Resume Information Extraction from Document Images

Yijing Liu

doi:10.56028/aetr.6.1.570.2023

Abstract

In traditional document processing, manually extracting and organizing the information in a document is labor-intensive, time-consuming, and error-prone. To improve the efficiency and accuracy of document data processing, we develop IntelliExtract, an end-to-end framework for document information extraction. The framework covers image text detection and recognition, information extraction, and intelligent document question answering. It employs recent models and algorithms: OCR models for converting scanned documents into machine-readable text, layout analysis algorithms for understanding the spatial arrangement of document elements, and information extraction techniques for extracting structured data from unstructured documents. To evaluate the effectiveness of the framework, we conducted experiments on a Chinese Talent Resumes Dataset and visualized the results. For named entity extraction, the confidence level of the results extracted from the text in the images is generally above 0.95. The proposed framework provides a powerful tool for enterprises, educational institutions, and other entities that process document information, and it holds promise for significant practical applications.
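The stages named in the abstract (text recognition, layout analysis, entity extraction) can be pictured as a simple chain of functions. The following is a minimal sketch of that flow, not the IntelliExtract implementation: every name in it (`TextLine`, `Entity`, `run_pipeline`, `stub_ocr`, `reading_order`, `stub_extract`) is hypothetical, the OCR and extraction stages are replaced by hard-coded stubs, and using 0.95 as a filtering threshold is an assumption made for illustration (the abstract reports it only as an observed confidence level).

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class TextLine:
    text: str
    box: Tuple[int, int, int, int]  # (x0, y0, x1, y1) in image pixels


@dataclass
class Entity:
    label: str
    value: str
    confidence: float


def run_pipeline(
    image,
    ocr: Callable[[object], List[TextLine]],
    order_lines: Callable[[List[TextLine]], List[TextLine]],
    extract: Callable[[List[TextLine]], List[Entity]],
    min_confidence: float = 0.95,  # assumed threshold, not from the paper
) -> List[Entity]:
    """Chain the three stages and keep only sufficiently confident entities."""
    lines = ocr(image)              # stage 1: text detection and recognition
    ordered = order_lines(lines)    # stage 2: layout analysis (reading order)
    entities = extract(ordered)     # stage 3: information extraction
    return [e for e in entities if e.confidence >= min_confidence]


def reading_order(lines: List[TextLine]) -> List[TextLine]:
    # Simplistic layout step: sort top-to-bottom, then left-to-right.
    return sorted(lines, key=lambda line: (line.box[1], line.box[0]))


def stub_ocr(image) -> List[TextLine]:
    # Hypothetical stand-in for a real OCR model; ignores the image.
    return [
        TextLine("学历: 硕士", (10, 40, 200, 60)),
        TextLine("姓名: 张三", (10, 10, 200, 30)),
    ]


def stub_extract(lines: List[TextLine]) -> List[Entity]:
    # Hypothetical rule-based stand-in for a learned extraction model.
    labels = {"姓名": "NAME", "学历": "EDUCATION"}
    found = []
    for line in lines:
        key, sep, value = line.text.partition(":")
        if sep and key.strip() in labels:
            found.append(Entity(labels[key.strip()], value.strip(), 0.99))
    return found


result = run_pipeline(None, stub_ocr, reading_order, stub_extract)
for entity in result:
    print(entity.label, entity.value, entity.confidence)
```

Passing the stages in as callables keeps the sketch independent of any particular OCR or extraction library; a real system would substitute trained models for the stubs.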

More From: Advances in Engineering Technology Research

Similar Papers

Information extraction framework for Kurunthogai
C N Subalalitha
Sādhanā | VOL. 44
05 Jun 2019

Ontology-based Sequence Labelling for Automated Information Extraction for Supporting Bridge Data Analytics
Kaijian Liu ... Nora El-Gohary
Procedia Engineering | VOL. 145
01 Jan 2015

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Jiapeng Wang ... Lianwen Jin
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
18 May 2021

QA4IE: A Question Answering Based Framework for Information Extraction
Lin Qiu ... Lihua Qian
01 Jan 2018
