A Two-Step Resume Information Extraction Algorithm

Jie Chen,Zhendong Niu,Chunxia Zhang

doi:10.1155/2018/5761287

Abstract

With the rapid growth of Internet-based recruiting, there are a great number of personal resumes among recruiting systems. To gain more attention from the recruiters, most resumes are written in diverse formats, including varying font size, font colour, and table cells. However, the diversity of format is harmful to data mining, such as resume information extraction, automatic job matching, and candidates ranking. Supervised methods and rule-based methods have been proposed to extract facts from resumes, but they strongly rely on hierarchical structure information and large amounts of labelled data, which are hard to collect in reality. In this paper, we propose a two-step resume information extraction approach. In the first step, raw text of resume is identified as different resume blocks. To achieve the goal, we design a novel feature, Writing Style, to model sentence syntax information. Besides word index and punctuation index, word lexical attribute and prediction results of classifiers are included in Writing Style. In the second step, multiple classifiers are employed to identify different attributes of fact information in resumes. Experimental results on a real-world dataset show that the algorithm is feasible and effective.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mathematical Problems in Engineering	Publication Date: Jan 1, 2018
Citations: 29	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Two-Step Resume Information Extraction Algorithm

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering

Lead the way for us

Similar Papers

Classification of Hyperspectral Images by SVM Using a Composite Kernel by Employing Spectral, Spatial and Hierarchical Structure Information
Yi Wang ... Hexiang Duan
Remote Sensing | VOL. 10
Yi Wang, et. al.Yi Wang ... Hexiang Duan
12 Mar 2018
Remote Sensing | VOL. 10

A Tree Regularized Classifier—Exploiting Hierarchical Structure Information in Feature Vector for Human Action Recognition
...
KSII Transactions on Internet and Information Systems | VOL. 11
, et. al. ...
31 Mar 2017
KSII Transactions on Internet and Information Systems | VOL. 11

Neighboring Envelope Embedded Stacked Autoencoder for Deep Learning on Hierarchically Structured Samples
Chuanyan Zhou ... Fan Li
IEEE Transactions on Artificial Intelligence | VOL. 5
Chuanyan Zhou, et. al.Chuanyan Zhou ... Fan Li
01 Feb 2024
IEEE Transactions on Artificial Intelligence | VOL. 5

Image quality assessment based on structural saliency
Ziran Zhang ... Jianhua Zhang
-
Ziran Zhang, et. al.Ziran Zhang ... Jianhua Zhang
01 Aug 2014
01 Aug 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Two-Step Resume Information Extraction Algorithm

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering