Abstract

The selection of a suitable job applicant from the pool of thousands applications is often daunting job for an employer. The categorization of job applications submitted in form of Resumes against available vacancy(s) takes significant time and efforts of an employer. Thus, Resume Classification System (RCS) using the Natural Language Processing (NLP) and Machine Learning (ML) techniques could automate this tedious process. Moreover, the automation of this process can significantly expedite and transparent the applicants’ screening process with mere human involvement. This experimental study presents an automated NLP and ML-based RCS that classifies the Resumes according to job categories with performance guarantees. This study employs various ML algorithms and NLP techniques to measure the accuracy of RCS and proposes a solution with better accuracy and reliability in different settings. To demonstrate the significance of NLP and ML techniques for RCS, the extracted features were evaluated on nine ML classification models namely Support Vector Machine - SVM (Linear, SGD, SVC and NuSVC), Naïve Bayes (Bernoulli, Multinomial & Gaussian), K-Nearest Neighbor (KNN), and Logistic Regression (LR). The Term-Frequency-Inverse-Document-Frequency (TF-IDF) feature representation scheme was proved suitable for RCS. The developed models were evaluated using the Confusion Matrix, F-Score, Recall, Precision, and overall Accuracy. The experimental results indicate that using the One-Vs-Rest-Classification strategy for this multi-class Resume classification task, the SVM class of Machine Learning classifiers performed better on the study dataset of over nine hundred sixty plus parsed resumes with more than 96% accuracy. The promising results suggest that NLP and ML techniques employed in this study could be used for developing an efficient RCS.

Highlights

  • Internet-based recruiting systems have been rapidly adopted by recruiters in recent years

  • The results show that most of the algorithms produced excellent results on study data. This can be comprehended as the dataset size was optimal and best Natural Language Processing (NLP) and Machine Learning (ML) techniques were employed to achieve significantly better results

  • It is shown that LSVC, Stochastic Gradient Descent (SGD), Logistic Regression (LR), and Support Vector Classifier (SVC) produced exceptionally well results

Read more

Summary

Introduction

Internet-based recruiting systems have been rapidly adopted by recruiters in recent years. The ever increasing unemployment rate in developing countries like Pakistan results in considerable amount of job applications for a vacant position [4]. Recruiters need to screen through a large amount of data to select the most suitable application from the pool. It significantly increases the workload of the concerned department of Recruiter [5]. This process involves the engagement of considerable Human Resources and requires rigorous efforts and resources to finalize the most suitable applicant for further

Objectives
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.