ResuméAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Models

Ahmed Heakl, Youssef Mohamed, Noran Mohamed, Aly Elsharkawy, Ahmed Zaky

doi:10.1016/j.procs.2024.10.189

Abstract

The increasing reliance on online recruitment platforms coupled with the adoption of AI technologies has highlighted the critical need for efficient resume classification methods. However, challenges such as small datasets, lack of standardized resume templates, and privacy concerns hinder the accuracy and effectiveness of existing classification models. In this work, we address these challenges by presenting a comprehensive approach to resume classification. We curated a large-scale dataset of 13,389 resumes from diverse sources and employed Large Language Models (LLMs) such as BERT and Gemma1.1 2B for classification. Our results demonstrate significant improvements over traditional machine learning approaches, with our best model achieving a top-1 accuracy of 92% and a top-5 accuracy of 97.5%. These findings underscore the importance of dataset quality and advanced model architectures in enhancing the accuracy and robustness of resume classification systems, thus advancing the field of online recruitment practices. Our models, code, and dataset are available as open-source resources.
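
The abstract reports both top-1 and top-5 accuracy. As a minimal sketch of how these metrics are conventionally computed (not the authors' evaluation code), the snippet below counts a prediction as correct when the true category appears among the k highest-scoring classes. The function name `top_k_accuracy` and the toy scores and labels are hypothetical, chosen only for illustration; they are not taken from the paper's dataset.

```python
def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes.

    scores: list of per-class score lists, one row per sample.
    labels: list of true class indices, one per sample.
    """
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Hypothetical toy data: 4 resumes scored over 6 made-up job categories.
scores = [
    [0.60, 0.10, 0.10, 0.10, 0.05, 0.05],  # true class 0 is ranked first
    [0.20, 0.50, 0.10, 0.10, 0.05, 0.05],  # true class 0 is ranked second
    [0.05, 0.10, 0.15, 0.20, 0.22, 0.28],  # true class 0 is ranked last
    [0.10, 0.10, 0.50, 0.20, 0.05, 0.05],  # true class 3 is ranked second
]
labels = [0, 0, 0, 3]

top1 = top_k_accuracy(scores, labels, k=1)
top5 = top_k_accuracy(scores, labels, k=5)
print(f"top-1: {top1:.2f}, top-5: {top5:.2f}")
```

Top-5 accuracy can never be lower than top-1 on the same predictions, since any top-1 hit is also a top-5 hit; the gap between the two indicates how often the correct category is ranked near, but not at, the top.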

More From: Procedia Computer Science