Abstract

The dynamic nature of the job market, with its constant growth and evolving demands, makes the assessment of skills needed for a particular job increasingly difficult, time-consuming, and tedious. This necessitates an automated approach to identify the required skills for job seekers and recruiters. This study aimed to utilize machine learning (ML) and deep learning algorithms to streamline skill identification from technology-related job descriptions. We tested the viability of the Bidirectional Encoder Representation of Transformers (BERT) language model to complete this multi-class, multi-label classification task. We hypothesized that BERT, utilizing an NLP transformer, would predict job-required skills more accurately than traditional statistical ML models. To test this, we utilized a dataset comprised of job descriptions with varying skills. We experimented with two preprocessing approaches, K-Means Clustering and Latent Dirichlet Allocation, which are both statistical ML models meant to cluster the job descriptions. We then compared the performance of the BERT tokenizer to a combination of text-to-number tokenization techniques (Term Frequency - Inverse Document Frequency and Word2Vec) and supervised classifiers (K-Nearest Neighbors, Random Forest, and Multi-Layer Perceptron). The NLP transformer-based BERT model achieved the highest accuracy. We conclude that while traditional ML models provide some insights, advanced models like BERT show potential for more accurate skill prediction since the NLP transformer model we tested outperformed the ML models. However, the complexity of real-world data necessitates further refinement and development of these technologies. Our study shows that advanced deep learning-based models have the potential to enhance the job market’s efficiency by automating skill identification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.