“FIJO”: a French Insurance Soft Skill Detection Dataset

David Beauchemin,Marouane Yassine,Yvan Le Ster,Julien Laumonier

doi:10.21428/594757db.858dd91f

Abstract

Understanding the evolution of job requirements is becoming more important for workers, companies and public organizations to follow the fast transformation of the employment market. Fortunately, recent natural language processing (NLP) approaches allow for the development of methods to automatically extract information from job ads and recognize skills more precisely. However, these efficient approaches need a large amount of annotated data from the studied domain which is difficult to access, mainly due to intellectual property. This article proposes a new public dataset, FIJO, containing insurance job offers, including many soft skill annotations. To understand the potential of this dataset, we detail some characteristics and some limitations. Then, we present the results of skill detection algorithms using a named entity recognition approach and show that transformers-based models have good token-wise performances on this dataset. Lastly, we analyze some errors made by our best model to emphasize the difficulties that may arise when applying NLP approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

“FIJO”: a French Insurance Soft Skill Detection Dataset

Abstract

Talk to us

Similar Papers

More From: Proceedings of the Canadian Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the Canadian Conference on Artificial Intelligence	Publication Date: May 27, 2022
License type: cc-by

Similar Papers

Portability of natural language processing methods to detect suicidality from clinical text in US and UK electronic health records.
Marika Cusick ... Jyotishman Pathak
Journal of affective disorders reports | VOL. 10
Marika Cusick, et. al.Marika Cusick ... Jyotishman Pathak
01 Dec 2022
Journal of affective disorders reports | VOL. 10

NLP Applications for Big Data Analytics Within Healthcare
Aadarsh Choudhary ... Anurag Choudhary
-
Aadarsh Choudhary, et. al.Aadarsh Choudhary ... Anurag Choudhary
01 Jan 2021
01 Jan 2021

Reprint of: Where and how to find bio-inspiration?: A comparison of search approaches for bio-inspired design
Mart Willocx ... Joost R Duflou
CIRP Journal of Manufacturing Science and Technology | VOL. 34
Mart Willocx, et. al.Mart Willocx ... Joost R Duflou
27 Jul 2021
CIRP Journal of Manufacturing Science and Technology | VOL. 34

Where and how to find bio-inspiration?: A comparison of search approaches for bio-inspired design
Mart Willocx ... Joost R Duflou
CIRP Journal of Manufacturing Science and Technology | VOL. 31
Mart Willocx, et. al.Mart Willocx ... Joost R Duflou
26 Oct 2020
CIRP Journal of Manufacturing Science and Technology | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

“FIJO”: a French Insurance Soft Skill Detection Dataset

Abstract

Talk to us

Similar Papers

More From: Proceedings of the Canadian Conference on Artificial Intelligence