Abstract

The occupational profiling system driven by the traditional survey method has some shortcomings such as lag in updating, time consumption and laborious revision. It is necessary to refine and improve the traditional occupational portrait system through dynamic occupational information. Under the circumstances of big data, this paper showed the feasibility of vocational portraits driven by job advertisements with data analysis and processing engineering technicians (DAPET) as an example. First, according to the description of occupation in the Chinese Occupation Classification Grand Dictionary, a text similarity algorithm was used to preliminarily choose recruitment data with high similarity. Second, Convolutional Neural Networks for Sentence Classification (TextCNN) was used to further classify the preliminary corpus to obtain a precise occupational dataset. Third, the specialty and skill were taken as named entities that were automatically extracted by the named entity recognition technology. Finally, putting the extracted entities into the occupational dataset, the occupation characteristics of multiple dimensions were depicted to form a profile of the vocation.

Highlights

  • A complete occupational information system is convenient for institutions and groups such as government agencies, social service institutions, higher education institutions, enterprises and labor forces to obtain information on occupations and occupational skills effectively and in a timely manner

  • The former was based on the long short-term memory (LSTM) algorithm to extract skill entities, and the latter was based on the conditional random field (CRF), LSTM-CRF and BiLSTM-CRF to extract entity related to data science positions

  • Occupational profiling driven by online job advertisements vocation case was data analysis and processing engineering technicians (DAPET) which is subordinate to the minor categories management engineering and technical personnel in Chinese Occupation Classification Grand Dictionary (COCGD)

Read more

Summary

Introduction

A complete occupational information system is convenient for institutions and groups such as government agencies, social service institutions, higher education institutions, enterprises and labor forces to obtain information on occupations and occupational skills effectively and in a timely manner. In China, the traditional occupational portrait system includes the Chinese Occupation Classification Grand Dictionary (COCGD), which covers 1,481 occupations in the whole society [4], 1,055 vocational skill standards and qualification requirements for some occupations. These standards are basically able to comprehensively and objectively describe the professions in China. The existing study was driven by job advertisement mainly focused on the position analysis with the position data obtained by keyword searching Another is how to extract the key requirements such as skills and specialties from the recruitment information.

Related work
Occupational analysis
Position analysis
COCGD description
Data description
Models and process
Preliminary corpus construction
TextCNN model further determines the occupational data set
BERT-BiLSTM-CRF model extracted the named entities
Result analysis
Overall analysis
Overlapping analysis
Horizontal analysis
Occupational profile
Findings
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call