Pairwise Loss Research Articles

Occupation-specific job tasks (OSTs) are the duties, responsibilities, and activities associated with a particular occupation; they define the core functions and performance expectations for those engaged in that profession. Efficient recognition and extraction of OSTs from large-scale job description data are essential for establishing a continually updated occupational information system (OIS), such as O*NET, which serves as a critical tool for advancing research in work and labor markets. However, this task presents substantial challenges because it relies heavily on domain experts for the labor-intensive annotation of job postings, which makes the process time-consuming and difficult to scale. To this end, we present COTR, a novel data-driven framework designed for the efficient recognition of OSTs from job postings, capable of continually identifying new tasks through class-incremental learning. Specifically, we first employ large language models (LLMs) and prompt learning to develop a three-phase process ("expansion, translation, and generation") that addresses the critical challenge of the absence of predefined OSTs in non-English labor market data, leveraging O*NET as a foundational reference. Subsequently, we introduce a BERT-based model for OST recognition, incorporating a uniquely designed pair-wise loss function that distills valuable insights from ChatGPT or other LLMs, thereby substantially enhancing recognition performance. In addition, to achieve cost-effective training data annotation, we develop an LLM-based coarse-to-fine candidate OST generation algorithm, integrating contrastive active learning to optimize the annotation process through human-machine collaboration.
Notably, we design a supervised fine-tuning strategy with a novel encoding technique to optimize LLMs, improving the recall rate of the generated candidate OSTs and achieving up to a 343-fold increase in annotation efficiency compared to traditional manual expert annotation in our experiments. Afterward, we propose an efficient class-incremental learning method that incorporates an out-of-distribution (OOD) detection module for identifying potential novel OSTs and a fine-tuning module to extend the model's recognition capabilities to include newly discovered tasks. Finally, we construct two real-world datasets using job posting data collected from the labor markets of China and the United States, respectively. Extensive experiments on these real-world datasets, along with two publicly available datasets, demonstrate the effectiveness of the proposed COTR. Furthermore, several case studies showcase the significant benefits of COTR for various downstream applications in labor market analysis, including analyzing the evolving demand for OSTs, assessing the value of OSTs, and recognizing the relationships between OSTs and associated skills.
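
The abstract above does not give the form of COTR's pair-wise loss. As a rough illustration only, the sketch below uses a generic pairwise logistic (RankNet-style) loss in which a teacher's preference between two candidate OSTs, for example one obtained from an LLM, decides which student score should be higher. The function names, the boolean teacher signal, and the scores are all hypothetical and are not taken from the paper.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    if x > 0:
        return x + math.log1p(math.exp(-x))
    return math.log1p(math.exp(x))


def distill_pair_loss(score_a, score_b, teacher_prefers_a):
    """Pairwise logistic loss, -log(sigmoid(preferred - other)).

    score_a, score_b: student scores for two candidate tasks (hypothetical).
    teacher_prefers_a: True if the teacher ranks candidate A above B.
    """
    diff = score_a - score_b if teacher_prefers_a else score_b - score_a
    return softplus(-diff)


# The loss is small when the student agrees with the teacher's ordering
# and large when it disagrees.
agree = distill_pair_loss(2.0, 0.0, teacher_prefers_a=True)
disagree = distill_pair_loss(0.0, 2.0, teacher_prefers_a=True)
print(agree < disagree)  # True
```

A loss of this shape only needs the teacher's relative ordering of two candidates, not calibrated scores, which is one plausible reason to prefer a pairwise formulation when the supervision comes from an LLM.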


The number of Internet users has been growing exponentially, and tag recommendation can automatically provide users with a selection of tags of interest to meet their personalized needs. Users can apply these tags freely to annotate their favorite resources, which makes retrieving related resources fast and efficient. Tensor factorization methods are currently the common approach to tag recommendation. These methods model the user × item × tag interactions, transform the latent feature representations of users, items, and tags into low-rank matrices, and use inner products for prediction. However, the inner product does not satisfy the triangle inequality, so it ignores the distance relationships among entity pairs and cannot capture fine-grained preference information. Metric learning in recommendation focuses on pairwise loss, which assumes that different categories (such as users, items, and tags) have fixed margins, even though different categories often have different intra-class variations. With fixed margins, it is often difficult to distinguish accurately between positive and negative samples, which reduces recommendation performance and limits the expressive ability of the model. In this study, metric learning is used to explore the distance relationships within user × item × tag triplets, and existing metric-learning-based methods (namely LRML, CML, and SML) are applied to tag recommendation. A pairwise metric learning method with angular margin, named PMLT, is proposed. The pairwise distance relationships between user-tag and item-tag are modeled separately to account for the information of the different entities. An extra angular margin regularizer is added to the original pairwise loss to control the size of the angular margin for user-tag and item-tag, respectively. The strength of this regularizer is controlled to dynamically adjust the distance changes of entities.
This method constrains both the fixed margin and the angular margin of user-tag and item-tag. Compared with traditional metric learning methods, it can capture additional relationship structure and achieves good recommendation performance. Finally, we conducted extensive experiments on two datasets, LastFm and Movielens. The results show that the proposed PMLT outperforms the state-of-the-art baselines on the evaluation metrics Recall@N and NDCG@N and obtains better prediction quality. We also analyze the influence of different parameters and internal components on the performance of the proposed method, which improves its interpretability.
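
To make the fixed-margin pairwise loss concrete, here is a minimal sketch in the style of distance-based metric learning: a hinge on squared Euclidean distances, applied once for the user-tag pair and once for the item-tag pair. It covers only the fixed-margin part. The angular margin regularizer is not specified in the abstract and is not attempted here. The function names, the margin value, and the two-dimensional embeddings are hypothetical choices for illustration.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two embedding vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def pairwise_hinge(anchor, pos_tag, neg_tag, margin=0.5):
    """Fixed-margin pairwise loss.

    Penalizes the anchor unless the positive tag is closer than the
    negative tag by at least `margin` (in squared distance).
    """
    return max(0.0, margin + sq_dist(anchor, pos_tag) - sq_dist(anchor, neg_tag))


def tag_loss(user, item, pos_tag, neg_tag, margin=0.5):
    """Sum of the user-tag and item-tag pairwise losses."""
    return (pairwise_hinge(user, pos_tag, neg_tag, margin)
            + pairwise_hinge(item, pos_tag, neg_tag, margin))


# Hypothetical 2-D embeddings: the positive tag sits between user and item,
# while the negative tag is far from both.
user, item = [0.0, 0.0], [1.0, 0.0]
pos_tag, neg_tag = [0.5, 0.0], [0.5, 2.0]

print(tag_loss(user, item, pos_tag, neg_tag))  # 0.0
print(tag_loss(user, item, neg_tag, pos_tag))  # 9.0
```

In the first call the ordering already holds with room to spare, so the loss is zero. Swapping the tags violates the margin for both the user and the item, and each contributes 0.5 + 4.25 - 0.25 = 4.5.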

Articles published on Pairwise Loss

COTR: Efficient Job Task Recognition for Occupational Information Systems with Class-Incremental Learning

Apparatus for producing single strontium atoms in an optical tweezer array

Imbalanced image classification algorithm based on fine-grained analysis

Towards sharper excess risk bounds for differentially private pairwise learning

Ranking Model Averaging: Ranking Based on Model Averaging

Code and Data Repository for Ranking Model Averaging: Ranking Based on Model Averaging

An Efficient Multi-Branch Attention Network for Person Re-Identification

Mask-guided discriminative feature network for occluded person re-identification

Limited Memory Online Gradient Descent for Kernelized Pairwise Learning with Dynamic Averaging

SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor Learning

On the Effectiveness of Sampled Softmax Loss for Item Recommendation

Integrating listwise ranking into pairwise-based image-text retrieval

Improving top-[formula omitted] recommendations using batch approximation for weighted pair-wise loss

Application of deep metric learning in the verification process of wheel design similarity: Hyundai motor company case

Efficient block contrastive learning via parameter-free meta-node approximation

Discriminator-Enhanced Knowledge-Distillation Networks

Developing the Wheel Image Similarity Application with Deep Metric Learning: Hyundai Motor Company Case

Self-training on graph neural networks for recommendation with implicit feedback

Pairwise Metric Learning With Angular Margin for Tag Recommendation

Adversarial Binary Mutual Learning for Semi-Supervised Deep Hashing
