Abstract

Recommendation systems typically consist of matching and ranking stages. Click-through rate (CTR) and conversion rate (CVR) prediction are two fundamental modules in recommendation systems. Most candidate generation (matching) models leverage a two-tower architecture to model the CTR prediction task. However, items with low-quality content but attractive titles, i.e., clickbait, may then be recommended to the user, which worsens the user's experience. Therefore, both the click and conversion tasks should be modeled during the matching stage to improve user engagement. An intuitive way is to model these two tasks with a three-tower matching model, but its efficiency and effectiveness are limited. By inheriting the merits of knowledge distillation and multi-task learning, we propose a distillation-based multi-task multi-tower model (DMMP) for personalized recommendation. Specifically, the MTL-based teacher network builds task-shared and task-specific expert networks and employs a customized multi-gate control network to merge these experts adaptively within the task embedding layer. Two auxiliary tasks, pCTR and pCTCVR, are incorporated to model CVR directly over the entire sample space. By sharing feature representation parameters with the CTR model, the CVR model can be trained on richer samples. The three-tower candidate generation student network incorporates a user preference gating network to learn the task scores so that personalized candidates can be produced. Furthermore, we design an adaptive weighting strategy for the total loss to adjust the importance and relevance of the different networks during training. Extensive experiments on public and industrial datasets validate the effectiveness of DMMP.
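As a minimal sketch of the entire-space CVR idea referenced in the abstract (not the paper's actual implementation), the following PyTorch snippet shows how auxiliary pCTR and pCTCVR predictions can share a feature representation so that the CVR tower is trained over all impressions. The layer sizes, module names, and static loss weights are illustrative assumptions; the paper's customized gating, adaptive loss weighting, and distillation components are omitted.

    import torch
    import torch.nn as nn

    class EntireSpaceCVRHead(nn.Module):
        """Hedged sketch: entire-space CVR modeling via auxiliary pCTR and
        pCTCVR tasks on top of a shared feature representation. Dimensions
        and layer names are illustrative, not taken from the paper."""

        def __init__(self, feat_dim: int, hidden_dim: int = 64):
            super().__init__()
            # The shared representation feeds both task towers, so the CVR
            # tower is trained on all impressions, not clicked samples only.
            self.shared = nn.Sequential(nn.Linear(feat_dim, hidden_dim), nn.ReLU())
            self.ctr_tower = nn.Linear(hidden_dim, 1)
            self.cvr_tower = nn.Linear(hidden_dim, 1)

        def forward(self, x):
            h = self.shared(x)
            p_ctr = torch.sigmoid(self.ctr_tower(h))   # P(click | impression)
            p_cvr = torch.sigmoid(self.cvr_tower(h))   # P(conversion | click)
            p_ctcvr = p_ctr * p_cvr                    # P(click & conversion | impression)
            return p_ctr, p_cvr, p_ctcvr

    def entire_space_loss(p_ctr, p_ctcvr, click, click_and_convert,
                          w_ctr=1.0, w_ctcvr=1.0):
        """Both losses are defined over the whole impression space; pCVR is
        supervised only indirectly through pCTCVR = pCTR * pCVR. The static
        weights stand in for the paper's adaptive weighting strategy."""
        bce = nn.functional.binary_cross_entropy
        loss_ctr = bce(p_ctr.squeeze(-1), click)
        loss_ctcvr = bce(p_ctcvr.squeeze(-1), click_and_convert)
        return w_ctr * loss_ctr + w_ctcvr * loss_ctcvr

Because pCVR is supervised only through the product pCTCVR = pCTR x pCVR, both labels are observable for every impression, which is what allows the CVR model to be trained on richer samples than clicked-only training would permit.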
