A co-learning framework for learning user search intents from rule-generated training data

Jun Yan,Zeyu Zheng,Shuicheng Yan,Li Jiang,Yan Li,Zheng Chen

doi:10.1145/1835449.1835670

Abstract

Learning to understand user search intents from their online behaviors is crucial for both Web search and online advertising. However, it is a challenging task to collect and label a sufficient amount of high quality training data for various user intents such as compare products, plan a travel, etc. Motivated by this bottleneck, we start with some user common sense, i.e. a set of rules, to generate training data for learning to predict user intents. The rule-generated training data are however hard to be used since these data are generally imperfect due to the serious data bias and possible data noises. In this paper, we introduce a Co-learning Framework (CLF) to tackle the problem of learning from biased and noisy rule-generated training data. CLF firstly generates multiple sets of possibly biased and noisy training data using different rules, and then trains the individual user search intent classifiers over different training datasets independently. The intermediate classifiers are then used to categorize the training data themselves as well as the unlabeled data. The confidently classified data by one classifier are added to other training datasets and the incorrectly classified ones are instead filtered out from the training datasets. The algorithmic performance of this iterative learning procedure is theoretically guaranteed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A co-learning framework for learning user search intents from rule-generated training data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Meta learning addresses noisy and under-labeled data in machine learning-guided antibody engineering
Mason Minot ... Sai T Reddy
Cell Systems | VOL. 15
Mason Minot, et. al.Mason Minot ... Sai T Reddy
01 Jan 2024
Cell Systems | VOL. 15

A Novel Contrast Co-learning Framework for Generating High Quality Training Data
Zeyu Zheng ... Ning Liu
-
Zeyu Zheng, et. al.Zeyu Zheng ... Ning Liu
01 Dec 2010
01 Dec 2010

Quantum learning Boolean linear functions w.r.t. product distributions
Matthias C Caro
Quantum Information Processing | VOL. 19
Matthias C CaroMatthias C Caro
20 Apr 2020
Quantum Information Processing | VOL. 19

Simultaneously Removing Noise and Selecting Relevant Features for High Dimensional Noisy Data
Boseon Byeon ... Khaled Rasheed
-
Boseon Byeon, et. al.Boseon Byeon ... Khaled Rasheed
01 Jan 2008
01 Jan 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A co-learning framework for learning user search intents from rule-generated training data

Abstract

Talk to us

Similar Papers