Abstract

Online learning is a fundamental paradigm for learning from continuous data streams. Traditional online learning approaches usually assume that the feature space of the data stream is fixed and that the true label of each incoming instance is always revealed after a prediction is made. However, in many real-world applications, such as personalized recommender systems, the feature space may keep expanding as user behaviors accumulate. Moreover, we may receive only bandit feedback, i.e., we only learn whether the prediction was correct or not. To address this important but rarely studied problem, we propose a novel algorithm, LIFBF, together with its two variants LIFBF-I and LIFBF-II, for learning from data streams with an incremental feature space and bandit feedback. Specifically, when an instance arrives with augmented features, we first use an exploration-exploitation strategy to guess its best label; we then propose a new loss function that accounts for both the bandit feedback and the guessed label. Finally, we design a highly dynamic multi-class classifier that updates the shared and augmented features by adopting the passive-aggressive rule and the structural risk minimization principle, respectively. We theoretically analyze the cumulative loss bound of LIFBF, and empirical studies on various datasets further validate the effectiveness of our proposed algorithms.
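The learning setting sketched in the abstract can be illustrated with a minimal toy learner. This is not the paper's LIFBF algorithm, only a generic sketch of its ingredients: an epsilon-greedy guess under bandit feedback, zero-padding of the weight matrix when the feature space grows, and a passive-aggressive-style step size. The class name, the exploration rate `epsilon`, and the exact update rule are assumptions for illustration.

```python
import numpy as np

class BanditIncrementalLearner:
    """Toy multi-class learner for streams with growing features and
    bandit feedback (we only observe whether a prediction was correct).
    Illustrative only; not the LIFBF update rule from the paper."""

    def __init__(self, n_classes, epsilon=0.1, seed=0):
        self.n_classes = n_classes
        self.epsilon = epsilon                 # exploration-exploitation trade-off
        self.W = np.zeros((n_classes, 0))      # one weight row per class
        self.rng = np.random.default_rng(seed)

    def _grow(self, d):
        # Feature space expanded: pad every class weight vector with zeros
        # so old (shared) weights are kept and new features start at zero.
        if d > self.W.shape[1]:
            pad = np.zeros((self.n_classes, d - self.W.shape[1]))
            self.W = np.hstack([self.W, pad])

    def predict(self, x):
        self._grow(len(x))
        # Epsilon-greedy: usually exploit the highest-scoring class,
        # occasionally explore a uniformly random one.
        if self.rng.random() < self.epsilon:
            return int(self.rng.integers(self.n_classes))
        return int(np.argmax(self.W @ x))

    def update(self, x, y_pred, correct):
        # Bandit feedback: we only know whether y_pred was right.
        # Passive-aggressive-style step scaled by hinge loss / ||x||^2.
        margin = self.W[y_pred] @ x
        norm_sq = x @ x + 1e-12
        if correct:
            tau = max(0.0, 1.0 - margin) / norm_sq
            self.W[y_pred] += tau * x          # reinforce the correct guess
        else:
            tau = max(0.0, 1.0 + margin) / norm_sq
            self.W[y_pred] -= tau * x          # penalize the wrong guess
```

In a typical driver loop, one would call `predict` on each arriving instance (whose dimension may have grown), compare the guess to the single bit of feedback available, and call `update` with only that bit, mirroring the exploration-then-update structure described above.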

