Stochastic Online Learning with Probabilistic Graph Feedback

Shuai Li, Wei Chen, Kwong-Sak Leung, Zheng Wen

doi:10.1609/aaai.v34i04.5899

Abstract

We consider a problem of stochastic online learning with general probabilistic graph feedback, where each directed edge (i,j) in the feedback graph has a probability p_ij. Two cases are covered. (a) The one-step case, where after playing arm i the learner observes a reward sample of arm j with independent probability p_ij. (b) The cascade case, where after playing arm i the learner observes feedback from all arms j reached by a probabilistic cascade starting from i: for each edge (i,j) with probability p_ij, if arm i is played or observed, then a reward sample of arm j is observed with independent probability p_ij. Previous works mainly focus on deterministic graphs, which correspond to the one-step case with p_ij ∈ {0,1}, on an adversarial sequence of graphs with certain topology guarantees, or on a specific type of random graph. We analyze the asymptotic lower bounds and design algorithms in both cases. The regret upper bounds of the algorithms match the lower bounds with high probability.
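The two feedback models described in the abstract can be illustrated with a minimal simulation sketch. This is not the authors' algorithm; it only samples which arms would be observed after a single play. The function names, the dict-of-dicts layout `p[i][j]` for edge probabilities, and the choice to try each edge once per round are assumptions made for illustration.

```python
import random
from collections import deque


def one_step_observed(p, i, rng):
    """One-step case: after playing arm i, each out-neighbor j of i
    is observed independently with probability p[i][j]."""
    return {j for j, prob in p.get(i, {}).items() if rng.random() < prob}


def cascade_observed(p, i, rng):
    """Cascade case: observations propagate from the played arm i.

    Assumption for this sketch: every edge (u, j) is tried exactly once,
    at the moment u is first played or observed.
    """
    active = {i}           # arms that have been played or observed
    observed = set()       # arms whose reward sample is revealed
    frontier = deque([i])  # arms whose outgoing edges are still untried
    while frontier:
        u = frontier.popleft()
        for j, prob in p.get(u, {}).items():
            if rng.random() < prob:
                observed.add(j)
                if j not in active:
                    active.add(j)
                    frontier.append(j)
    return observed


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical 3-arm graph with edges 0 -> 1 and 1 -> 2.
    p = {0: {1: 0.5}, 1: {2: 0.5}, 2: {}}
    print(one_step_observed(p, 0, rng))
    print(cascade_observed(p, 0, rng))
```

When every p_ij is 0 or 1, `one_step_observed` reduces to deterministic graph feedback, the special case the abstract attributes to earlier work. In the cascade sketch, arm 2 can be observed after playing arm 0 only if the edge to arm 1 fires first.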


Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: Apr 3, 2020
Citations: 6

Similar Papers

Elementary School Teachers’ Problems in Online Learning during the Pandemic
Gusti Ayu Putu Sintha Arma Erawati ... I Wayan Widiana
International Journal of Elementary Education | VOL. 5
04 Oct 2021

Online Learning in Markov Decision Processes with Continuous Actions
Yi-Te Hong ... Chi-Jen Lu
01 Jan 2015

Finding the optimal exploration-exploitation trade-off online through Bayesian risk estimation and minimization
Stewart Jamieson ... Yogesh Girdhar
Artificial Intelligence | VOL. 330
21 Feb 2024

Problematika Pembelajaran Daring Siswa SD Negeri 24 Kota Bengkulu di Masa Pandemi Covid-19
Neda Lesminiarti
Journal of Primary Education (JPE) | VOL. 2
19 Jun 2022
