Abstract
The article is devoted to investigating the application of hedging strategies to online expert weight allocation under delayed feedback. As the main result we develop the General Hedging algorithm G based on the exponential reweighing of experts’ losses. We build the artificial probabilistic framework and use it to prove the adversarial loss bounds for the algorithm G in the delayed feedback setting. The designed algorithm G can be applied to both countable and continuous sets of experts. We also show how algorithm G extends classical Hedge (Multiplicative Weights) and adaptive Fixed Share algorithms to the delayed feedback and derive their regret bounds for the delayed setting by using our main result.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.