Interior-Point Methods for Full-Information and Bandit Online Learning

Jacob D Abernethy,Elad Hazan,Alexander Rakhlin

doi:10.1109/tit.2012.2192096

Interior-Point Methods for Full-Information and Bandit Online Learning

Jacob D Abernethy, Elad Hazan + Show 1 more

https://doi.org/10.1109/tit.2012.2192096

Copy DOI

Export

Save

Cite

Journal: IEEE Transactions on Information Theory	Publication Date: Jul 1, 2012
Citations: 95

Affiliation: University of California, Berkeley, Technion – Israel Institute of Technology, University of Pennsylvania

#Regret Minimization Algorithm #Bandit Setting #Interior-point Methods For Optimization #Linear Loss #Problem Of Linear Optimization #Full Feed #Methods For Optimization #Problem Of Optimization #Online Linear Optimization #Methods For Convex Optimization

Abstract
Full-Text
Similar Papers

Abstract

Listen

We study the problem of predicting individual sequences with linear loss with full and partial (or bandit) feed- back. Our main contribution is the first efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal Õ(√(T)) regret. In addition, for the full-information setting, we give a novel regret minimization algorithm. These results are made possible by the introduction of interior-point methods for convex optimization to online learning.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: IEEE Transactions on Information Theory

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Interior-Point Methods for Full-Information and Bandit Online Learning