Learning Optimal Online Advertising Portfolios with Periodic Budgets

Lennart Baardman,Abhishek Pani,Elaheh Fata,Georgia Perakis

doi:10.2139/ssrn.3346642

Abstract

Online advertising enables advertisers to reach customers with personalized ads. Advertisers need to determine the right targets for their ads and how much they are willing to pay to engage those targets. A large portion of online ads are priced using real-time auctions, thus advertisers need to decide which targets to bid on in these auctions. Collaborating with one of the largest ad-tech firms in the world, we develop new algorithms that help advertisers bid optimally on target portfolios while taking into account some limitations inherent to online advertising. We study this problem as a Multi-Armed Bandit (MAB) problem with periodic budgets. At the beginning of each time period, the advertiser needs to determine which portfolio of target to select to maximize the expected total revenue (revenue from clicks/conversions), while maintaining the total cost of auction payments within the advertising budget. In this paper, we formulate the problem and develop an Optimistic-Robust Learning (ORL) algorithm that uses ideas from Upper Confidence Bound (UCB) algorithms and robust optimization. We prove that the expected cumulative regret of the algorithm is bounded. Additionally, simulations on synthetic and real-world data show that the ORL algorithm reduces regret by at least 10-20% compared to benchmarks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Optimal Online Advertising Portfolios with Periodic Budgets

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal

Lead the way for us

Journal: SSRN Electronic Journal	Publication Date: Mar 27, 2019
Citations: 6

Similar Papers

In-depth Exploration and Implementation of Multi-Armed Bandit Models Across Diverse Fields
Jiazhen Wu
Highlights in Science, Engineering and Technology | VOL. 94
Jiazhen WuJiazhen Wu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

Enhancing UCB-tuned and Asymptotically Optimal UCB Algorithms through Weighted Average Techniques in Multi-Armed Bandit Scenarios
Chang Qu
Highlights in Science, Engineering and Technology | VOL. 94
Chang QuChang Qu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

Mechanisms with learning for stochastic multi-armed bandit problems
Shweta Jain ... Divya Padmanabhan
Indian Journal of Pure and Applied Mathematics | VOL. 47
Shweta Jain, et. al.Shweta Jain ... Divya Padmanabhan
01 Jun 2016
Indian Journal of Pure and Applied Mathematics | VOL. 47

Some Variations of Upper Confidence Bound for General Game Playing
Iván Francisco-Valencia ... José Raymundo Marcial-Romero
-
Iván Francisco-Valencia, et. al.Iván Francisco-Valencia ... José Raymundo Marcial-Romero
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Optimal Online Advertising Portfolios with Periodic Budgets

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal