Real-Time Bidding with Side Information

© 2017 Neural information processing systems foundation. All rights reserved. We consider the problem of repeated bidding in online advertising auctions when some side information (e.g. browser cookies) is available ahead of submitting a bid in the form of a d-dimensional vector. The goal for the ad...

Full description

Bibliographic Details
Main Authors: Flajolet, Arthur, Jaillet, Patrick
Other Authors: Massachusetts Institute of Technology. Operations Research Center
Format: Article
Language:English
Published: 2021
Online Access:https://hdl.handle.net/1721.1/137606
Description
Summary:© 2017 Neural information processing systems foundation. All rights reserved. We consider the problem of repeated bidding in online advertising auctions when some side information (e.g. browser cookies) is available ahead of submitting a bid in the form of a d-dimensional vector. The goal for the advertiser is to maximize the total utility (e.g. the total number of clicks) derived from displaying ads given that a limited budget B is allocated for a given time horizon T. Optimizing the bids is modeled as a contextual Multi-Armed Bandit (MAB) problem with a knapsack constraint and a continuum of arms. We develop UCB-type algorithms that combine two streams of literature: the confidence-set approach to linear contextual MABs and the probabilistic bisection search method for stochastic root-finding. Under mild assumptions on the underlying unknown distribution, we establish distribution-independent regret bounds of order Õ(d · √T) when either B = ∞ or when B scales linearly with T.