You can edit almost every page by Creating an account. Otherwise, see the FAQ.

Automated Bidding in a Repeated Bidding Game

From EverybodyWiki Bios & Wiki

Script error: No such module "AfC submission catcheck".


An automated bidding agent is a software used for buying ad impressions on behalf of an advertiser. An advertiser specifies aggregate objectives and constraints, and a dynamic bidding algorithm tunes the bids of an ad campaign to meet those objectives. Automated bidding agents are now standard in online advertising auctions.

Online ad impressions are sold by repeated auctions. Bids to those actions are specified through advertiser campaigns and can vary by keyword, user demographic, etc. The campaign will also specify a budget that will constrain the total spend over many instances of the bids.

There are many advantages of having campaigns instead of companies purchasing the advertisements directly, but the down side is that campaign management is a complex task. Auction details may vary from one to another, especially when there are different ad formats and a single campaign needs to manage them all. In addition, due to budget constraints, even second-price auction is not truthful, so allocating budget across time is not a simple task. Complexity of campaign managers ultimately led to development of automated bid managers, which are now used to manage auctions.

Automated bidders are extremely popular up to the point that some online advertisement platforms don't even support manual bidding any more.

Model: Value-Maximizing Auto-bidders[edit]

The automated bidders act on behalf of the advertisers. They are fed with high-level preferences, constraints and a bidding algorithm that dynamically adjusts the campaign details:

Bidding is done in rounds, one impression auctioned off each round.

There are advertisers, each with an associated auto-bidder.

Input: advertiser specifies a budget and values (max bids) for every round .

In each round, the values (that are i.i.d. over time) are drawn from a known distribution (no correlation across advertisers in each round).

An auto-bidder for advertiser observes and places bid .

Output: the auction outcome, i.e. an allocation and payment (that depends on the type of auction being held).

Goal: to maximize , subject to .

Budget Pacing[edit]

There are numerous ways for the auto-bidders to divide their budget between the rounds.

Multiplicative Budget Pacing[edit]

The advertiser specifies budget and maximum bids .

The auto-bidders maintain a shading factor .

In round , an auto-bidder bids and selects adaptively, based on the observed history.

When the auction is second-price, there exists a pure Nash equilibrium of the multiplicative pacing game.

Note: Finding a Nash equilibrium is PPAD-hard.[1][2]

Known Results

It was shown that at any equilibrium, the resulting allocation obtains at least half of the optimal liquid welfare (optimized willingness to pay)[3], where liquid welfare is defined as a

maximal willingness to spend for the goods received: [4]

In addition, if a single-round auction is truthful, then there is a choice of such that bidding in every auction instance is optimal in hindsight.[5]

Adaptive Pacing[edit]

Using an adaptive algorithm presented below will achieve good results for a broad class of auctions (including First-price, Second-price, GSP auctions).

Adaptive Multiplicative Pacing Algorithm[6][edit]

1. Initialize
2. In each round :
2.1. Bid .
2.2. Observe payment .
2.3. (If then set ) where and learning rate If we spend more then , reduce bid next round If we spend less then , increase bid next round.

Note: A bid can be at most the remaining budget, so if we exhaust the budget early, all the following bids will be 0.

Class of Auctions[edit]

The algorithm yields good results for auctions satisfying the following conditions:

Input: bids

Output: allocation and payments

Condition 1: Core Auction

  1. Payments are at most bids: .
  2. For any subset of bidders , and any feasible allocation , . Intuition: If we take , then the auction maximizes declared welfare, and if , then the payments are at least the VCG payments.

Condition 2: Monotone Bang-Per-Buck

  1. is weakly decreasing as increases.

As mentioned above, first-price, second-price and GSP auctions are all in this class.

Results[edit]

Using this algorithm on the defined class of auctions, the following outcomes can be observed:

  1. Simultaneous Pacing: When auto-bidders run simultaneously, the expected liquid welfare is at least half of the expected optimum.
  2. Stochastic Regret: The algorithm achieves vanishing regret for a single advertiser in a stochastic environment.

Simultaneous Pacing[edit]

Suppose that all agents use adaptive pacing simultaneously.

Recall that denote allocation of auto-bidder in round (a random variable). For any auction in the class, the expected liquid welfare of is at least half of the expected optimal liquid welfare in hindsight:

(left side of the equation is the expected liquid welfare of and right side is the expected optimal welfare)

Proof approach

Let's define Epoch to be a maximal interval on which (where is taken from the interval).

Lemma 1. If is an epoch, then (Average spend over an epoch per round). Lemma 1 proof outline. If , then using a telescopic sum we can conclude that .

If the expected number of rounds with is , then the expected spend is at least , so this is a good approximation to the optimal liquid welfare.

Recalling the core auction property: for , .(Leftmost sum is the value obtained by agents with , middle sum is the collected payment, and right sum is the maximal obtainable value for agents with )

Note: When for some , is high, but is low (e.g. a different auto-bidder wins instead), any lost value can be charged to payments received.

Stochastic Regret[edit]

For any auction in the previously mentioned class, the total expected value obtained by the adaptive pacing algorithm in a stochastic environment is .

(The expected per-round value of bidding , where , sets expected per-round payment to ).

In a stochastic environment, on each round, the advertiser's value and the profile of competing bids are drawn i.i.d. (value and competing bids can be arbitrarily correlated).

Proof approach

Interpret update rule as following a gradient, and consider adaptive pacing as Stochastic Gradient Descent.[7]

If we take a -neighborhood of , and is a set that is a neighborhood of , the following claims will hold.

Claim 1. is nearly always in the set .

Proof of Claim 1. Recall that , then .

This implies that if , then is -biased towards .

Furthermore,

Claim 2. For , bidding each round is -approximation.

Proof of Claim 2.

Case 1: (overspend): only participate in rounds instead of , but value is obtained per round is weakly higher.

Case 2: (underspend): might win fewer impressions in expectation, but bang-per-buck (value per dollar spent) is weakly higher. This means that

[fraction of value lost] [fraction of budget unspent] .

External Links[edit]

A Lecture on "Automated Bidding: Efficiency and Regret in a Repeated Bidding Game" by Brendan Lucier

References[edit]

  1. List of PPAD-complete problems
  2. [Chen Kroer Kumar '21]
  3. [Balseiro Gur '17],[Conitzer Kroer Sodomka Stier-Moses '18],[Babaioff Cole Hartline Immorlica L. '21]
  4. [Dobzinski Paes-Leme '14]
  5. [Aggarwal Badanidiyuru Mehta '19], [Babaioff Cole Hortline Immorlica L. '21]
  6. [Balseiro Gur EC'17]
  7. Extends result of [BG'17] for second-price auction, indep. values


This article "Automated Bidding in a Repeated Bidding Game" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:Automated Bidding in a Repeated Bidding Game. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.