regret minimizing equilibria of games with strict type uncertainty

Post on 08-Jan-2016

40 Views

Category:

Documents

3 Downloads

Preview:

Click to see full reader

DESCRIPTION

Regret Minimizing Equilibria of Games with Strict Type Uncertainty. Stony Brook Conference on Game Theory Nathana ë l Hyafil and Craig Boutilier Department of Computer Science University of Toronto. Overview. 1. Motivation / Background Automated Mechanism design Strict Uncertainty - PowerPoint PPT Presentation

TRANSCRIPT

Regret Minimizing Equilibria of Games with

Strict Type Uncertainty

Stony Brook Conference on Game Theory

Nathanaël Hyafil and Craig Boutilier

Department of Computer ScienceUniversity of Toronto

2

Overview

• 1. Motivation / Background– Automated Mechanism design– Strict Uncertainty– Minimax Regret

• 2. Games with Strict Type Uncertainty– Definition of equilibrium– Existence of equilibrium

• 3. Applications / Conclusion– Partial Revelation Mechanism Design

3

Automated MD (AMD)

• VCG: always pick efficient outcome

• Myerson auction: – not always optimal outcome – but maximizes expected objective

(revenue) given a prior over agents’ types

• AMD:– for general objectives (not just revenue)– general outcome space (not just auctions)

4

Automated MD (AMD)

• Given:– sets of types, outcomes– objective function f(,o) (SW, revenue, ...)– prior over types

• Optimization problem: – find mechanism (outcome for each type vector)– maximize expected objective value– subject to Constraints:

• Incentive Compatibility (BNE or DS)• ( Individual Rationality , Budget Balance , ...)

5

Where do priors over types come from?

• “Experts”?– Costly!– Can rule out inappropriate valuations– But hard to quantify probabilistically– simple distribution (unrealistic but needed)

• Observation of past behavior?– Gives linear constraints on values– Not probability distributions

6

Strict Uncertainty

• No probability distribution but subset of possible types

• Agents cannot maximize expected utility use MiniMax Regret as decision criterion

• Mechanism Designer: can’t use Bayes-Nash Eq., can’t maximize expected objective

Mech Designer minimizes his regret too

7

MiniMax Regret

• Different from:– regret used to converge to equilibrium in

repeated games (e.g., Hart & MasColell)– regret of Regret Theory (Bell; Loomes & Sugden)

• Savage’s MiniMax Regret criterion from Decision Theory

• recently used for uncertainty about utilities (as opposed to outcomes)

8

MiniMax Regret

• Single agent: make decision dD with incomplete utility function u U

9

Why MiniMax Regret?

x

x’

x’x

x

x’

x’

x

xx’

x

x’

u1 u2 u4 u5u3

• In this context, MaxiMin not good:

u6

10

2. Games of Incomplete Information with Strict Type Uncertainty

• N players, and for each:

• Actions: Ai

• Types: i

• Utility: ui: A i R

• Each agent knows its type, not the others’, but:

• Common prior: Strict: T

• Strategy: i: i (Ai)

11

Regret definitions

• Regret of strategy i for agent i of type i, given type i and strategy i of the others:

• MaxRegret of strategy i for i of type i, given strategy i of the others (for prior T):

12

Example

• First-Price Auction– 2 agents ; – 3 actions: .25 , .5 , .75 – Ties broken randomly

(V-.75)/2V-.75V-.750.75

0(V-.5)/2V-.50.5

00(V-.25)/20.25

0.75 0.50.25

13

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 }

.5.8 .75

• What is MR1(bid = .25|1=.4 ; 2 ) ?

14

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 }

.5.8 .75

• What is MR1(bid = .25|1=.4 ; 2 ) ?

R1(bid = .25) if 2=.2:

15

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 } .5.8 .75

• What is MR1(bid = .25|1=.4 ; 2 ) ?

R1(bid = .25) if 2 = .2:

0.25

0.25 (1-.25) / 2

0.5 (1 - .5)

0.75 (1 - .75)

16

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 } .5.8 .75

• What is MR1(bid = .25|1=.4 ; 2 ) ?

R1(bid = .25) if 2 = .2:

Regret vs. 0.25

0.25 0

0.5 (1 - .5) - (1-.25) / 2

0.75 (1 - .75) - (1-.25) / 2

17

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 } .5.8 .75

• What is MR1(bid = .25|1=.4 ; 2 ) ?

R1(bid = .25) if 2 = .2:

Regret vs. 0.25

0.25 0 0

0.5 (1 - .50) - (1-.25) / 2 - 0.175 < 0

0.75 (1 - .75) - (1-.25) / 2 - 0.425 < 0

18

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 } .5.8 .75

• What is MR1(bid = .25|1=.4 ; 2 ) ?

R1(bid = .25) if 2 = .2 0

19

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 } .5

.8 .75

• What is MR1(bid = .25|1=.4 ; 2 ) ?

R1(bid = .25) if 2 = .2 0

R1(bid = .25) if 2 = .4 0

R1(bid = .25) if 2 = .6 0

R1(bid = .25) if 2 = .8 0

20

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 }

.5.8 .75

• MR1(bid = .25|1=.4 ; 2 ) = max { 0, 0, 0, 0}= 0

21

Example: Agent 1’s reasoning

• 2 {.2, .4, .6, .8} 2 : .2 .25 {.4 , .6 } .5

.8 .75

• MR1(bid = .25|1=.4 ; 2 ) = max { 0, 0, 0, 0}= 0

• so argmina MR1(a| 1=.4 ; 2) = .25

• and MMR = 0

22

Equilibrium definitions

• MiniMax Regret Best Response to -i :

iTi,

• is a MiniMax Regret Equilibrium iff i is a MiniMax Regret best resp. to -i, i

• i is a MiniMax Regret Dominant Strategy iffit is a MiniMax Regret best resp. to all -i

23

Example

First-Price Auction

Ti = {.2 , .4 , .6 , .8}

• MiniMaxRegret Equilibrium: (i,i) with i:

.2 bid .25 (MMR = 0)

.4 bid .25 (MMR = 0)

.6 bid (.25,.5) with p=(.6,.4) (MMR = 0.03)

.8 bid (.5,.75) with p=(10/11,1/11) (MMR =.0227)

24

Existence Results

• Theorem: There exists a MiniMax Regret Eq in all games with finite number of agents, actions and types

• Proposition: is a MiniMax Regret dominant strategy equ. for a Strict incomplete information game iff it is a DS for any corresponding Bayesian game

• Observation: is a MiniMax Regret Eq. with zero regret for all types of all agents iff it is an Ex-Post Eq.

25

Non-finite Games?

• Proof relies on Kakutani’s fixed point theorem

• main difference with Bayesian games: expected

utility is linear, Max Regret is not

• so any extension (e.g., continuous games) that

doesn’t require linearity should apply to MMR

(e.g., Milgrom & Weber 1987)

26

3. Applications:

• Strict Automated Mechanism Design: – designer is regret minimizer too

– regret of mechanism M1 vs. M2: difference in objective

value (SW, …) between M1 and M2 when an

‘adversary’ picks the types of the agents

• (Hyafil & Boutilier, UAI 2004):– formulation as optimization subject to IC, IR, …

– infinite number of constraints, some non-linear

– algorithm to solve as sequence of linear problems

27

Application:Partial Revelation MD

• Revelation Principle Direct, truthful mechanisms: – agents directly report their full type

• But:– hard/costly valuation problem– privacy concerns– communication costs

28

Partial Revelation MD

• Instead: partial type– v [.4 , .6]

• Partial Revelation: – Type space is partitioned in finite number

of sets– Report is the subset containing full type – Choose outcome despite remaining

uncertainty

29

Partial Revelation MD

• For very general form of partitions, with no structure on (quasi-linear) outcome space:– “impossible” to impose truthfulness in

Dominant Strategies and Bayes-Nash equilibrium

• Use MiniMax Regret equilibrium concept in Partial Revelation MD

30

Conclusion

• Games with Strict Uncertainty:

– definition

– proposed MiniMax Regret as Rationality concept

– proved Existence of MiniMaxRegret Equilibria

• Applications:

– Partial Revelation MD

– Multi-Attribute Bargaining

– Sequential Strict Automated MD

top related