compressing mental model spaces and modeling human strategic intent

72
Compressing Mental Model Spaces and Modeling Human Strategic Intent

Upload: bianca

Post on 24-Feb-2016

28 views

Category:

Documents


0 download

DESCRIPTION

Compressing Mental Model Spaces and Modeling Human Strategic Intent. Prashant Doshi University of Georgia, USA. http://thinc.cs.uga.edu. Yifeng Zeng Reader, Teesside Univ. Previously: Assoc Prof., Aalborg Univ. Yingke Chen Doctoral student. Muthu Chandrasekaran Doctoral student. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Compressing Mental Model Spaces and

Modeling Human Strategic Intent

Page 2: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Prashant Doshi University of Georgia, USA

Page 3: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

http://thinc.cs.uga.edu

Page 4: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Yingke ChenDoctoral student

Yifeng ZengReader, Teesside Univ.

Previously: Assoc Prof., Aalborg Univ.

Hua MaoDoctoral student

Muthu ChandrasekaranDoctoral student

Page 5: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

What is a mental behavioral model?

Page 6: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

How large is the behavioral model space?

Page 7: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

How large is the behavioral model space?

General definitionA mapping from the agent’s history of

observations to its actions

Page 8: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

How large is the behavioral model space?

2H (Aj)Uncountably infinite

Page 9: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

How large is the behavioral model space?

Let’s assume computable models

Countable

A very large portion of the model space is not computable!

Page 10: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Daniel DennettPhilosopher and Cognitive Scientist

Intentional stanceAscribe beliefs, preferences and intent to explain others’ actions

(analogous to theory of mind - ToM)

Page 11: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Organize the mental models

Intentional modelsSubintentional models

Page 12: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Organize the mental modelsIntentional models

E.g., POMDP = bj, Aj, Tj, j, Oj, Rj, OCj BDI, ToM

Subintentional models

Frame(may give rise to recursive modeling)

Page 13: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Organize the mental modelsIntentional models

E.g., POMDP = bj, Aj, Tj, j, Oj, Rj, OCj BDI, ToM

Subintentional modelsE.g., (Aj), finite state controller, plan

Frame

Page 14: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Finite model space grows as the interaction progresses

Page 15: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Growth in the model space

Other agent may receive any one of |j| observations

|Mj| |Mj||j| |Mj||j|2 ... |Mj||j|t

0 1 2 t

Page 16: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Growth in the model space

Exponential

Page 17: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Absolute continuity condition (ACC)

Page 18: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

ACC1. Subjective distribution over histories2. True distribution over histories

Page 19: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

ACC is a sufficient and necessary condition for Bayesian update of belief over models

Page 20: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

How do we satisfy ACC?

Cautious beliefs (full prior) Grain of truth assumption

Prior with a grain of truth is sufficient but not necessary

Page 21: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

General model space is large and grows exponentially as the interaction progresses

Page 22: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

It would be great if we can compress this space!

No loss in value to the modelerFlexible loss in value for greater compression

LosslessLossy

Page 23: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Expansive usefulness of model space compression to many areas:

1. Sequential decision making (dt-planning) in multiagent settings

2. Bayesian plan recognition3. Games of imperfect information

Page 24: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

1. Sequential decision making in multiagent settings

Interactive POMDP framework (Gmytrasiewicz&Doshi05)

Include models of the other agent in the state spaceUpdate beliefs over the physical state and models

Page 25: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

General and domain-independent approach for compression

Establish equivalence relations that partition the model space and retain representative models from each equivalence class

Page 26: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Approach #1: Behavioral equivalence (Rathanasabapathy et al.06,Pynadath&Marsella07)

Intentional models whose complete solutions are identical are considered equivalent

Page 27: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Approach #1: Behavioral equivalence

Behaviorally minimal set of models

Page 28: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Lossless

Works when intentional models have differing frames

Approach #1: Behavioral equivalence

Page 29: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Multiagent tiger

Approach #1: Behavioral equivalence

Impact on dt-planning in multiagent settings

Multiagent tiger

Multiagent MM

Page 30: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Utilize model solutions (policy trees) for mitigating model growth

Approach #1: Behavioral equivalence

Model reps that are not BE may become BE next step onwards

Preemptively identify such models and do not update all of them

Page 31: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Redefine BE

Approach #2: -Behavioral equivalence(Zeng et al.11,12)

Page 32: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Intentional models whose partial depth-d solutions are identical and vectors of updated beliefs at the leaves of the partial trees

are identical are considered equivalent

Approach #2: Revisit BE(Zeng et al.11,12)

Sufficient but not necessary

Lossless if frames are identical

Page 33: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Approach #2: (,d)-Behavioral equivalence

Two models are (,d)-BE if their partial depth-d solutions are identical and vectors of updated beliefs at the leaves of the

partial trees differ by

Models are(0.33,1)-BE

Lossy

Page 34: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Approach #2: -Behavioral equivalence

Lemma (Boyen&Koller98): KL divergence between two distributions in a discrete Markov stochastic process reduces or remains the same after a transition, with the mixing rate acting as a discount factor

Mixing rate represents the minimal amount by which the posterior distributions agree with each other after one transition

Property of a problem and may be pre-computed

Page 35: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Given the mixing rate and a bound, , on the divergence between two belief vectors, lemma allows computing the depth, d, at which the bound is reached

Approach #2: -Behavioral equivalence

Compare two solutions up to depth d for equality

Page 36: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Discount factor F = 0.5

Multiagent Concert

Approach #2: -Behavioral equivalence

Impact on dt-planning in multiagent settings

Multiagent Concert

On a UAV reconnaissance problem in a 5x5 grid, allows the solution to scale to a 10 step look ahead in 20 minutes

Page 37: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

What is the value of d when some problems exhibit F with a value of 0 or 1?

Approach #2: -Behavioral equivalence

F=1 implies that the KL divergence is 0 after one step: Set d = 1

F=0 implies that the KL divergence does not reduce: Arbitrarily set d to the horizon

Page 38: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Intentional or subintentional models whose predictions at time step t (action distributions)

are identical are considered equivalent at t

Approach #3: Action equivalence(Zeng et al.09,12)

Page 39: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Approach #3: Action equivalence

Page 40: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Lossy

Works when intentional models have differing frames

Approach #3: Action equivalence

Page 41: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Approach #3: Action equivalence

Impact on dt-planning in multiagent settings

Multiagent tigerAE bounds the model space at each time

step to the number of distinct actions

Page 42: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Intentional or subintentional models whose predictions at time step t influence the subject agent’s plan

identically are considered equivalent at t

Regardless of whether the other agent opened the left or right door,the tiger resets thereby affecting the agent’s plan identically

Approach #4: Influence equivalence(related to Witwicki&Durfee11)

Page 43: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Influence may be measured as the change in the subject agent’s belief due to the action

Approach #4: Influence equivalence

Group more models at time step t compared to AE

Lossy

Page 44: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Compression due to approximate equivalence may violate ACC

Regain ACC by appending a covering model to the compressed set of representatives

Page 45: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Open questions

Page 46: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

N > 2 agents

Under what conditions could equivalent models belonging to different agents be

grouped together into an equivalence class?

Page 47: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Can we avoid solving models by using heuristics for identifying approximately

equivalent models?

Page 48: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Modeling Strategic Human Intent

Page 49: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Yifeng ZengReader, Teesside Univ.

Previously: Assoc Prof., Aalborg Univ.

Yingke ChenDoctoral student

Hua MaoDoctoral student

Muthu ChandrasekaranDoctoral student

Xia QuDoctoral student

Roi CerenDoctoral student

Matthew MeiselDoctoral student

Adam GoodieProfessor of Psychology, UGA

Page 50: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Computational modeling of human recursive thinking in sequential games

Computational modeling of probability judgment in stochastic games

Page 51: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Human strategic reasoning is generally hobbled by low levels of recursive thinking

(Stahl&Wilson95,Hedden&Zhang02,Camerer et al.04,Ficici&Pfeffer08)

(I think what you think that I think...)

Page 52: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

You are Player I and II is human. Will you move or stay?

Move MoveMove

Stay Stay Stay

Payoff for I:Payoff for II:

31

13

24

42

I II I IIPlayer to move:

Page 53: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Less than 40% of the sample population performed the rational action!

Page 54: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Thinking about how others think (...) is hard in general contexts

Page 55: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Move MoveMove

Stay Stay Stay

Payoff for I:

(Payoff for II is 1 – decimal)

0.6 0.4 0.2 0.8

I II I IIPlayer to move:

Page 56: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

About 70% of the sample population performed the rational action in this simpler and strictly competitive game

Page 57: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Simplicity, competitiveness and embedding the task in intuitive representations seem to facilitate

human reasoning (Flobbe et al.08, Meijering et al.11, Goodie et al.12)

Page 58: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

3-stage game

Myopic opponents default to staying (level 0) while predictive opponents think about the player’s

decision (level 1)

Page 59: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Can we computationally model these strategic behaviors using process models?

Page 60: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Yes! Using a parameterized Interactive POMDP framework

Page 61: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Replace I-POMDP’s normative Bayesian belief update with Bayesian learning that underweights evidence, parameterized by

Notice that the achievement score increases as more games are played indicating learning of the opponent modelsLearning is slow and partial

Page 62: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Replace I-POMDP’s normative expected utility maximization with quantal response model that selects actions proportional to their utilities, parameterized by

Notice the presence of rationality errors in the participants’ choices (action is inconsistent with prediction) Errors appear to reduce with time

Page 63: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Underweighting evidence during learning and quantal response for

choice have prior psychological support

Page 64: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Use participants’ predictions of other’s action to learn and participants’ actions to learn

Page 65: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Use participants’ actions to learn both and Let vary linearly

Page 66: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Insights revealed by process modeling:1. Much evidence that participants did not make rote use of BI, instead

engaged in recursive thinking2. Rationality errors cannot be ignored when modeling human decision

making and they may vary3. Evidence that participants’ could be attributing surprising

observations of others’ actions to their rationality errors

Page 67: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Open questions:1. What is the impact on strategic thinking if action outcomes

are uncertain?2. Is there a damping effect on reasoning levels if participants

need to concomitantly think ahead in time

Page 68: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Suite of general and domain-independent approaches for compressing agent model

spaces based on equivalence

Computational modeling of human behavioral data pertaining to strategic thinking

Page 69: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

Thank you for your time

Page 70: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

2. Bayesian plan recognition under uncertainty

Plan recognition literature has paid scant attention to finding general ways of reducing the set of feasible

plans (Carberry, 01)

Page 71: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

3. Games of imperfect information (Bayesian games)

Real-world applications often involve many player types Examples• Ad hoc coordination in a spontaneous team• Automated Poker player agent

Page 72: Compressing Mental Model Spaces  and Modeling Human Strategic Intent

3. Games of imperfect information (Bayesian games)

Real-world applications often involve many player types

Model space compression facilitates equilibrium computation