mutual information scheduling for ranking

27
Mutual Information Scheduling for Ranking Hamza Aftab Nevin Raj Paul Cuff Sanjeev Kulkarni Adam Finkelstein 1

Upload: nanda

Post on 22-Jan-2016

58 views

Category:

Documents


0 download

DESCRIPTION

Mutual Information Scheduling for Ranking. Hamza Aftab Nevin Raj Paul Cuff Sanjeev Kulkarni Adam Finkelstein. Applications of Ranking. Pair-wise Comparisons. Query: A > B ? Ask a voter whether candidate I is better than candidate J Observe the outcome of a match. Scheduling. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Mutual Information Scheduling for Ranking

Mutual Information Scheduling for Ranking

Hamza Aftab

Nevin Raj

Paul Cuff

Sanjeev Kulkarni

Adam Finkelstein

1

Page 2: Mutual Information Scheduling for Ranking

Applications of Ranking

2

Page 3: Mutual Information Scheduling for Ranking

Pair-wise Comparisons

3

Query:A > B ?Ask a voter whether candidate I is better

than candidate JObserve the outcome of a match

Page 4: Mutual Information Scheduling for Ranking

Scheduling

4

Design queries dynamically, based on past observations.

Page 5: Mutual Information Scheduling for Ranking

Example: Kitten Wars

5

Page 6: Mutual Information Scheduling for Ranking

Example: All Our Ideas(Matthew Salganik – Princeton)

6

Page 7: Mutual Information Scheduling for Ranking

Select Informative MatchesAssume matches are expensive but computation is

cheap

Previous Work (Finkelstein)Use Ranking Algorithm to make better use of informationSelect matches by giving priority based on two criterion

Lack of information: Has a team been in a lot of matches already?Comparability of the match: Are the two teams roughly equal in

strength?

Our innovationSelect matches based on Shannon’s mutual information

7

Page 8: Mutual Information Scheduling for Ranking

Related WorkSensor Management (tracking)

Information-Driven [Manyika, Durrant-Whyte 1994][Zhao et. al. 2002] – Bayesian filtering[Aoki et. al. 2011] – This session

Learning Network Topology[Hayek, Spuckler 2010]

Noisy Sort

8

Page 9: Mutual Information Scheduling for Ranking

Ranking Algorithms – Linear ModelEach player has a skill level µThe probability that Player I beats Player J is a

function of the difference µi - µj

TransitiveUse Maximum Likelihood

Thurstone-Mosteller ModelQ function

Performance has Gaussian distribution about the mean µ

Bradley-Terry ModelLogistic function

9

Page 10: Mutual Information Scheduling for Ranking

ExamplesElo’s chess ranking system

Based on Bradley-Terry modelSagarin’s sports rankings

10

Page 11: Mutual Information Scheduling for Ranking

Mutual Information

11

Mutual Information:

Conditional Mutual information

Page 12: Mutual Information Scheduling for Ranking

Entropy

12

Entropy:

Conditional Entropy

High entropy Low entropy

Page 13: Mutual Information Scheduling for Ranking

Mutual Information Scheduling

Let R be the information we wish to learn(i.e. ranking or skill levels)

Let Ok be the outcome of the kth match

At time k, scheduler chooses the pair (ik+1, jk+1):

13

Page 14: Mutual Information Scheduling for Ranking

Why use Mutual Information?Additive Property

Fano’s InequalityRelated entropy to probability of errorFor small error:

Continuous distributions: MSE bounds differential entropy

14

Page 15: Mutual Information Scheduling for Ranking

Greedy is Not Optimal

15

Consider Huffman codes---Greedy is not optimal

Page 16: Mutual Information Scheduling for Ranking

Performance (MSE)

16

Page 17: Mutual Information Scheduling for Ranking

Performance (Gambling Penalty)

17

Page 18: Mutual Information Scheduling for Ranking

Identify correct ranking

18

Page 19: Mutual Information Scheduling for Ranking

Find strongest player

19

Page 20: Mutual Information Scheduling for Ranking

Find strongest player

20

Page 21: Mutual Information Scheduling for Ranking

Evaluating Goodness-of-Fit

21

Ranking: Inversions

Skill Level Estimates:Mean squared error (MSE)Kullback-Leibler (KL) divergence

(relative entropy)Others

Betting riskSampling inconsistency

1234

1

4

Xx xq

xpxpqpD

)(

)(log)()||(

32

Page 22: Mutual Information Scheduling for Ranking

Numerical Techniques

Calculate mutual informationImportance samplingConvex Optimization (tracking of ML

estimate)

Page 23: Mutual Information Scheduling for Ranking

Summary of Main Idea

Get the most out of measurements for estimating a ranking

Schedule each match to maximize(Greedy, to make the computation tractable)Flexible

S is any parameter of interest, discrete or continuous(skill levels; best candidate; etc.)

Simple design---competes well with other heuristics

Page 24: Mutual Information Scheduling for Ranking

Ranking Based on Pair-wise ComparisonsBradley Terry Model:

Examples:A hockey team scores Poisson- goals in a

gameTwo cities compete to have the tallest person

is the population

Page 25: Mutual Information Scheduling for Ranking

Computing Mutual Information

25

Importance Sampling:Multidimensional integralProbability distributionsSkill level estimates

• Why is it good for estimating skill levels?– Faster than convex optimization– Efficient memory use

Skill level of player 1

Ski

ll le

vel o

f pl

ayer

2

Page 26: Mutual Information Scheduling for Ranking

0 100 200 300 400 5000

0.5

1

1.5

2

2.5

3

3.5

4

4.5

Number of games

Ave

rage

num

ber

of in

vers

ions

ELOTrueSkillRandom SchedulingMinGames/ClosestSkillMutual InformationGraph Based

Results

26

(for a 10 player tournament and100 experiments)

220 230 240 250 260 270 2800

0.1

0.2

0.3

0.4

0.5

Number of games

Aver

age

num

ber o

f inv

ersi

ons

20 30 40 50 60 70

0.3

0.4

0.5

0.6

0.7

Number of games

Ave

rage

num

ber

of in

vers

ions

Page 27: Mutual Information Scheduling for Ranking

Visualizing the Algorithm

27

Player A B C D

A 0 2 3 3

B 0 0 7 2

C 0 2 0 5

D 1 2 2 0

Player A B C D

A 0 0.031 0.025 0.024

B 0.031 0 0.023 0.033

C 0.025 0.023 0 0.030

D 0.024 0.033 0.030 0

A B

C D

?

Outcomes

Scheduling