-rum - rensselaer polytechnic institutehomepages.rpi.edu/~zhaoz6/docs/mixrum_poster.pdf · k-rum...

1
L EARNING M IXTURES OF R ANDOM U TILITY M ODELS Zhibing Zhao, Tristan Villamil and Lirong Xia Computer Science Department, Rensselaer Polytechnic Institute R ANK A GGREGATION Alternatives: {Donald Trump, Hillary Clinton, Gary Johnson}. Mechanism: plurality rule. Decision: Donald Trump. R ANKING M ODELS Set of alternatives: A = {a 1 ,a 2 ,...,a m }. Parameter space: Θ. Sample space: i.i.d. rankings over A. Distributions: {Pr θ : θ Θ}. R ANDOM U TILITY M ODELS (RUM S ) Sample a random utility for each alternative independently. Rank the alternatives w.r.t. these utilities. The Plackett-Luce model: all utility distributions are Gumbel distributions. M IXTURES OF R ANDOM U TILITY M ODELS ( k -RUM S ) Parameter: mixing coefficients (α 1 2 ,...,α k ); RUM components: RUM 1 , RUM 2 , ..., RUM k . Sample a component r w.r.t. mixing coefficients. Sample a ranking from component r . M OTIVATION :M ODEL F ITNESS ON P REFLIB D ATA Given the number of parameters d, the number of rankings n and the likelihood of the estimate L: AIC =2d - 2 ln(L). AICc = AIC + 2d(d+1) n-d-1 . BIC = d ln(n) - 2 ln(L). Observation: k -RUM k -PL RUM PL, where A B means that the number of datasets where A beats B is more than that where B beats A. T HEORETICAL R ESULTS :I DENTIFIABILITY Def: For all ~ θ 1 , ~ θ 2 Θ, Pr ~ θ 1 = Pr ~ θ 1 ~ θ 1 = ~ θ 2 . Theorem 1. Let M be any symmetric RUM from the location family. When m 2k - 1, k -RUM M over m alternatives is non-identifiable. Figure 1: The label switching problem Theorem 2. For any RUM M where all utility distributions have support (-∞, ), when m max{4k - 2, 6}, k -RUM M over m alternatives is generically identifiable. F IRST A LGORITHMS TO L EARN k -RUM Generalizd Method of Moments (GMM). 1. Choose q events: all pairwise comparisons and selected triple-wise comparisons. 2. Compute the empirical probabilities b 1 ,...,b q . 3. Find the parameter that minimize q i=1 (b i - p i ( ~ θ )) 2 , where p i ( ~ θ ) is the probability computed from the model. E-GMM. 1. E-step: compute the probabilities that a ranking belongs to each component. 2. M-step: compute the parameter of each compo- nent using the GMM algorithm by Azari Soufiani (2014). The Sandwich Algorithm (GMM-E-GMM). 1. Run GMM for a good starting point. 2. Run E-GMM to improve accuracy. Figure 2: The sandwich algorithm E XPERIMENTS Figure 3: 2-RUM over 6 alternatives Figure 4: 4-RUM over 15 alternatives C OMPARISON WITH k -PL S k -PLs k -RUMs Closed-form likelihood? Yes No Proving identifiability? Hard Harder R EFERENCES Hossein Azari Soufiani, David C. Parkes, and Lirong Xia, “Computing Parametric Ranking Models via Rank-Breaking". ICML-14. Zhibing Zhao, Peter Piech, and Lirong Xia, “Learning Mixtures of Plackett-Luce Models". ICML-16

Upload: others

Post on 27-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: -RUM - Rensselaer Polytechnic Institutehomepages.rpi.edu/~zhaoz6/docs/mixRUM_poster.pdf · k-RUM ˜k-PL ˜RUM ˜PL; where A ˜ B means that the number of datasets where A beats B

LEARNING MIXTURES OF RANDOM UTILITY MODELSZhibing Zhao, Tristan Villamil and Lirong Xia

Computer Science Department, Rensselaer Polytechnic Institute

RANK AGGREGATION

• Alternatives: {Donald Trump, Hillary Clinton,Gary Johnson}.

• Mechanism: plurality rule.• Decision: Donald Trump.

RANKING MODELS

• Set of alternatives: A = {a1, a2, . . . , am}.• Parameter space: Θ.• Sample space: i.i.d. rankings over A.• Distributions: {Prθ : θ ∈ Θ}.

RANDOM UTILITY MODELS (RUMS)• Sample a random utility for each alternative independently.• Rank the alternatives w.r.t. these utilities.• The Plackett-Luce model: all utility distributions are Gumbel distributions.

MIXTURES OF RANDOM UTILITY MODELS (k-RUMS)• Parameter: mixing coefficients (α1, α2, . . . , αk); RUM components: RUM1, RUM2, . . ., RUMk.• Sample a component r w.r.t. mixing coefficients.• Sample a ranking from component r.

MOTIVATION: MODEL FITNESS ON PREFLIB DATA

Given the number of parameters d, the number ofrankings n and the likelihood of the estimate L:• AIC = 2d− 2 ln(L).• AICc = AIC + 2d(d+1)

n−d−1 .• BIC = d ln(n)− 2 ln(L).

Observation:

k-RUM � k-PL � RUM � PL,

where A � B means that the number of datasetswhere A beats B is more than that where B beats A.

THEORETICAL RESULTS: IDENTIFIABILITY

Def: For all ~θ1, ~θ2 ∈ Θ, Pr~θ1 = Pr~θ1 ⇒~θ1 = ~θ2.

Theorem 1. Let M be any symmetric RUM from thelocation family. When m ≤ 2k − 1, k-RUMM over malternatives is non-identifiable.

Figure 1: The label switching problem

Theorem 2. For any RUMM where all utility distributions have support (−∞,∞), when m ≥ max{4k − 2, 6},k-RUMM over m alternatives is generically identifiable.

FIRST ALGORITHMS TO LEARN k-RUMGeneralizd Method of Moments (GMM).

1. Choose q events: all pairwise comparisons andselected triple-wise comparisons.

2. Compute the empirical probabilities b1, . . . , bq .3. Find the parameter that minimize

∑qi=1(bi −

pi(~θ))2, where pi(~θ) is the probability computed

from the model.

E-GMM.

1. E-step: compute the probabilities that a rankingbelongs to each component.

2. M-step: compute the parameter of each compo-nent using the GMM algorithm by Azari Soufiani(2014).

The Sandwich Algorithm (GMM-E-GMM).

1. Run GMM for a good starting point.2. Run E-GMM to improve accuracy.

Figure 2: The sandwich algorithm

EXPERIMENTS

Figure 3: 2-RUM over 6 alternatives Figure 4: 4-RUM over 15 alternatives

COMPARISON WITH k-PLS

k-PLs k-RUMsClosed-form likelihood? Yes NoProving identifiability? Hard Harder

REFERENCES

Hossein Azari Soufiani, David C. Parkes, and LirongXia, “Computing Parametric Ranking Models viaRank-Breaking". ICML-14.Zhibing Zhao, Peter Piech, and Lirong Xia, “LearningMixtures of Plackett-Luce Models". ICML-16