-rum - rensselaer polytechnic institutehomepages.rpi.edu/~zhaoz6/docs/mixrum_poster.pdf · k-rum...

L EARNING M IXTURES OF R ANDOM U TILITY M ODELS Zhibing Zhao, Tristan Villamil and Lirong Xia Computer Science Department, Rensselaer Polytechnic Institute R ANK A GGREGATION • Alternatives: {Donald Trump, Hillary Clinton, Gary Johnson}. • Mechanism: plurality rule. • Decision: Donald Trump. R ANKING M ODELS • Set of alternatives: A = {a 1 ,a 2 ,...,a m }. • Parameter space: Θ. • Sample space: i.i.d. rankings over A. • Distributions: {Pr θ : θ ∈ Θ}. R ANDOM U TILITY M ODELS (RUM S ) • Sample a random utility for each alternative independently. • Rank the alternatives w.r.t. these utilities. • The Plackett-Luce model: all utility distributions are Gumbel distributions. M IXTURES OF R ANDOM U TILITY M ODELS ( k -RUM S ) • Parameter: mixing coefficients (α 1 ,α 2 ,...,α k ); RUM components: RUM 1 , RUM 2 , ..., RUM k . • Sample a component r w.r.t. mixing coefficients. • Sample a ranking from component r . M OTIVATION :M ODEL F ITNESS ON P REFLIB D ATA Given the number of parameters d, the number of rankings n and the likelihood of the estimate L: • AIC =2d - 2 ln(L). • AICc = AIC + 2d(d+1) n-d-1 . • BIC = d ln(n) - 2 ln(L). Observation: k -RUM k -PL RUM PL, where A B means that the number of datasets where A beats B is more than that where B beats A. T HEORETICAL R ESULTS :I DENTIFIABILITY Def: For all ~ θ 1 , ~ θ 2 ∈ Θ, Pr ~ θ 1 = Pr ~ θ 1 ⇒ ~ θ 1 = ~ θ 2 . Theorem 1. Let M be any symmetric RUM from the location family. When m ≤ 2k - 1, k -RUM M over m alternatives is non-identifiable. Figure 1: The label switching problem Theorem 2. For any RUM M where all utility distributions have support (-∞, ∞), when m ≥ max{4k - 2, 6}, k -RUM M over m alternatives is generically identifiable. F IRST A LGORITHMS TO L EARN k -RUM Generalizd Method of Moments (GMM). 1. Choose q events: all pairwise comparisons and selected triple-wise comparisons. 2. Compute the empirical probabilities b 1 ,...,b q . 3. Find the parameter that minimize ∑ q i=1 (b i - p i ( ~ θ )) 2 , where p i ( ~ θ ) is the probability computed from the model. E-GMM. 1. E-step: compute the probabilities that a ranking belongs to each component. 2. M-step: compute the parameter of each component using the GMM algorithm by Azari Soufiani (2014). The Sandwich Algorithm (GMM-E-GMM). 1. Run GMM for a good starting point. 2. Run E-GMM to improve accuracy. Figure 2: The sandwich algorithm E XPERIMENTS Figure 3: 2-RUM over 6 alternatives Figure 4: 4-RUM over 15 alternatives C OMPARISON WITH k -PL S k -PLs k -RUMs Closed-form likelihood? Yes No Proving identifiability? Hard Harder R EFERENCES Hossein Azari Soufiani, David C. Parkes, and Lirong Xia, “Computing Parametric Ranking Models via Rank-Breaking". ICML-14. Zhibing Zhao, Peter Piech, and Lirong Xia, “Learning Mixtures of Plackett-Luce Models". ICML-16

Upload: others

Post on 27-Jun-2020

2 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: -RUM - Rensselaer Polytechnic Institutehomepages.rpi.edu/~zhaoz6/docs/mixRUM_poster.pdf · k-RUM ˜k-PL ˜RUM ˜PL; where A ˜ B means that the number of datasets where A beats B

LEARNING MIXTURES OF RANDOM UTILITY MODELSZhibing Zhao, Tristan Villamil and Lirong Xia

Computer Science Department, Rensselaer Polytechnic Institute

RANK AGGREGATION

• Alternatives: {Donald Trump, Hillary Clinton,Gary Johnson}.

• Mechanism: plurality rule.• Decision: Donald Trump.

RANKING MODELS

• Set of alternatives: A = {a1, a2, . . . , am}.• Parameter space: Θ.• Sample space: i.i.d. rankings over A.• Distributions: {Prθ : θ ∈ Θ}.

RANDOM UTILITY MODELS (RUMS)• Sample a random utility for each alternative independently.• Rank the alternatives w.r.t. these utilities.• The Plackett-Luce model: all utility distributions are Gumbel distributions.

MIXTURES OF RANDOM UTILITY MODELS (k-RUMS)• Parameter: mixing coefficients (α1, α2, . . . , αk); RUM components: RUM1, RUM2, . . ., RUMk.• Sample a component r w.r.t. mixing coefficients.• Sample a ranking from component r.

MOTIVATION: MODEL FITNESS ON PREFLIB DATA

Given the number of parameters d, the number ofrankings n and the likelihood of the estimate L:• AIC = 2d− 2 ln(L).• AICc = AIC + 2d(d+1)

n−d−1 .• BIC = d ln(n)− 2 ln(L).

Observation:

k-RUM � k-PL � RUM � PL,

where A � B means that the number of datasetswhere A beats B is more than that where B beats A.

THEORETICAL RESULTS: IDENTIFIABILITY

Def: For all ~θ1, ~θ2 ∈ Θ, Pr~θ1 = Pr~θ1 ⇒~θ1 = ~θ2.

Theorem 1. Let M be any symmetric RUM from thelocation family. When m ≤ 2k − 1, k-RUMM over malternatives is non-identifiable.

Figure 1: The label switching problem

Theorem 2. For any RUMM where all utility distributions have support (−∞,∞), when m ≥ max{4k − 2, 6},k-RUMM over m alternatives is generically identifiable.

FIRST ALGORITHMS TO LEARN k-RUMGeneralizd Method of Moments (GMM).

1. Choose q events: all pairwise comparisons andselected triple-wise comparisons.

2. Compute the empirical probabilities b1, . . . , bq .3. Find the parameter that minimize

∑qi=1(bi −

pi(~θ))2, where pi(~θ) is the probability computed

from the model.

E-GMM.

1. E-step: compute the probabilities that a rankingbelongs to each component.

2. M-step: compute the parameter of each compo-nent using the GMM algorithm by Azari Soufiani(2014).

The Sandwich Algorithm (GMM-E-GMM).

1. Run GMM for a good starting point.2. Run E-GMM to improve accuracy.

Figure 2: The sandwich algorithm

EXPERIMENTS

Figure 3: 2-RUM over 6 alternatives Figure 4: 4-RUM over 15 alternatives

COMPARISON WITH k-PLS

k-PLs k-RUMsClosed-form likelihood? Yes NoProving identifiability? Hard Harder

REFERENCES

Hossein Azari Soufiani, David C. Parkes, and LirongXia, “Computing Parametric Ranking Models viaRank-Breaking". ICML-14.Zhibing Zhao, Peter Piech, and Lirong Xia, “LearningMixtures of Plackett-Luce Models". ICML-16

Corporate Venture Capital - Rensselaer Polytechnic Institutehomepages.rpi.edu/~oconng/CorpEntrepreneurship/Model 2/Corporate... · Corporate Venture Capital Managing Equity Investments

Tropical Impairments - Gypsy Poolside CafeTropical Impairments Wines Beers 9 Light rum, da rk rum, tropical schnapps, ﬂavored brandy with fruit juices. 9 Light rum, coconut rum,

Rum Kasato

Dc rum 12.4 why add dc rum to app mon slideshare

Rum Jungle

Pala Casino Spa & Resort€¦ · Bacardi Rum, Rumhaven Coconut Rum, Myers Dark Rum Pineapple and Orange Juice 11.00 Mai Tai Myers's Dark Rum, Bacardi Light Rum Crème de Almond and

Le kä 37se fer Rum stilzpel chen Le kä 37se fer mal ler€¦ · Rum stilzpel chen Rum stilzpel chen Rum stilzpel chen Rum stilzpel chen Noch ein woll der Kö mehr Gold. Die Mül

Prezentacija - Rum

Rum Making

Menu Drink Side€¦ · Blended Light Rum, Spiced Rum, and Strawberry Mix. Topped with a Dark Rum Floater. MIAMI VICE $15 $15 Blended Coconut Rum, Spiced Rum, and Pina Colada Mix

liaions - champps.com...Bacardi Superior Rum, Cruzan Rum and Captain Morgan Rum, pineapple and orange juice, topped with Myers’s Dark Rum // 11.00 SPIKED STRAWBERRY LEMONADE Skyy

Revolutionary Rum