efficient multiple-click models in web search

39
Fan Guo 1 , Chao Liu 2 and Yi-Min Wang 2 1 Carnegie Mellon University 2 Microsoft Research Feb 11, 2009

Upload: darius

Post on 06-Jan-2016

36 views

Category:

Documents


0 download

DESCRIPTION

Efficient Multiple-Click Models in Web Search. Fan Guo 1 , Chao Liu 2 and Yi-Min Wang 2 1 Carnegie Mellon University 2 Microsoft Research Feb 11, 2009. Overview. Web Search Activity. Search Engine Presentation. User Interaction Logs. Models for Leveraging User Feedbacks. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Efficient Multiple-Click Models in Web Search

Fan Guo1, Chao Liu2 and Yi-Min Wang2

1 Carnegie Mellon University2 Microsoft Research

Feb 11, 2009

Page 2: Efficient Multiple-Click Models in Web Search

Overview

Web Search Activity

User Interaction

Logs

Search Engine

Presentation

Models for Leveraging

User Feedbacks

Page 3: Efficient Multiple-Click Models in Web Search

Overview

Web Search Activity

User Interaction

Logs

Search Engine

Presentation

Models for Leveraging

User Feedbacks

Logging

Page 4: Efficient Multiple-Click Models in Web Search

User Implicit FeedbacksA very rich set of candidate features (Agichtein et al., SIGIR’06)

Page 5: Efficient Multiple-Click Models in Web Search

User Implicit FeedbacksPractical consideration

Recording effort (how much tweaking?)Noise elimination (e.g., dwelling time?)Would the answer change after scaling up to

10x? 100x?

Page 6: Efficient Multiple-Click Models in Web Search

Click LogsQuery term

cmuDocument URL

www.cmu.edu Position/Rank

1Clicked or Not

Yes

Query termcmu

Document URLwww.cmu.edu

Position/Rank1

Clicked or NotYes

Query termcmu

Document URLwww.cmu.edu

Position/Rank1

Clicked or NotYes

Query termcmu

Document URLwww.ri.cmu.ed

u Position/Rank

10Clicked or Not

No

Page 7: Efficient Multiple-Click Models in Web Search

Click LogsQuery term

cmuDocument URL

www.cmu.edu Position/Rank

1Clicked or Not

Yes

Time stamp01/19/2009,

4:00:01pmSession ID

...User ID

...

Query termcmu

Document URLwww.cmu.edu

Position/Rank1

Clicked or NotYes

Query termcmu

Document URLwww.cmu.edu

Position/Rank1

Clicked or NotYes

Query termcmu

Document URLwww.ri.cmu.ed

u Position/Rank

M = 10Clicked or Not

No

Page 8: Efficient Multiple-Click Models in Web Search

A Snapshot

Page 9: Efficient Multiple-Click Models in Web Search

Overview

Web Search Activity

User Interaction

Logs

Search Engine

Presentation

Models for Leveraging

User Feedbacks

Cleanup

Page 10: Efficient Multiple-Click Models in Web Search

Overview

Web Search Activity

User Interaction

Logs

Search Engine

Presentation

Models for Leveraging

User Feedbacks

Modeling

Page 11: Efficient Multiple-Click Models in Web Search

Clicks are biased

Images are copied from (Joachims et al., 2007) in ACM TOIS

Page 12: Efficient Multiple-Click Models in Web Search

Examination HypothesisA web document must be examined before

clicked.

Relevance is defined as the conditional probability of click upon examination.

P(Click) = P(Examination) * Relevance, formulated in (Richardson et al., WWW’07)

Page 13: Efficient Multiple-Click Models in Web Search

Most Simple Click ModelVariable Definition

Ei: binary r.v. for examination on position i;

Ci: binary r.v. for click on position i;

rdi: relevance for document di on position i.

Page 14: Efficient Multiple-Click Models in Web Search

Most Simple Click ModelIndependent Click Model

P(Ei = 1) = 1

P(Ci=1) = rdi

Therefore we have

Page 15: Efficient Multiple-Click Models in Web Search

Cascade HypothesisProposed in (Craswell et al., WSDM’08)Strict Linear order of examination and click

Ei = 0 => Ei+1 = 0

Cascade Model

Page 16: Efficient Multiple-Click Models in Web Search

Cascade ModelModeling the first click (Craswell et al., WSDM’08)

P(Ci=1|Ei=0) = 0, P(Ci=1|Ei=1) = rdi

P(E1=1) =1, P(Ei+1=1|Ei=0) = 0

P(Ei+1=1|Ei=1, Ci=0) = 1

P(Ei+1=1|Ei=1, Ci=1) = ?

examination hypothesiscascade hypothesis

modeling a single click

Page 17: Efficient Multiple-Click Models in Web Search

Dependent Click ModelModeling multiple clicks

P(Ci=1|Ei=0) = 0, P(Ci=1|Ei=1) = rdi

P(E1=1) =1, P(Ei+1=1|Ei=0) = 0

P(Ei+1=1|Ei=1, Ci=0) = 1

P(Ei+1=1|Ei=1, Ci=1) = λi

examination hypothesiscascade hypothesis

modeling 1+ click(s)

Page 18: Efficient Multiple-Click Models in Web Search

Dependent Click Modelλi (1 ≤ i < 10) are global user behavior

parameters.The full picture

Page 19: Efficient Multiple-Click Models in Web Search

DCM AlgorithmsTwo sets of unknown values for estimation

Query-specific: document relevance rGlobal: user behavior parameter λ

Approximate Maximum-Likelihood LearningMaximizing a lower bound of log-likelihoodWould be exact if the chain length is infinite

Page 20: Efficient Multiple-Click Models in Web Search

Estimation Formulae

empirical CTR measured before last clicked position

empirical probability of “clicked-but-not-last”

DCM Algorithms

Page 21: Efficient Multiple-Click Models in Web Search

DCM ImplementationKeep 3 counts for each query-document pair

Then

Page 22: Efficient Multiple-Click Models in Web Search

Overview

Web Search Activity

User Interaction

Logs

Search Engine

Presentation

Models for Leveraging

User Feedbacks

Application

Page 23: Efficient Multiple-Click Models in Web Search

Click Model ApplicationsUser-Perceived Relevance

A good candidate feature for ranking, but…As input for existing ranking algorithms v.s. human judgment

Page 24: Efficient Multiple-Click Models in Web Search

Click Model ApplicationsSearch Engline Evaluation

Summary of the overall user experienceExpected clickthrough rate

Browsing Behavior Statistics

Application in Sponsored SearchBuilding block for auction/pricing mechanism(Kempe and Mahdian, WINE 2008) (Aggarwal et al., WINE

2008)

(Guo et al., WSCD’09)

Page 25: Efficient Multiple-Click Models in Web Search
Page 26: Efficient Multiple-Click Models in Web Search

Click Data SetCollected in 2 weeks in July 2008.Discard query sessions with no clicks.178 most frequent queries removed.

Page 27: Efficient Multiple-Click Models in Web Search

Click Data SetAfter preprocessing:

110,630 distinct queries4.8M/4.0M query sessions in the training/test

set

Training Time: ~7 min

Page 28: Efficient Multiple-Click Models in Web Search

Evaluation CriteriaTest Data Log-likelihood

Given the document impression in the test setCompute the chance to recover the entire click

vectorAveraged over different query sessions

Page 29: Efficient Multiple-Click Models in Web Search

Test Data Log-LikelihoodICM: -1.401, DCM: -1.327 (26.5% chance)

Page 30: Efficient Multiple-Click Models in Web Search

Evaluation CriteriaPredicting First/Last Clicks

Given the document impression in the test setDraw 100 click vector samples (with 1+ clicks)Compute the corresponding RMS error

Page 31: Efficient Multiple-Click Models in Web Search

First/Last Clicked Position

Page 32: Efficient Multiple-Click Models in Web Search

Examination/Click Distribution

Page 33: Efficient Multiple-Click Models in Web Search

Difference by User Goals

From (Guo et al., WSCD’09)on a different data set

Page 34: Efficient Multiple-Click Models in Web Search

An AlternativeUser Browsing Model (Dupret et al., SIGIR’08)Examine probability depends on

Preceding clicked positionDistance to preceding clicked position

Comparing with DCM:Allow jump in examination (no longer strictly

linear)Parameter set is an order of magnitude largerMore expensive algorithms (iterative, EM-like)

Page 35: Efficient Multiple-Click Models in Web Search

ConclusionClick Models:

A principled way of integrating user click dataAll current models are based on examination

hypothesisShould be scalable and incremental

DCM:built upon cascade model, for multiple-clicksintroduce a set of position-dependent to

characterize the examine-next probabilityprovides a fast, simple, yet effective solution

Page 36: Efficient Multiple-Click Models in Web Search

Open ProblemsTrade-off in effectiveness/efficiency

Evaluation (test bed and test metric)

Click-model relevance v.s. human-judged relevance

Click model for universal search?

Page 37: Efficient Multiple-Click Models in Web Search

Acknowledgement

Chao Liu Yi-Min Wang

Ethan Tu Li-Wei He Nick Craswell

Page 38: Efficient Multiple-Click Models in Web Search

Acknowledgement

Christos Faloutsos Lei Li

Page 39: Efficient Multiple-Click Models in Web Search

Thank you!