wscd09 workshop on web search click data 2009

37
WSCD09 Workshop on Web Search Click Data 2009 Survey and evaluation of query intent detection methods David J. Brenes, Daniel Gayo Avello, Kilian Pérez-González

Upload: dolan

Post on 25-Feb-2016

28 views

Category:

Documents


0 download

DESCRIPTION

WSCD09 Workshop on Web Search Click Data 2009. Survey and evaluation of query intent detection methods David J. Brenes, Daniel Gayo Avello , Kilian Pérez-González . phenoxyphenyl entertainment tonight jewish christmas mortgage calculator eoc phi famington , new mexico - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: WSCD09 Workshop on Web Search Click Data 2009

WSCD09Workshop on WebSearch Click Data 2009

Survey and evaluation of query intent detection methodsDavid J. Brenes, Daniel Gayo Avello, Kilian Pérez-González

Page 2: WSCD09 Workshop on Web Search Click Data 2009
Page 3: WSCD09 Workshop on Web Search Click Data 2009

phenoxyphenylentertainment tonight

jewish christmasmortgage calculator

eoc phifamington, new mexico"oneida indian nation"

"oneida, ny" and "employment"sim city 4 cheatsbenihana of novitotal body kcferenc molnar

cucumber shampoo molton brownwho likes grinding in wow?

solitude utah skinick nilsson training in orlando

homunculi tattoo"robert louis stevenson"+quotesww ii military installations

south carolinabogen 3016

sch 1830v samsungwww.brookdaleliving.comair temperature gaugeold calvary church

steelers playoff game

Page 4: WSCD09 Workshop on Web Search Click Data 2009
Page 5: WSCD09 Workshop on Web Search Click Data 2009

a query log (pretty raw, huh?)

one query(still raw)

Page 6: WSCD09 Workshop on Web Search Click Data 2009
Page 7: WSCD09 Workshop on Web Search Click Data 2009

apple

Page 8: WSCD09 Workshop on Web Search Click Data 2009

apple

Page 9: WSCD09 Workshop on Web Search Click Data 2009

reading the user’s mind. i.e. inferring query intent.

Page 10: WSCD09 Workshop on Web Search Click Data 2009

Intent is just one of the

queries dimensions.

Page 11: WSCD09 Workshop on Web Search Click Data 2009

st. thomas bed and breakfast

Page 12: WSCD09 Workshop on Web Search Click Data 2009

intent (transactional?)

st. thomas bed and breakfast

Page 13: WSCD09 Workshop on Web Search Click Data 2009

intent (transactional?)

topic (shopping/travel/lodging?)st. thomas bed and breakfast

Page 14: WSCD09 Workshop on Web Search Click Data 2009

intent (transactional?)geographical location (Virgin Islands?)

st. thomas bed and breakfast

topic (shopping/travel/lodging?)

Page 15: WSCD09 Workshop on Web Search Click Data 2009

Intent is just one of the

queries dimensions.query topic

geographical locationcommercial/product/job seeking

...human vs ‘robot’ queries

Page 16: WSCD09 Workshop on Web Search Click Data 2009

KDD Cup 2005LogCLEF 2009

WSCD’09

Page 17: WSCD09 Workshop on Web Search Click Data 2009

KDD Cup 2005LogCLEF 2009

WSCD’09

query topic classificationlog analysis and geographic query identificationuser goalshumans vs robotstopic analysis of queriesetc.

Page 18: WSCD09 Workshop on Web Search Click Data 2009

However…Topic and Intent

arethe classical

onesand we are mainly interested in query intent detection.

Page 19: WSCD09 Workshop on Web Search Click Data 2009

Our proposal for the WSCD’09

Page 20: WSCD09 Workshop on Web Search Click Data 2009

“To replicate most of the techniques to perform automatic query intent detection in addition to study the feasibility of a pooling strategy à-la-TREC to evaluate the different techniques.”

“We hope to anticipate some of the benefits and difficulties from a hypothetical bakeoff on broad query classification.”

Page 21: WSCD09 Workshop on Web Search Click Data 2009

Andrei Z. Broder

Informational. The intent is to acquire some information assumed to be present on one or more web pages.

Navigational. The immediate intent is to reach a particular site.

Transactional. The intent is to perform some web-mediated activity.

Query intent

Page 22: WSCD09 Workshop on Web Search Click Data 2009

Andrei Z. Broder

Informational. The intent is to acquire some information assumed to be present on one or more web pages.

Navigational. The immediate intent is to reach a particular site.

Transactional. The intent is to perform some web-mediated activity.

Query intent

difficult to distinguish...

Page 23: WSCD09 Workshop on Web Search Click Data 2009

Query intentInformational Navigational

Transactionalsymptoms separation anxiety dogsshould snow dogs wear booties

what is addison's diseases for dogs

yellowpagesthe huns yellow pages.com

Page 24: WSCD09 Workshop on Web Search Click Data 2009

Query intentInformational Navigational

Transactionalsymptoms separation anxiety dogsshould snow dogs wear booties

what is addison's diseases for dogs

yellowpagesthe huns yellow pages.com

find the ‘right’ websitefind the ‘right’ answertelling apart informational

from navigational/transactional queries

matters.

Page 25: WSCD09 Workshop on Web Search Click Data 2009

Query intent detection methods replicated

Lee et al. 2005, click-through data + document collection, ad hoc thresholdsLiu et al. 2006, click-through data, ad hoc evidencesJansen et al. 2008, rule-basedBrenes and Gayo 2008, click-through data, ad hoc evidencesthese were just the evaluated methods.those requiring training and/or external corporawere left for future research.

Page 26: WSCD09 Workshop on Web Search Click Data 2009

Query intent detection methods replicated

Lee et al. 2005Click distribution (from clickthrough-data) and anchor-link distribution (from document collection)Flat distributions informational queriesSkewed distributions navigational/transactionalAdditionally, average number of clicks per query

Page 27: WSCD09 Workshop on Web Search Click Data 2009

Query intent detection methods replicated

Liu et al. 2006n Clicks Satisfied (nCS)number of sessions where the user clicked, at most, n results (n=2)top n Results Satisfied (nRS)number of sessions where the user clicked, at most, the top n results (n=5)Additionally, click distribution (Lee et al. 2005)

Page 28: WSCD09 Workshop on Web Search Click Data 2009

Query intent detection methods replicated

Jansen et al. 2008Navigational queries:

Contain names of companies, businesses, organizations or peopleContain domain suffixesThey have less than three terms

Most of the data obtained from Freebasehttp://www.freebase.com

Page 29: WSCD09 Workshop on Web Search Click Data 2009

Query intent detection methods replicated

Brenes and Gayo 2008Weight of the most popular result (cPopular)Number of distinct visited results (cDistinct)Navigational queries tend to appear issolated (cSession)

Page 30: WSCD09 Workshop on Web Search Click Data 2009

Proposed evaluation methodPrecision, recall and F-measure6,624 queries (1‰) were manually tagged10 judges plus the authorsEach judge tagged 1,000 queries (aprox.)Third judge polished inconsistencies

Page 31: WSCD09 Workshop on Web Search Click Data 2009

What about the pooling strategy?

Unfeasible :(1. 81% actual navigational queries appear in

the pool2. Most of the queries flagged as

navigational80% of the queries (5 million) to be manually evaluated

Page 32: WSCD09 Workshop on Web Search Click Data 2009

But the sample was pretty “small”…

Sure. Bigger samples!

Dolores Labs Blog:

a must read!

Page 33: WSCD09 Workshop on Web Search Click Data 2009

And what about theresults?

Page 34: WSCD09 Workshop on Web Search Click Data 2009

Best achieversLiu et al. nCS (0.378), nRS (0.398)Brenes and Gayo cPopular (0.374)Jansen et al. heuristic rules (0.360)

Promising featuresnRSusing domains and website namesMonte Carlo simulation (?)

Page 35: WSCD09 Workshop on Web Search Click Data 2009

Best achieversLiu et al. nCS (0.378), nRS (0.398)Brenes and Gayo cPopular (0.374)Jansen et al. heuristic rules (0.360)

Promising featuresnRSusing domains and website namesMonte Carlo simulation (?)

users behaviourexternal knowledgefilling the gap

Page 36: WSCD09 Workshop on Web Search Click Data 2009

Conclusions and Future work

Feasible evaluation methodologyPooling evaluation is unfeasible

Larger tagged samples are needed

Further refinement to improve performance

Page 37: WSCD09 Workshop on Web Search Click Data 2009

Uhh… Questions?