understanding and predicting personal navigation

19
Understanding and Predicting Jaime Teevan, Daniel J. Liebling and Gayathri Ravichandran Geetha Microsoft Research Personal Navigation

Upload: victoria-maud-little

Post on 19-Jan-2016

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Understanding and Predicting Personal Navigation

Understanding and Predicting

Jaime Teevan, Daniel J. Liebling and Gayathri Ravichandran Geetha

Microsoft Research

Personal Navigation

Page 2: Understanding and Predicting Personal Navigation

7th

33% queries repeated73% of those are navigational

[Teevan et al. SIGIR 2007]

(tomorrow @ 14:00)

Page 3: Understanding and Predicting Personal Navigation

Authors Tutorials New content: WSDM 2012 Attending Workshops to be held February 9-12 in Sponsors Conference Venue Seattle, WA.www.wsdm2011.org

33% queries repeated73% of those are navigational

[Teevan et al. SIGIR 2007]

Page 4: Understanding and Predicting Personal Navigation

Road Map of Talk

• General Navigation– Identifying general navigation– Understanding general navigation

• Personal Navigation wsdm– Identifying personal navigation– Compare with general navigation– Coverage and accuracy of prediction– Consistency of behavior over time

• Bridging general and personal navigation

Bing search logs70 million queries21 million users

microsoft research

Page 5: Understanding and Predicting Personal Navigation

Road Map of Talk

• General Navigation– Identifying general navigation– Understanding general navigation

• Personal Navigation wsdm– Identifying personal navigation– Compare with general navigation– Coverage and accuracy of prediction– Consistency of behavior over time

• Bridging general and personal navigation

Bing search logs70 million queries21 million users

microsoft research

Page 6: Understanding and Predicting Personal Navigation

Identifying General Navigation

• Ask people (“Were you looking for this site?”)– 1 in 4 queries reported to be navigational

• Query string (wsdm.org or microsoft)– 10% of queries identified as navigational

• Click behavior– Look for low click entropy– Need lots of data (query instances, users, clicks)

Page 7: Understanding and Predicting Personal Navigation

Understanding General Navigation

• Identified 390 general navigation queries– 12% of query volume

• Query strings straightforward– facebook, youtube, myspace – Short (½ the length of typical Web queries)– Contain a URL fragment 20% of the time

• Navigation target usually first result

Page 8: Understanding and Predicting Personal Navigation

General Navigation Mistakes

• Click predicted only 72% of the time– Double the accuracy for the average query– But what’s going on the other 28% of the time?

• Many typical navigation queries not identified– craigslist– weather.com

(people visit interior pages)(people visit related pages)

3% visit http://geo.craigslist.org/iso/us/ca

17% visit http://weather.yahoo.com

Page 9: Understanding and Predicting Personal Navigation

Road Map of Talk

• General Navigation– Identify high quality common queries– Look navigational ≠ navigational

• Personal Navigation wsdm– Identifying personal navigation– Compare with general navigation– Coverage and accuracy of prediction– Consistency of behavior over time

• Bridging general and personal navigation

Bing search logs70 million queries21 million users

microsoft research

Page 10: Understanding and Predicting Personal Navigation

Road Map of Talk

• General Navigation– Identify high quality common queries– Look navigational ≠ navigational

• Personal Navigation wsdm– Identifying personal navigation– Compare with general navigation– Coverage and accuracy of prediction– Consistency of behavior over time

• Bridging general and personal navigation

Bing search logs70 million queries21 million users

microsoft research

Page 11: Understanding and Predicting Personal Navigation

Identifying Personal Navigation

• Repeat queries are often navigational• The same navigation used over and over again• Was there a unique click on the same result

the last 2 times the person issued the query?

wsdmwsdmhong kongwsdmwsdmsheratonsigirwsdm cfp

Page 12: Understanding and Predicting Personal Navigation

Understanding Personal Navigation

• Identified millions of navigation queries– Most occur fewer than 25 times in the logs– 15% of the query volume

• Queries more ambiguous– Rarely contain a URL fragment– Click entropy the same as for general Web queries– enquirer (multiple meanings)– bed bugs (found navigation)– etsy (serendipitous encounters)

National Enquirer

Cincinnati Enquirerhttp://www.medicinenet.com/bed_bugs/article.htm

[Informational]Etsy.com

Regretsy.com (parody)

Page 13: Understanding and Predicting Personal Navigation

Personal Navigation Accurate

• Target less likely to be ranked first ..– .. than target of general navigation– .. than the average Web search click

• Nonetheless, prediction very accurate– Correct 95% of the time

Page 14: Understanding and Predicting Personal Navigation

Prediction Consistent Over Time

• Looked at different history intervals– How much do we need

to know about a person?– Offline predictions?

• Prediction accuracy consistent over time

• Coverage decreases with stale history

Accuracy Coverage

1 month 95% 15%

1 week 94% 13%

Last week 95% 11%

A week ago 90% 5%

Page 15: Understanding and Predicting Personal Navigation

Road Map of Talk

• General Navigation– Identify high quality common queries– Look navigational ≠ navigational

• Personal Navigation wsdm– Re-finding often navigational– Identify unusual navigational queries– High coverage and accuracy– Behavior consistent over time

• Bridging general and personal navigation

Bing search logs70 million queries21 million users

microsoft research

Page 16: Understanding and Predicting Personal Navigation

Road Map of Talk

• General Navigation– Identify high quality common queries– Look navigational ≠ navigational

• Personal Navigation wsdm– Re-finding often navigational– Identify unusual navigational queries– High coverage and accuracy– Behavior consistent over time

• Bridging general and personal navigation

Bing search logs70 million queries21 million users

microsoft research

Page 17: Understanding and Predicting Personal Navigation

Bridging Personal and General

• Some personal navigation queries are general navigation queries

Personal

15%General

12%5%

Accuracy of prediction: Personal Navigation 95% General Navigation 72%

Opportunity to combine aggregate and individual

data to increase coverage and drop inaccurate general navigation

Page 18: Understanding and Predicting Personal Navigation

Summary of Talk

• General Navigation– Identify high quality common queries– Look navigational ≠ navigational

• Personal Navigation wsdm– Re-finding often navigational– Identify unusual navigational queries– High coverage and accuracy– Behavior consistent over time

• General & personal navigation complementary

An opportunity for personalization

that works!

microsoft research

Page 19: Understanding and Predicting Personal Navigation

Questions?

Jaime Teevan, Daniel J. Liebling and Gayathri Ravichandran Geetha

Microsoft Research