collaborative personalized twitter search with topic-language models

Collaborative Personalized

Twitter Search with Topic-Language Models

Jan Vosecky

Kenneth Wai-Ting Leung

Wilfred Ng

Supported by SIGIR Travel Grant

Microblogs

Tweet 1

Tweet 2

User-generated content

– Short length

– Informal language, free-form

– Diverse topics

Very high volume

Information overload

Searching on Twitter

“When you've got 5 minutes to fill,

Twitter is a great way to fill 35 minutes”

@mattcutts

Searching for “ipad” on Twitter

Around 50 tweets

mentioning “iPad”

posted within

1-minute

Personalizing

Twitter Search

Microblog data

• Compared with traditional domains

(e.g. web search, news search):

– Explicitly stated user interests

• tweets, conversations, re-tweets

– Social network structure

• following

• Individual user’s data

– Diverse

– Sparse

• User’s social connections

Personalization challenge

Putting all kinds of information into a single user model

inaccurate, noisy

– Diverse

– Sparse

– Diverse friends, topics

– Need to carefully organize friends’ informatio

Short messages

Few messages

Few social connections

Little search history

– Diverse

– Sparse

– Need to carefully organize friends’ information

for it to be useful

– Diverse

– Sparse

Topics

Contributions

Novel User Model

structure

Collaborative User

Language

modeling IR

Query likelihood model

– Given a query Q and a

document D,

Topic Models

A latent topic in LDA:

“Information Technology”

Google 0.00040

Android 0.00020

Microsoft 0.00010

App 0.00010

Security 0.00009

Email 0.00008

Login 0.00005

Virus 0.00004

Scope of our approach

• Input to our algorithm:

– Set of n documents returned by Twitter given

query Q

• Our task:

– Rank the documents according to:

• Query

• User model

Proposed Framework

At a Glance: Proposed User Model

Individual User Model

ITW = 2/5 = 40%

W = 2/5 = 40%

Manchester: 5

Play: 4

Win: 2

Android: 6

Coding: 2

Java: 2

ID Tweet Time Topic

1 Manchester playing tonight 1. 1. Sport

2 Doing some android coding 2. 1. IT

3 Great game, great win for manchester! 5. 1. Sport

4 Had a great apple cake with chocolate 6. 1. Food

5 My java code keeps throwing exceptions 10. 1. IT

W = 1/5 =

Cake: 6

Apple: 5

Oven: 2

Individual User Model (IM)

Is u interested in word w from topic k?

Is u interested in topic k?

Is word w related to topic k?

Prior prob. of topic k

Recent interest is more important:

From user From topic model

Personalization using IM

Is the Query relevant to topic k?

Is Q related to topic k in general?

Is the User interested in topic k?

Is Q related to the words in topic k that User is interested in?

Is the Document relevant to topic k?

Is D related to topic k in general?

Is the User interested in topic k?

Is D related to the words in topic k that User is interested in?

Prior Document probability

Q = australia

I’m interested in IT and travel

I’ve never tweeted about Australia

Travel

Politics

Business

Top 10 restaurants in Australia

iPhones, iPads, and Macs Hacked and Hijacked

for Ransom in Australia - Gotta Be Mobile

Tweet (D):

Q = australia

I’m interested in IT and travel

I have tweeted about IT in Australia

Travel

Politics

Business

Top 10 restaurants in Australia

iPhones, iPads, and Macs Hacked and Hijacked

for Ransom in Australia - Gotta Be Mobile

Tweet (D):

Collaborative User Model

Sport Food

Manchester: 5

Play: 4

Win: 2

Cake: 6

Apple: 5

Oven: 2

Friend 1

Manchester: 5

Play: 4

Win: 2

Friend 2

IT Music

Radiohead: 4

Listen: 2

Song: 5

Android: 6

Coding: 2

Java: 2

Friend 3

Manchester: 5

Play: 4

Win: 2

Android: 6

Coding: 2

Java: 2

Radiohead: 4

Listen: 2

Song: 5

Cake: 6

Apple: 5

Oven: 2

Collaborative Model

Collaborative User Model

• Weighted sum of IM’s of the top-n friends– based on the amount of interactions (re-tweets, mentions,

conversations)

• Weight of each friend f:

– wP(f): Popularity of f

– wA(u,f): Affinity of u and f

• Weight of each f’s topic k:

– wB(u,k): Topic bias

– wI(u,f,k): Topic-interaction between u and f

Personalization using IM and CM

From user From topic modelFrom friends

Dirichlet smoothing

Depends on the amount of user’s tweets

Search User Model (SM)

• Feedback sources: Queries + clicks

• What does a ‘click’ mean?

URL clickre-tweetfavorite

Search User Model (SM)

• Feedback sources: Queries + clicks

• Feedback from a ‘click’:

– Query-topic: preference for topic k when issuing Q

– Topic-word: preference for words in topic k

– Topic: user’s search bias towards topic k

Evaluation

Query log collection

• Evaluation interface

– Submit query, returns tweets from Twitter API

– Rate relevant tweets

Datasets

• Controlled user study (Log_CoS)

– 11 users

• In-the-wild user study (Log_IwS)

– 24 users

Log_CoS Log_IwS

Ranking Results

Baselines:

Query likelihood (J-M smoothing)

Topic model-based IR

Personalized search (User-specific language models)

Collaborative search (Cluster-specific language models)

Collaborative Personalized search

Ranking Results

Average per-user ranking performance

after processing i user’s queries

Comparison of models

(a) Log_CoS (b) Log_IwS

Query types

(a) Log_CoS (b) Log_IwS

Performance by query type

In summary

• Collaborative Personalized Twitter Search

– User’s tweets

– User’s friends’ tweets

– User’s search activity

– Organized around topics

• topic-specific language models

Future work

• Query-dependent personalization

strategies

• Selection of an optimal set of friends for

collaborative model

• Integrating spatial and temporal features

Thank You!

Jan Vosecky

Kenneth Wai-Ting Leung

Wilfred Ng

Supported by SIGIR Travel Grant

collaborative personalized twitter search with topic-language models

Social Media

a collaborative filtering approach to personalized ... · a...

umap 2011: analyzing user modeling on twitter for...

personalized cinemagraphs using semantic...

personalized corporate twitter presence

new wine in no bottles: immersive, personalized ... ·...

center for collaborative education: massachusetts...

personalized recommender system using entropy based...

decentralized collaborative learning of personalized

(icmia 2013) personalized community detection using...

personalized filtering of twitter stream

personalized news recommendation based on twitter user...

algorithmic approaches to personalized health care principal...

ohio appalachian collaborative · 2019-11-12 · with a...

social-personalized versus computer-personalized methods...

analyzing user modeling on twitter for personalized news...

what is twitter? real-time information network news, sports,...

rapid collaborative knowledge building via twitter after...

collaborative filtering with personalized...

twitter is faster: personalized time-aware video...

personalized recommender by exploiting domain based expert...