Nuanced graph representation to improve recommendation: The case of browsing and social networks
DESCRIPTION
Graphs are ubiquitous representations of a wide range of online traces generated by user activities, including browsing, messaging, social linking, and many more. Because of their simplicity and power, graphs (like other similar representations of relational data) have been used in a plethora of applications, most of them falling under the umbrella of recommendation and personalization. However, the notion of a graph and its atomic components (nodes and edges) is very often adopted uncritically, without much thought given to their nature or meaning. In real-world scenarios the meaning of a link can vary broadly, even within the same system or interaction type. We study browsing and social graphs and show how to obtain a more nuanced representation of their links, to gain a deeper understanding of their nature and, in turn, to properly exploit information about link type in recommendation tasks. First, we present the use of the BrowseGraph and its decomposition into ReferrerGraphs for image and news recommendation. Last, we show how conversation graphs can be decomposed into subgraphs carrying different information about the type of resources exchanged between peers, providing an overview of the potential that such a nuanced representation can have in the field of recommendation. Our analysis is conducted on large datasets extracted from Yahoo News, Flickr, and aNobii.
TRANSCRIPT
![Page 1: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/1.jpg)
Nuanced graph representation to improve recommendation: The case of browsing and social networks
1st International Workshop on Social Personalisation (SP 2014)
Luca Maria Aiello
![Page 2: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/2.jpg)
Who’s this guy?
![Page 3: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/3.jpg)
Network analysis
![Page 4: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/4.jpg)
Roadmap
Part I: Browsing graphs in context
To surface interesting content and address the cold-start scenario
Part II: Pragmatics of communication graphs
To decompose the dyadic interaction and profile user-to-user ties
![Page 5: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/5.jpg)
Browsing graphs
![Page 6: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/6.jpg)
Team
Luca Maria Aiello
Michele Trevisiol
Alejandro Jaimes
Luca Chiarandini
Rossano Schifanella
![Page 7: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/7.jpg)
Browse Graph
• Nodes are pages
• Edges are aggregated browsing transitions
Trevisiol et al. “Image Ranking Based on User Browsing Behaviour” SIGIR 2012
• Centrality is a “good” indicator of content interestingness
• External layers add useful information
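The BrowseGraph idea above can be sketched in a few lines. This is a minimal illustration, not the implementation from the cited paper: the session format, the `ext:` prefix for external entry pages, and the function names `build_browse_graph` and `pagerank` are all assumptions made for the example, and weighted PageRank stands in for whichever centrality measure was actually used.

```python
from collections import defaultdict


def build_browse_graph(sessions):
    """Aggregate page-to-page transitions over all sessions into weighted edges."""
    edges = defaultdict(int)
    for pages in sessions:
        for src, dst in zip(pages, pages[1:]):
            edges[(src, dst)] += 1
    return dict(edges)


def pagerank(edges, damping=0.85, iterations=50):
    """Weighted PageRank by power iteration; dangling mass is spread uniformly."""
    nodes = sorted({node for edge in edges for node in edge})
    n = len(nodes)
    out_weight = defaultdict(float)
    for (src, _dst), weight in edges.items():
        out_weight[src] += weight
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        dangling = sum(rank[node] for node in nodes if out_weight[node] == 0)
        base = (1 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for (src, dst), weight in edges.items():
            new_rank[dst] += damping * rank[src] * weight / out_weight[src]
        rank = new_rank
    return rank


# Hypothetical sessions; "ext:" marks an external page (the "external layer").
sessions = [
    ["ext:search", "photo_a", "photo_b"],
    ["ext:blog", "photo_a", "photo_c"],
    ["photo_b", "photo_a"],
]
graph = build_browse_graph(sessions)
scores = pagerank(graph)
top = max(scores, key=scores.get)
print(top)
```

Keeping the external pages as nodes lets traffic arriving from outside contribute to the centrality of the pages it lands on.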
![Page 8: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/8.jpg)
Flickr BrowseGraph
• Flickr browsing data
– 2 months, 10M users, 50M nodes, 300M pageviews
![Page 9: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/9.jpg)
Most central nodes in Flickr BrowseGraph
• Comparison with PageRank (no external nodes), Favorites, Clicks, View time
– High quality
– Higher topical variety
– Surfaces photos related to real-world events, or interesting but not popular
Trevisiol et al. “Image Ranking Based on User Browsing Behaviour” SIGIR 2012
![Page 10: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/10.jpg)
Top 10 photos
Art Series Odd Events
![Page 11: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/11.jpg)
Referrer Graph
• External accesses come from heterogeneous environments
Trevisiol et al. “Cold-start News Recommendation with Domain-dependent Browse Graph” RecSys 2014
• Extract subgraphs induced by the browsing traces from the same entry point
• Study their structural differences
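A minimal sketch of the extraction step, assuming each session is recorded as a pair of referrer URL and visited pages. The record format, the `direct` label for sessions without a referrer, and the name `referrer_graphs` are hypothetical choices for illustration.

```python
from collections import defaultdict
from urllib.parse import urlparse


def referrer_graphs(sessions):
    """Group sessions by referrer domain and aggregate transitions per domain."""
    graphs = defaultdict(lambda: defaultdict(int))
    for referrer_url, pages in sessions:
        # Sessions with no referrer are grouped under a hypothetical "direct" label.
        domain = urlparse(referrer_url).netloc or "direct"
        for src, dst in zip(pages, pages[1:]):
            graphs[domain][(src, dst)] += 1
    return {domain: dict(edges) for domain, edges in graphs.items()}


# Hypothetical browsing log: (referrer URL, pages visited in order).
sessions = [
    ("https://twitter.example/status/1", ["news_1", "news_2"]),
    ("https://twitter.example/status/2", ["news_1", "news_2", "news_3"]),
    ("https://search.example/?q=x", ["news_1", "news_4"]),
    ("", ["news_5", "news_1"]),
]
by_domain = referrer_graphs(sessions)
print(sorted(by_domain))
```

Each value in `by_domain` is a weighted edge dictionary, so the same structural measures can be computed per referrer and compared.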
![Page 12: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/12.jpg)
Browsing in News
![Page 13: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/13.jpg)
Yahoo News ReferrerGraphs
• 1 month of Yahoo News browsing log
– 0.5B entries
• Avg. number of hops per session ≈ 2
![Page 14: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/14.jpg)
Domain-dependent consumption
Jaccard similarity of node sets
Kendall tau of nodes' PageRanks
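The two comparison measures named on this slide can be computed as in the sketch below. The score dictionaries are made-up values, and the Kendall tau variant shown (tau-a, restricted to nodes present in both graphs) is an assumption; the transcript does not say which variant was used.

```python
from itertools import combinations


def jaccard(nodes_a, nodes_b):
    """Jaccard similarity between two node sets."""
    a, b = set(nodes_a), set(nodes_b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def kendall_tau(scores_x, scores_y):
    """Kendall tau-a over the nodes shared by both score dictionaries."""
    common = sorted(set(scores_x) & set(scores_y))
    pairs = list(combinations(common, 2))
    if not pairs:
        return 0.0
    concordant = discordant = 0
    for u, v in pairs:
        sign = (scores_x[u] - scores_x[v]) * (scores_y[u] - scores_y[v])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Hypothetical PageRank scores of the same news pages in two ReferrerGraphs.
twitter_scores = {"n1": 0.5, "n2": 0.3, "n3": 0.2}
search_scores = {"n1": 0.2, "n2": 0.3, "n3": 0.4, "n4": 0.1}

print(jaccard(twitter_scores, search_scores))      # 0.75
print(kendall_tau(twitter_scores, search_scores))  # -1.0
```

A high Jaccard value with a low Kendall tau would mean two referrers lead to the same pages but rank them very differently.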
![Page 15: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/15.jpg)
Domain-dependent consumption
![Page 16: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/16.jpg)
News consumption in time
[Figure: PDF(views) vs. normalized article lifespan]
![Page 17: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/17.jpg)
Cold start recommendation
• Fingerprint of traffic depends on the referrer domain
• Can we use this for recommendation?
• Random
• Most popular
• Edge-based
• Content-based
• Cosine sim + TF-IDF
• (Full and mix graph variants)
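One plausible reading of the edge-based approach is sketched below: rank the pages reachable from the current page by transition weight in the graph of the user's referrer domain. The transcript does not spell out the method, so the function `recommend_next`, the fallback to the full graph when the referrer graph runs out of candidates, and the example weights are all assumptions.

```python
def recommend_next(current_page, referrer_graph, full_graph, k=3):
    """Rank candidate next pages by transition weight, preferring the referrer's graph."""

    def ranked(graph):
        candidates = [(w, dst) for (src, dst), w in graph.items() if src == current_page]
        # Heaviest transitions first; ties broken by page id for determinism.
        return [dst for _w, dst in sorted(candidates, key=lambda c: (-c[0], c[1]))]

    picks = ranked(referrer_graph)
    # Assumed fallback: fill remaining slots from the graph built on all traffic.
    for page in ranked(full_graph):
        if page not in picks:
            picks.append(page)
    return picks[:k]


# Hypothetical edge weights (aggregated transition counts).
full_graph = {("a", "b"): 50, ("a", "c"): 30, ("a", "d"): 10}
twitter_graph = {("a", "c"): 20, ("a", "d"): 5}

print(recommend_next("a", twitter_graph, full_graph, k=2))  # ['c', 'd']
print(recommend_next("a", {}, full_graph, k=2))             # ['b', 'c']
```

No user history is needed, only the referrer domain and the current page, which is what makes this usable in a cold-start setting.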
![Page 18: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/18.jpg)
Benchmark
Averaged over 1,438 hourly graphs (~350k users per hour)
![Page 19: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/19.jpg)
Benchmark
Averaged over 1,438 hourly graphs (~350k users per hour)
![Page 20: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/20.jpg)
Takeaways
• Graph structure can be more useful than other simple indicators of user feedback to surface interesting content
• Browsing structure changes radically with the referrer domain
• Historical browsing information is more effective than other cold-start indicators to predict next view (surprising?)
![Page 21: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/21.jpg)
Conversation graphs
![Page 22: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/22.jpg)
Team
Luca Maria Aiello
Rossano Schifanella
Bogdan State
Aiello et al. “Reading the source code of social ties” WebSci 2014
![Page 23: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/23.jpg)
Conversation graph
![Page 24: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/24.jpg)
Beyond simple edges
• Structure
• Content
– Syntactics
– Semantics
• Pragmatics (beyond saying)
– Communication acts that define the type of social relationship
Topic modeling, sentiment analysis, NLP, …
What is the “nature” of a social tie?
![Page 25: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/25.jpg)
Beyond simple edges
• Blau’s Social Exchange Theory
– Exchange of non-material resources
• Objective: label each message with the resources it conveys
Peter Blau “Exchange and power in social life” 1964
User profiling, link profiling, visualization, …
![Page 26: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/26.jpg)
How?
1. Preprocessing
– Stopwords, stemming
2. Message bucketing
– NMF, LDA, …
3. Transition graph
– Buckets as nodes, transitions as edges
• Intuition: conversations tend to stick to the same resource (“You’re very good at it” “You are pretty good as well”)
4. Resource extraction
– Community detection on transition graph
Input: directed comm. multigraph, arcs labeled with time and text
Output: (probabilistic) assignment message → resource (DISCOVERY!)
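Steps 3 and 4 can be illustrated with a small sketch that assumes messages have already been assigned to buckets (steps 1 and 2). Two simplifications are assumptions of this example and not the talk's method: the transition graph is treated as undirected, and connected components over sufficiently heavy edges stand in for a real community detection algorithm. The names `transition_graph`, `group_buckets`, and `min_weight` are hypothetical.

```python
from collections import defaultdict


def transition_graph(conversations):
    """Count transitions between the buckets of consecutive messages in each conversation."""
    weights = defaultdict(int)
    for buckets in conversations:
        for src, dst in zip(buckets, buckets[1:]):
            if src != dst:  # self-transitions carry no information for grouping buckets
                weights[frozenset((src, dst))] += 1
    return dict(weights)


def group_buckets(weights, min_weight=2):
    """Stand-in for community detection: connected components over heavy edges."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for pair, weight in weights.items():
        a, b = sorted(pair)
        find(a)
        find(b)
        if weight >= min_weight:
            parent[find(a)] = find(b)
    groups = defaultdict(set)
    for node in parent:
        groups[find(node)].add(node)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical conversations, each a time-ordered list of message bucket ids.
conversations = [[1, 2, 1, 3], [2, 1, 2], [4, 5, 4], [5, 4, 3]]
weights = transition_graph(conversations)
resources = group_buckets(weights)
print(resources)  # [[1, 2], [3], [4, 5]]
```

Each resulting group of buckets plays the role of one candidate resource; a message inherits the resource of the bucket it was assigned to.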
![Page 27: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/27.jpg)
Experiments
![Page 28: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/28.jpg)
Dataset (anobii.com)
Aiello et al. “People are Strange when you're a Stranger: Impact and Influence of Bots on Social Networks” ICWSM'12
![Page 29: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/29.jpg)
Anobii transition graph
Knowledge exchange
– Technical knowledge of a domain (stackoverflow)
– Request for knowledge
– “I read a very good review of that book”
Status exchange
– Expression of admiration or esteem
– Recognition of the partner’s higher status
– “You are very smart!”
Social support
– Emotional valuation
– Everyday minute exchanges
– “Hope your dad is feeling better now”
80% of messages are correctly assigned (human coders)
![Page 30: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/30.jpg)
![Page 31: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/31.jpg)
Gilbert, Karahalios “Predicting tie strength with social media” CHI 2009
Tie composition and strength
Communication networks induced by the exchange of a single resource
• Status: highly reciprocal, short lived, pervasive
• Support: sentiment involved, long lived, between similar actors
• Knowledge: long messages, between similar actors
![Page 32: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/32.jpg)
Inequality
• Gini coefficient ~0.7 for all networks, higher for status
[Figures: Lorenz curve; assortativity]
• People receive status from people with lower status
Indegree (/instrength) = amount of resource owned
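A short sketch of how the inequality measure could be computed, taking in-strength as the amount of resource a user owns, as defined above. The edge list is invented for illustration and the helper names `in_strength` and `gini` are hypothetical; the Gini formula is the standard one over sorted values.

```python
from collections import Counter


def in_strength(edges):
    """Sum incoming edge weights per node; nodes that only send get 0."""
    strength = Counter()
    for (src, dst), weight in edges.items():
        strength[dst] += weight
        strength[src] += 0
    return dict(strength)


def gini(values):
    """Gini coefficient of non-negative values (0 = perfectly equal)."""
    xs = sorted(values)
    n, total = len(xs), sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum(i * x for i, x in enumerate(xs, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


# Hypothetical single-resource network: three users send status to one user.
status_edges = {("a", "d"): 5, ("b", "d"): 3, ("c", "d"): 2}
owned = in_strength(status_edges)
print(gini(owned.values()))  # 0.75
```

With a finite sample of n users the maximum attainable value is (n - 1) / n, which is why a single owner among four users gives 0.75 and not 1.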
![Page 33: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/33.jpg)
Tie evolution
• Knowledge prevails after three exchanges
• Support increases steadily
• Status-exchange fades away quickly
[Figure: ratio of resource in conversation vs. conversation length]
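The quantity plotted here can be approximated as below, reading the axes as "share of each resource among the messages of conversations of a given length". That reading, the label names, and the function `resource_ratio_by_length` are assumptions; the data is made up.

```python
from collections import Counter, defaultdict


def resource_ratio_by_length(conversations):
    """For each conversation length, the share of messages carrying each resource."""
    counts = defaultdict(Counter)
    for labels in conversations:
        counts[len(labels)].update(labels)
    ratios = {}
    for length, counter in counts.items():
        total = sum(counter.values())
        ratios[length] = {resource: c / total for resource, c in counter.items()}
    return ratios


# Hypothetical conversations, each a list of per-message resource labels.
conversations = [
    ["status"],
    ["status", "status"],
    ["status", "knowledge"],
    ["knowledge", "knowledge", "support", "knowledge"],
]
ratios = resource_ratio_by_length(conversations)
print(ratios[2])  # {'status': 0.75, 'knowledge': 0.25}
```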
![Page 34: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/34.jpg)
Generality? (Flickr!)
![Page 35: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/35.jpg)
Flickr
[Figure: ratio of resource in conversation vs. conversation length]
![Page 36: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/36.jpg)
Takeaways
• Need for a description of social interaction that goes beyond topics/sentiment/etc.
• Big potential impact on related fields of network studies (e.g., information propagation)
• Social tie → sequence of individual exchanges → computational properties of social rituals → “Grammar of society”
![Page 37: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/37.jpg)
Conclusion
Graphs are usually not isolated, homogeneous entities. Avoid oversimplifying them whenever possible.
![Page 38: Nuanced graph representation to improve recommendation: The case of browsing and social networks](https://reader034.vdocuments.site/reader034/viewer/2022051513/5468e66eaf7959c23c8b614f/html5/thumbnails/38.jpg)
Quick announcement
(I’ll be ready for questions in a few seconds!)