identity and search in social networks

26
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman

Upload: natara

Post on 29-Jan-2016

22 views

Category:

Documents


0 download

DESCRIPTION

Identity and search in social networks. Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman. Presented by Pooja Deodhar. Presentation Outline. Introduction Contentions – Social Networks Algorithm explanation Our model and Milgram’s findings Further Extensions Applications. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Identity and search in social networks

Identity and search in social networks

Presented by Pooja Deodhar

Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman

Page 2: Identity and search in social networks

Presentation OutlinePresentation OutlineIntroductionContentions – Social NetworksAlgorithm explanationOur model and Milgram’s findingsFurther ExtensionsApplications

2

Page 3: Identity and search in social networks

IntroductionIntroductionSocial Networks are “Searchable”Our model offers explanation of

searchability in terms of recognizable personal identities

Personal identities - sets of characteristics in different social dimensions

Class of searchable networks and method for searching them applicable to many real world problems

3

Page 4: Identity and search in social networks

IntroductionIntroductionSmall World Network

◦Network in which most nodes are not neighbors of each other but most nodes can be reached from every other node by a number of hops

4

Page 5: Identity and search in social networks

IntroductionIntroduction

Milgram’s Experiment ◦ Short paths exist between individuals in large

social network◦ Ordinary people can find these short paths◦ People rarely have more than local knowledge

about the network

5

Source

Page 6: Identity and search in social networks

IntroductionIntroductionSearchability

◦Property of being able to find a target quickly

Shown to exist in networks◦With certain fraction of hubs (highly

connected nodes which once reached can distribute messages to all parts of the network)

◦Built upon underlying geometric lattice

6

Page 7: Identity and search in social networks

IntroductionIntroductionLimited hubs in social networksSocial Networks are more like a

peer-to-peer networkNeed for a hierarchical modelSome measure of distance

between individualsCan be based on targets identity,

friends identity, friend’s popularity

7

Page 8: Identity and search in social networks

Contentions – Social Contentions – Social NetworksNetworksIndividual identities – sets of

characteristics attributed to them by virtue of association, participation in social groups

Groups – Collection of individuals with well-defined set of social characteristics

8

Page 9: Identity and search in social networks

Contentions – Social Contentions – Social NetworksNetworksBreaking down of world into set

of layersTop layer – whole populationLower layers – specific division

into groups

9

Page 10: Identity and search in social networks

Contentions – Social Contentions – Social NetworksNetworksSimilarity xij – between individuals i, j xij – Height of the lowest common

ancestor level between i and jIndividuals in same group are at

distance of one from each other

10

Page 11: Identity and search in social networks

Contentions – Social Contentions – Social NetworksNetworks

Combined social distance yij = minh xij

In the above figure H = 2In 1st heirarchy, yij = 1 and yjk = 1

in 2nd

But yik = 4 > yij + yjk = 211

Page 12: Identity and search in social networks

Contentions – Social Contentions – Social NetworksNetworksProbability of acquaintance

between i and j decreases with decreasing similarity of groups to which they belong

Link distance x for individual i has probability

p(x) = ce-αx

Measure of homophily – tendency of like to associate with like

12

Page 13: Identity and search in social networks

Contentions – Social Contentions – Social NetworksNetworksIndividuals hierarchically

partition the social world in more than one way.◦h = 1, …, H hierarchies

Node’s identity is the vector ◦ is position of node i in hierarchy

h.Social distance

hiv

hiv

hij

hij x y min

13

Page 14: Identity and search in social networks

Contentions – Social Contentions – Social NetworksNetworksAt each step the holder i of the

message passes it to one of its friends who is closest to the target t in terms of social distance

Individuals know the identity vectors of:◦themselves◦their friends,◦the target

Two kinds of partial information – social distance and network paths

14

Page 15: Identity and search in social networks

Algorithm ExplanationAlgorithm ExplanationPrincipal objective – determine

conditions for average path length L of a message chain is small

Define q as probability of an arbitrary message chain reaching a target.

Searchable network - Any network for which q ≥ rfor a desired r.

15

Page 16: Identity and search in social networks

SearchabilitySearchabilitySearchable networks occupy a

broad region of parameter space <α,H> which are sociologically plausible

Searchability is generic property of social networks

16

Page 17: Identity and search in social networks

Algorithm ExplanationAlgorithm ExplanationIn terms of chain length L,

q = (1 - p)L ≥ rL = length of message chainP = message failure probability

From above, L can be obtained by the approximate inequality,

L <= ln r / ln (1 - p)

17

Page 18: Identity and search in social networks

Our model and Milgram’s Our model and Milgram’s findingsfindingsAll searchable networks have α > 0, H

> 1Individuals are essentially homophilous

but judge similarity along more than one social dimension

Best performance is achieved for H = 2 or 3

Thus, use of 2 or 3 dimensions used by individuals in small world experiments when forwarding a message

18

Page 19: Identity and search in social networks

Searchable NetworksSearchable Networks

19

Solid boundary – N=102,400Dot-dash – N=204800Dash – N=409,600p = 0.25, b = 2, g = 100, r = 0.25

at least

Page 20: Identity and search in social networks

Our model and Milgram’s Our model and Milgram’s findingsfindingsIncreasing number of independent

dimensions from H = 1 yields dramatic reduction in delivery time for α > 0

This improvement lost as H is increased further

Thus, network ties become less correlated as H increases

For large H, network becomes a random graph, search algorithm becomes random walk

20

Page 21: Identity and search in social networks

Searchable NetworksSearchable Networks

21

Probability of message completion when for α = 0 (squares) and for α = 2 (circles) for N = 102,400

Horizontal line – pos of the threshold Open symbols indicate network is

searchable – q <= r

Page 22: Identity and search in social networks

Our model and Milgram’s Our model and Milgram’s datadata

n(L) – no. of completed chains of length L taken from original small world expt. (shown by bar graphs)

Taken for example of our model for N = 10^8 individuals and for 42 completed chains shown by filled circles

22

Page 23: Identity and search in social networks

Our model and Milgram’s Our model and Milgram’s findingsfindingsComparison of distribution of

chain lengths in our model with that of Travers and Milgram

Avg. chain length for Milgrams expt = 6.5

Avg. chain length for our model = 6.7

23

Page 24: Identity and search in social networks

SummarySummarySimple greedy algorithm.Represents properties present

in real social networks:◦Considers local clustering.◦Reflects the notion of locality.

High-level structure + random links.

24

Page 25: Identity and search in social networks

Further ExtensionsFurther ExtensionsShould we consider other parameters such as friend’s popularity information in addition to homophily?◦Allow variation in node degrees?

Assume correlation between hierarchies?

Are all hierarchies equally important?

25

Page 26: Identity and search in social networks

ApplicationsApplicationsBroad class of decentralized

problems◦Peer to peer networking

Any data structure in which data elements can be judged along more than one dimension

Designing of databases◦Eg. Music files – same genre/same

year

26