on the evolution of statistical evidence in dynamic social ... · dynamic social networks joris...
TRANSCRIPT
On the Evolution of Statistical Evidence inDynamic Social Networks
Joris Mulder & Roger Leenders
Tilburg University, the Netherlands
Seminar at Social Networks Working Group, UC Irvine, 10/24/17
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 1 / 44
Outline
1 Dynamic social networks of colleagues in a large organization
2 Relational event modelModelCapturing the dynamic nature using a moving windowAnalysis I of email data: Estimation
3 Quantifying Statistical Evidence Using the Bayes FactorMethodologyAnalysis II of email data: Evolution of Statistical Evidence
4 Conclusions and furter research
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 2 / 44
Dynamic social networks of colleagues in a large organization
Outline
1 Dynamic social networks of colleagues in a large organization
2 Relational event modelModelCapturing the dynamic nature using a moving windowAnalysis I of email data: Estimation
3 Quantifying Statistical Evidence Using the Bayes FactorMethodologyAnalysis II of email data: Evolution of Statistical Evidence
4 Conclusions and furter research
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 3 / 44
Dynamic social networks of colleagues in a large organization
Dynamic social networks
The concept of time
The concept of time (e.g., speed, pacing, rhythm) is often notexplicitly included in social theories (merely “X causes Y”, Monge,1991; Leenders et al., 2016).
If there is an effect, how quickly does it have an effect? How longdoes it have an effect? And does the effect change linearly ornonlinearly?
The activity of the network is generally not constant over time.
Earlyaction
Earlyaction
Steadyaction
Steadyaction
Deadlineaction
Deadlineaction
time time
activity
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 4 / 44
Dynamic social networks of colleagues in a large organization
Dynamic social networks
Social network of colleagues
How do colleagues share and seek information from each other inlarge organizations?
Is this behavior of information-sharing and information-seekingdynamic in nature? Yes!
How is the process affected by hierarchy, team membership,location?
How is the process affected by interventions?
How can we quantify statistical evidence between hypotheses aboutthis continuous process?
Data
Approx. 14,000 email messages were sent in 2010 between approx.2,500 colleagues in a large organization.
The emails contained information about new developments andtechnologies.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 5 / 44
Dynamic social networks of colleagues in a large organization
Drivers of the interaction process
Endogenous and exogenous effects
The interaction process is driven by endogenous effects andexogenous effects.
Examples of endogenous effects include inertia, reciprocity,transitivity.
Examples of exogenous effects include hierarchical position, location,team membership.
These effects are time-dependent.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 6 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t1
Location 1 Location 2
t1
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 7 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t2
Location 1 Location 2
t2
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 8 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t3
Location 1 Location 2
t3
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social Networks Tiburg University, Tilburg 9 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t4
Location 1 Location 2
t4
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 10 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t5
Location 1 Location 2
t5
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 11 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t6
Location 1 Location 2
t6
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 12 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t7
Location 1 Location 2
t7
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 13 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t8
Location 1 Location 2
t8
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 14 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t9
Location 1 Location 2
t9
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 15 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t10
Location 1 Location 2
t10
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 16 / 44
Dynamic social networks of colleagues in a large organization
Email messages
Message at time t11
Location 1 Location 2
t11
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 17 / 44
Relational event model
Outline
1 Dynamic social networks of colleagues in a large organization
2 Relational event modelModelCapturing the dynamic nature using a moving windowAnalysis I of email data: Estimation
3 Quantifying Statistical Evidence Using the Bayes FactorMethodologyAnalysis II of email data: Evolution of Statistical Evidence
4 Conclusions and furter research
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 18 / 44
Relational event model Model
Relational event model
Relational events
Event = {sender, receiver, time, weight, type, modality, ...}Events are recorded in an event list or ‘relational event history’.
Relational event model of Butts (2008)
Key feature: Model the timing of social interactions.
Related to Cox’s proportional hazards model.
The time until the next relational event from sender s to receiver r ,where the pair (s, r) lies in the risk set Rt , is modeled using anexponential distribution with rate parameter λ(s, r , t).
The rate at time t is modeled as a log linear function oftime-dependent endogenous and exogenous covariates, e.g.,
λ(s, r , t) = exp{xs,tβinertia,t+xreciprocity ,tβreciprocity ,t+xhierarchy ,tβhierarchy}
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 19 / 44
Relational event model Model
Relational event model
Likelihood
1
23l(2,1,t)
l(1,2,t)l(3,2,t)
l(2,3,t)
l(1,3
,l(3,1
,t)
t)
Time till the next event: tm − tm−1 ∼ Exp(∑
(s′,r ′)∈Rtλ(s ′, r ′, t)
).
Which relational event: P(s, r) = λ(s,r ,t)∑(s′,r′)∈Rt
λ(s′,r ′,t) .
The events that did not occur are right-censored.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 20 / 44
Relational event model Model
Relational event model
Likelihood
23l(2,1,t)
l(1,2,t)l(3,2,t)
l(2,3,t)
l(3,1
,t)l(
1,3,t)
1
Time till the next event: tm − tm−1 ∼ Exp(∑
(s′,r ′)∈Rtλ(s ′, r ′, t)
).
Which relational event: P(s, r) = λ(s,r ,t)∑(s′,r′)∈Rt
λ(s′,r ′,t) .
Events that did not occur are right-censored (DuBois et al. 2013).
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 21 / 44
Relational event model Capturing the dynamic nature using a moving window
Capturing the dynamic nature
Moving window
A moving window was used to get insights about the dynamicnature of the effects.
The moving window was set to 60 days. This implies that only theemail messages that occurred within the last 60 days we taken intoaccount to fit the model.
A moving window can be seen an operationalization of the memoryof the process.
March 7Jan 4
window
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 22 / 44
Relational event model Capturing the dynamic nature using a moving window
Capturing the dynamic nature
Moving window
A moving window was used to get insights about the dynamicnature of the effects.
The moving window was set to 60 days. This implies that only theemail messages that occurred within the last 60 days we taken intoaccount to fit the model.
A moving window can be seen an operationalization of the memoryof the process.
april 15Feb 13
window
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 23 / 44
Relational event model Capturing the dynamic nature using a moving window
Capturing the dynamic nature
Moving window
A moving window was used to get insights about the dynamicnature of the effects.
The moving window was set to 60 days. This implies that only theemail messages that occurred within the last 60 days we taken intoaccount to fit the model.
A moving window can be seen an operationalization of the memoryof the process.
Nov 4Sep 4
window
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 24 / 44
Relational event model Capturing the dynamic nature using a moving window
Capturing the dynamic nature
Moving window
A moving window was used to get insights about the dynamicnature of the effects.
The moving window was set to 60 days. This implies that only theemail messages that occurred within the last 60 days we taken intoaccount to fit the model.
A moving window can be seen an operationalization of the memoryof the process.
Dec 15Oct 15
window
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 25 / 44
Relational event model Analysis I of email data: Estimation
Relational event model for email data
Email data
In the case of 2500 colleagues, the risk set consists of2500× 2499 = 6, 247, 500 possible relational events.
Every relational event results in 6, 247, 499 right censored events andone observed event.
14, 000 email messages were recorded. This makes a dataset of6, 247, 500× 14, 000 = 87, 465, 000, 000 events of which 14, 000 areobserved and the rest is right censored.
Hence the complete dataset that is analyzed is huge whichcomplicates the analysis.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 26 / 44
Relational event model Analysis I of email data: Estimation
Relational event model for email data
Subset of email data
I took a subset of all the events based on the 55 most active personsin the network. This resulted an relational event history of 1,505events.
The risk set consisted of 55× 54 = 2970 possible events.
Thus the complete data set that is analyzed consisted of2970× 1, 505 = 4, 469, 850 events out of which 1,505 were observedand the rest were right-censored.
The rem-package of Butts was quite slow. Analysis was done usingthe coxph function in the survival-package.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 27 / 44
Relational event model Analysis I of email data: Estimation
Relational event model for email data
Endogenous covariates
Recency send
Quantifies if recent sending activity results in future sending activity.Statistic: 1
1+TimeUntilLastSendingEmail
Recency receive
Quantifies if recent receiving activity results in future receivingactivity.Statistic: 1
1+TimeUntilLastReceivingEmail
Sender out-degree
Quantifies if node was a highly active sender in the last period, thisresults in high future sending activity.Statistic: log #SendingEvents+1
i+N,
for node i and N potential events (DuBois et al., 2013).
Sender in-degree
Quantifies if node was a highly active receiver in the last period, thisresults in high future receiving activity.Statistic: log #ReceivingEvents+1
i+N.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 28 / 44
Relational event model Analysis I of email data: Estimation
Relational event model for email data
Exogenous covariates
Same gender (0,1)
Same location (0,1)
Same job description (0,1)
Difference between hierarchical levels (−3,−2, . . . , 3).
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 29 / 44
Relational event model Analysis I of email data: Estimation
Evolution of dynamic interaction process over time
Some empirical results
50 100 150 200 250 300 350
−2−1
01
23
time
estim
ates
Same locationHierarchical differenceSame genderSame function
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 30 / 44
Relational event model Analysis I of email data: Estimation
Evolution of dynamic interaction process over time
Some empirical results
50 100 150 200 250 300 350
−3−2
−10
12
time
estim
ates
send out−degreesend in−degree
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 31 / 44
Quantifying Statistical Evidence Using the Bayes Factor
Outline
1 Dynamic social networks of colleagues in a large organization
2 Relational event modelModelCapturing the dynamic nature using a moving windowAnalysis I of email data: Estimation
3 Quantifying Statistical Evidence Using the Bayes FactorMethodologyAnalysis II of email data: Evolution of Statistical Evidence
4 Conclusions and furter research
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 32 / 44
Quantifying Statistical Evidence Using the Bayes Factor Methodology
Quantifying Statistical Evidence Using the Bayes Factor
Definition
The Bayes factor between hypothesis H1 and H2 is defined as the ratio ofthe marginal likelihoods
B12 =p1(Data)
p2(Data).
The Bayes factor B12 quantifies the relative evidence in the data for H1
against H2.
Testing multiple hypotheses
Bayes factors can straightforwardly be used for testing multiplehypotheses. Furthermore, Bayes factors satisfy the transtivity property.
B12 = 10B23 = 5
}⇒ B13 = B12 × B23 = 50.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 33 / 44
Quantifying Statistical Evidence Using the Bayes Factor Methodology
Quantifying Statistical Evidence Using the Bayes Factor
How to specify the priors?
In order to compute Bayes factors, priors need to be specified for theunknown parameters under each hypothesis.
Priors
Priors reflect our beliefs about the model parameters before observing thedata.
Default approach
We consider the situation where prior information is weak. For this reasonwe want to quantify statistical evidence using default Bayes factors basedon default priors where subjective prior specification is avoided.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 34 / 44
Quantifying Statistical Evidence Using the Bayes Factor Methodology
Quantifying Statistical Evidence Using the Bayes Factor
Default Bayes factors
In the methodology of Gu et al. (2017), only the ML estimates andthe estimated covariance matrix of the estimates are needed.
The default prior is a scaled version of the posterior (O’Hagan,1995), such that (i) it contains minimal information and (ii) it iscentered at the null value.
Multiple hypothesis test (some Greek...)
Consider the hypotheses: H0 : β = 0, H1 : β < 0, H2 : β > 0, Hu : β ∈ R:
B0u =p(β = 0|Data)
p(β = 0)B1u =
P(β < 0|Data)
P(β < 0)B2u =
P(β > 0|Data)
P(β > 0)
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 35 / 44
Quantifying Statistical Evidence Using the Bayes Factor Methodology
Quantifying Statistical Evidence Using the Bayes Factor
Multiple hypothesis test (some Greek...)
Consider the hypotheses: H0 : β = 0, H1 : β < 0, H2 : β > 0, Hu : β ∈ R:
B0u =p(β = 0|Data)
p(β = 0)B1u =
P(β < 0|Data)
P(β < 0)B2u =
P(β > 0|Data)
P(β > 0)
=.1
.4= .25 =
.98
.50= 1.96 =
.02
.50= .04.
.4
.10 b
posteriorposterior
priorprior0 b
P(P(b<0 b<0 | Data)=.98| Data)=.98
P(P(b<0)=.5<0)=.50
P(P(b>0 b>0 | Data)=.02| Data)=.02
P(P(b>0)=.5>0)=.50
0 b
H0: b=0 versus Hu: b H1: b<0 versus Hu: b H2: b>0 versus Hu: b
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 36 / 44
Quantifying Statistical Evidence Using the Bayes Factor Methodology
Quantifying Statistical Evidence Using the Bayes Factor
Posterior probabilities
The Bayes factors can be used to update prior odds of the hypotheses toobtain posterior odds:
P(H0|Data)
P(H1|Data)= B01 ×
P(H0)
P(H1).
Three hypotheses
If we assume the three hypotheses, H0, H1, and H2, to be equally likely apriori, then the posterior probabilities can be computed as
P(H0|Data) =.25
.25 + 1.96 + .04= .11
P(H1|Data) =1.96
.25 + 1.96 + .04= .87
P(H1|Data) =.04
.25 + 1.96 + .04= .02.
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 37 / 44
Quantifying Statistical Evidence Using the Bayes Factor Analysis II of email data: Evolution of Statistical Evidence
The Evolution of Statistical Evidence
Hypothesis test I
H0 : no hierarchical effect
H1 : bottom-up behavior
H2 : top-down behavior
50 100 150 200 250 300 350
−8−6
−4−2
02
46
days
weig
ht o
f evid
ence
(log
(BF)
)
H0 vs H1H0 vs H2
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 38 / 44
Quantifying Statistical Evidence Using the Bayes Factor Analysis II of email data: Evolution of Statistical Evidence
The Evolution of Statistical Evidence
Hypothesis test I
H0 : no hierarchical effect
H1 : bottom-up behavior
H2 : top-down behavior
50 100 150 200 250 300 350
0.0
0.2
0.4
0.6
0.8
1.0
days
post
erio
r pr
obab
ility
H0H1H2
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 39 / 44
Quantifying Statistical Evidence Using the Bayes Factor Analysis II of email data: Evolution of Statistical Evidence
The Evolution of Statistical Evidence
Hypothesis test II
H0 : function = location = gender
H1 : function > location = gender
H2 : function > location > gender
50 100 150 200 250 300 350
050
100
days
weig
ht o
f evid
ence
(log
(BF)
) H2 vs H1H2 vs H0
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 40 / 44
Quantifying Statistical Evidence Using the Bayes Factor Analysis II of email data: Evolution of Statistical Evidence
The Evolution of Statistical Evidence
Hypothesis test II
H0 : function = location = gender
H1 : function > location = gender
H2 : function > location > gender
50 100 150 200 250 300 350
0.00.2
0.40.6
0.81.0
1.21.4
days
poste
rior p
roba
bility
H0:func=loc=genderH1:func>loc=genderH2:func>loc>gender
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 41 / 44
Conclusions and furter research
Outline
1 Dynamic social networks of colleagues in a large organization
2 Relational event modelModelCapturing the dynamic nature using a moving windowAnalysis I of email data: Estimation
3 Quantifying Statistical Evidence Using the Bayes FactorMethodologyAnalysis II of email data: Evolution of Statistical Evidence
4 Conclusions and furter research
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 42 / 44
Conclusions and furter research
Conclusions
Relational event model is useful for modeling email data in socialnetworks of colleagues.
Moving windows is useful to get insight about the dynamic behaviorover time.
The moving windows can be seen as a form of process memory.
Default Bayes factors can be used to see how statistical evidenceevolves over time.
Still many open problems...
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 43 / 44
Conclusions and furter research
Future work
Incorporate more realistic forms of process memory such as half-lifeperiods (e.g., Brandes et al., 2009), and estimate/test half-lifeperiods.
Improve efficiency of statistical computing.
Posterior predictive checking to evaluate model fit.
...
Mulder (Tilburg University) On the Evolution of Statistical Evidence in Dynamic Social NetworksTiburg University, Tilburg 44 / 44