multiple randomization designs for interference · 2019. 12. 17. · a wi" r c;tx binary treatment,...
TRANSCRIPT
-
Multiple Randomization Designs
for Interference
Pat Bajari, Brian Burdick, Guido Imbens#
James McQueen,Thomas Richardson&, & Ido Rosen
( Amazon, # Stanford University, & Washington University)
NABE, ASSA
San Diego, January 3th, 2020
-
Outline
1. Introduction & Motivation
2. Conventional Randomized Experiments (A/B Tests)
3. Multiple Randomization Designs (MRDs)
4. Analyzing a Simple Double Randomization Design
5. Extensions
6. Conclusion
1
-
1. Introduction & Motivation
In conventional experiments (A/B tests) units from well-defined population are randomly assigned to treatment.
Average outcomes are compared by treatment status.
Here: Two or more populations, with outcomes indexed byboth (movies/viewers, drivers/riders, properties/renters).
Could choose to randomize units from one of the two popu-lations
But: Could assign at the level of pair movie/viewer.
a Double randomization designs can detect spillovers/interference
a Double randomization designs can adjust for richer patternsof spillovers/interference
2
-
Example:
Population 1: Movies
Population 2: Viewers
Treatment: provide more information about movie to viewer(e.g., pictures instead of written description, or more detailabout ratings of previous viewers)
Decision: should we implement the treatment for all moviesand viewers or for no one?
Statistical Question to Inform Decision: By how muchwould exposing all movies/viewers to the treatment improveaverage viewer satisfaction?
Key: In experiment we can vary the treatment and measurethe outcome at the level of the movie/viewer pair.
3
-
2. Conventional Randomized Experiments (A/B Tests)
(goes back to Fisher (1935), Neyman (1928))
a Wi " rC,Tx binary treatment, randomly assigneda Population of N units (e.g., viewers or movies).
a Potential Outcomes defined: Yi�C�,Yi�T�, interest in averageeffect
τ �1N
N
9i�1
�Yi�T� � Yi�C�
a Estimator for average treatment effect:
τ̂ � Y T � Y C
4
-
3. Double Randomization Designs
Now suppose we have two populations:
a I Movies, i � 1, . . . , I.
a J Viewers, j � 1, . . . , J
Outcomes and treatments are measured for pair movie/viewer:
a Yij is outcome for movie i and viewer j
a Wij " rC,Tx is binary treatment for movie i and viewer jY and W are I � J matrices of outcomes and treatments.
Question: Design of Experiment, choice of distrib p�w�.5
-
Assignments:
W �
������������
C C C C C CC C C C C CT T T T T TC C C C C CT T T T T T
�����������Movie ExperimentRandomize Movies
W �
������������
C T C C T TC T C C T TC T C C T TC T C C T TC T C C T T
�����������Viewer ExperimentRandomize Viewers
6
-
Multiple Randomization Design
W �
������������������������
viewers� 1 2 3 4 5 6 7 8movies�
1 C C C C C C C C2 C C C C C C C C3 C C C C T T T T4 C C C C T T T T5 C C C C T T T T
�����������������������Randomize Both Movies and Viewers.
7
-
Assignment Types defined by consistency of treatment expe-rience:
T �
������������
c c c c c cc c c c c ct t t t t tc c c c c ct t t t t t
�����������
Movie ExperimentRandomize Movies
2 Groups to Compare1 Control Group
T �
������������
c t c c t tc t c c t tc t c c t tc t c c t tc t c c t t
�����������
Viewer ExperimentRandomize viewers
2 Groups to Compare1 Control Group
T �
������������
c c c iv iv ivc c c iv iv iv
im im im t t tim im im t t tim im im t t t
�����������
Double Random. DesignRandomize Both
4 Groups to Compare3 Control Groups to Compare
8
-
Problems with Movie Experiments
a Viewers do not have a consistent experience, treated for
some movies, not for other movies. They may shift engage-
ment from control movies to treated movies.
Formally: Violations of SUTVA (stable-unit-treatment-value-
assumption, Rubin) because there are spillovers from treating
movie/viewer �i, j� to different movie / same viewer �i¬, j�.If wij � w
¬
ij, but wi¬j j w¬
i¬j (for other movie i¬ but same viewer
j) then it may be that
Yij�w� j Yij�w¬�
9
-
Main Issues
a Both within movie and withing viewer spillovers/interference
may be present: Viewer or movie experiments cannot de-
tect the presence of such interference.
a Double randomization designs can detect spillovers: More
ex ante comparable groups that may differ systematically
ex post.
a Double randomization designs can adjust for spillovers
that viewer or movie experiments cannot adjust for: but
need to put structure on interference (role for models).
10
-
Double Randomization Designs
a Randomize movies into NM ' 2 groups with indicator WMi
a Randomize viewers into NV ' 2 groups with indicator WVj
Assignment is
Wij � f �WMi ,WVj For example NM � NV � 2 (Simple Double RandomizationDesign):
W � f �WM,WV ���������
C C C C C CC C C C C CC C C T T TC C C T T T
�������.
a Creates NM�NV (� 4 here) ex ante comparable groups thathave systematically different experiences ex post.
11
-
4. Analyzing a Simple Double Randomization Design
WMi " r0,1x, WVj " r0,1xWij � f �WMi ,WVj � w T if WMi �WVj � 1,C otherwise.
For example:
W � f �WM,WV ���������
C C C C C CC C C C C CC C C T T TC C C T T T
�������.
12
-
The pair �WMi ,WVj � defines four types of movie/viewer pairs:
Tij �
~
c �consistent control� if WM � 0,WVj � 0,im �inconsistent movie control� if WM � 1,WVj � 0,iv �inconsistent viewer control� if WM � 0,WVj � 1,t �treated� if WM � 1,WVj � 1.
The assignment and type matrices are
W �
��������C C C C C CC C C C C CC C C T T TC C C T T T
�������
T �
��������c c c c iv iv iv ivc c c c iv iv iv iv
im im im im t t t tim im im im t t t t
�������13
-
Limits on Spillovers: main assumption
The potential outcomes satisfy the local interference as-
sumption if
Yij�w� � Yij�w¬�,for any pair �i, j�, any w, and any w¬, with wij¬ � w¬ij¬ for allj¬
" J, and wi¬j � w¬
i¬j for all i¬
" I.
To change outcome for movie i and viewer j �Yij�w�� youneed to change treatment either for the same movie,
or for the same viewer (or both).
Could use richer models for spillovers.
14
-
Assignments with the same outcome for movie 3 and viewer
4 under local interference:
W �
��������C C T T C CC T C C C TT C C T T TT C C C C C
�������
W¬
�
��������T C T T C CT C T C C CT C C T T TT C C C T T
�������
W¬¬
�
��������C C T T T TC C C C T CT C C T T TT T C C T T
�������Implication � Y34�W� � Y34�W¬� � Y34�W¬¬�
-
Define:
wMi �
1J
J
9j�1
wij, wVj �
1I
I
9i�1
wij.
Assume: There are functions hij � r0,1x ( R, hMij � �0,1� ( Rand hVij � �0,1� ( R such that for all i " I, all j " J, and allw " W,
Yij�w� � hij�wij� � hMij �wMi � hVij �wVj . (1)h
Mij ��� �hVij���� captures movie (viewer) spillovers.
Define class of average effects
τ �pM, pV � 1IJ9i,j
�hij�1��hij�0�� 1IJ9i,j
hMij �pM� 1IJ9
i,j
hVij �pV16
-
Define the average outcome by movie/viewer pair type, for
s " rc, im, iv, tx:Y s �
1Ns
I
9i�1
J
9j�1
Yij1Tij�s
Estimators:
τ̂ � Y t � Y c estimates ave treatment effect τ �pM, pV�τ̂direct � Y t � Y im � Y iv � Y c estimates τ �0,0�Y im � Y c estimates τ �pM,0� � τ �0,0� �movie spillovers�Y iv � Y c estimates τ �0, pV� � τ �0,0� �viewer spillovers�
Exact variances of estimators: see paper.
17
-
5. Extensions
A. Multiple Randomization Designs
B. Equilibrium Designs
C. Clustered Double Randomization Designs
D. Synergistic double randomization designs
18
-
5.A Multiple Randomization Designs
W �
����������������
C C C C C C T TC C C C C C T TC C C C T T T TC C C C T T T TC C T T T T T TC C T T T T T T
���������������.
More ex ante comparable groups exposed to control treat-
ment (six in this example).
¼ More comparisons to shed light on interactions:
more variation in pM and pV in τ �pM, pV�.20
-
5.B Equilbrium Designs
Suppose a decision maker makes a promotion decision sep-
arately for each movie, taking into account the demand for
that movie given the treatment assignment for all movie/viewer
pairs. Hence the promotion for each movie may depend on
the fraction of viewers in the treatment group Pi � Pi�WMi �.Of particular interest are PTi � Pi�1�, the promotion if (al-most) all the viewers for movie i are exposed to the treat-
ment, and PCi � Pi�0�, the promotion if no or few viewers formovie i are exposed to the treatment.
Viewers make movie renting decisions that depend both on
the promotion and on the treatment status. Denote the
renting decision by viewer j for movie i, given price p and
given the treatment w " rC,Tx as Yij�w, p�.21
-
Experimental Design
First we split the viewers randomly into two groups, with
fractions pVA and pVB � 1 � p
VA. Let T
Vj " rA,Bx denote the
assignment for viewers to these two groups.
For the first group of viewers (viewers with assignment TVj �
A) we do a movie experiment. Let WMi " rC,Tx be theassignment for the movies in this experiment.
For the second group of viewers (viewers with assignment
TCj � B) we do a viewer experiment, with assignment W
Vj "rC,Tx for the viewers in this experiment.
22
-
For example (movies are rows, viewers are columns):
W �
�����������������������������������������������������
Movie Experiment V iewer Experimentviewers� 1 2 3 4 5 6 7 8 9 10 11 12 13 14
A A A A A A A A B B B B B Bmovies�
1 C C C C C C C C T C T C C T2 C C C C C C C C T C T C C T3 T T T T T T T T T C T C C T4 C C C C C C C C T C T C C T5 T T T T T T T T T C T C C T6 T T T T T T T T T C T C C T7 C C C C C C C C T C T C C T8 T T T T T T T T T C T C C T9 T T T T T T T T T C T C C T10 C C C C C C C C T C T C C T
����������������������������������������������������23
-
There are three sets of movie/viewer pairs that receive the
control treatment but have systematically different environ-
ments:
1. TCj � B, WVj � C, W
Mi � C (red C) in Viewer experiment,
assigned to control group, and the movie in the Movie
experiment is in the control group.
2. TCj � B, WVj � C, W
Mi � T (blue C) in Viewer experiment,
assigned to control group, but the movie in the Movie
experiment is in the treatment group.
3. TCj � A, WMi � C, (green C): in Movie experiment, as-
signed to control group.
24
-
Compare the red C and the blue C outcomes:
This comparison reflects on the (equilibrium) spillover effect
of other viewers for the same movie being in the treatment
(versus control) group.
The comparison between the average outcomes for these two
sets of pairs of viewers/movies is informative about the indi-
rect effect of the treatment on purchases through the effect
of average treatments on prices, that is, it is estimating an
average of Yij�C, PTi � � Yij�C, PCi �.
25
-
5.C Clustered Double Randomization Designs
a Suppose movies are clustered in groups.
a A regular movie experiment would be:
w �
���������������������������������
cluster�
1 C C C C C C C C1 T T T T T T T T2 T T T T T T T T2 T T T T T T T T3 C C C C C C C C3 T T T T T T T T4 C C C C C C C C4 T T T T T T T T
��������������������������������where the first column denotes the movie-cluster.
Such an experiment may suffer from biases if there are treat-ment effects that affect all movies within a cluster.
26
-
A cluster-randomized movie experiment would be
w �
���������������������������������
cluster�
1 C C C C C C C C1 C C C C C C C C2 T T T T T T T T2 T T T T T T T T3 C C C C C C C C3 C C C C C C C C4 T T T T T T T T4 T T T T T T T T
��������������������������������a Such a cluster-randomized movie experiment would be ro-
bust to interference within clusters of movies.
a It will not inform us about the presence of such interference.
27
-
We could combine this with double randomization in a num-
ber of ways.
5.C.1 We could do separate cluster-randomized movie ex-
periments for two sets of viewers:
w �
���������������������������������
A A A A B B B B
1 C C C C T T T T1 C C C C T T T T2 T T T T T T T T2 T T T T T T T T3 C C C C C C C C3 C C C C C C C C4 T T T T C C C C4 T T T T C C C C
��������������������������������a This would be informative about within-movie spillovers.
28
-
5.C.2 We could combine a cluster-randomized movie exper-
iments for one sets of viewers with an movie experiment for
a second set of viewers:
w �
���������������������������������
A A A A B B B B
1 C C C C T T T T1 C C C C C C C C2 T T T T T T T T2 T T T T T T T T3 C C C C T T T T3 C C C C C C C C4 T T T T C C C C4 T T T T C C C C
��������������������������������a This would be informative about within-movie spillovers.
29
-
5.D Synergistic Double Randomization Designs
W �
����������������
C C C C C C CC C C C C C CC C C C C T TC C C C C T TC C T T T C CC C T T T C C
���������������T �
����������������
c c iv iv iv iv ivc c iv iv iv iv iv
im im imv imv imv t tim im imv imv imv t tim im t t t imv imvim im t t t imv imv
���������������Here there are four sets of movie/viewer pairs that are ex-
posed to the control treatment:
1. consistent controls (c): movie/viewers where the movies
are always exposed to the control treatment, and viewers
who are always exposed to the control treatment.
30
-
2. inconsistent movie controls (im): movie/viewers where
the movies are exposed to the new treatment for other
viewers, but the viewers are always exposed to the control
treatment
3. inconsistent viewer controls (iv): movie/viewers where
the movies are always exposed to the control treatment,
but the viewers are exposed to the new treatment for
some other movies.
4. inconsistent viewer and movie controls (imv): movie/viewers
exposed to the control treatment where both the movies
and viewers have an inconsistent experience.
-
6. Conclusion
a In settings with multiple populations more complex experi-
mental designs are possible.
a Such designs (e.g. multiple randomization designs) can
answer more questions about interference/spillovers than con-
ventional designs by creating multiple comparison groups.
a Opens up lots of design questions.
a Opens up lots of inference questions.
a Important role for modeling (limits on) spillovers.
31