tmle analysis
DESCRIPTION
TMLE ANALYSIS. The Causal effect of display advertising on conversion. NYC Predictive Analytics Ori Stitelman ( [email protected] ) Brian Dalessandro Claudia Perlich foster provost August 11, 2011. QUESTION OF INTEREST. WHAT IS THE EFFECT OF DISPLAY ADVERTISING - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/1.jpg)
TMLE ANALYSIS
NYC PREDICTIVE ANALYTICSORI STITELMAN ([email protected])BRIAN DALESSANDROCLAUDIA PERLICHFOSTER PROVOSTAUGUST 11, 2011
THE CAUSAL EFFECT OF DISPLAY ADVERTISING ON CONVERSION
![Page 2: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/2.jpg)
2
QUESTIONOF INTEREST
WHAT IS THE EFFECT OFDISPLAY ADVERTISINGON CUSTOMER CONVERSION? ?
![Page 3: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/3.jpg)
3
WHAT ISTMLE?A SEMI-PARAMETRIC method
for estimating CAUSAL PARAMETERS
that DIRECTLY answer a
business question of interest?
![Page 4: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/4.jpg)
4
SEMI-PARAMETRICRealistic assumptions
CAUSAL PARAMETERSEffect/Impact (NOT Coefficients)
DIRECTLYActionable
WHAT ISTMLE?A SEMI-PARAMETRIC methodfor estimating CAUSAL PARAMETERSthat DIRECTLY answer abusiness question of interest?
![Page 5: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/5.jpg)
5
OUTLINE1. BACKGROUND: DISPLAY ADVERTISING &
2. PEEK AT RESULTS
3. A/B TESTING
4. ALTERNATIVE APPROACHES (NICE)
5. RESULTS
6. CONCLUSION
![Page 6: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/6.jpg)
6
THEBROWSERPROCESS
2. Use observed data to build list of prospects
3. Subsequently observe same browser surfing the web the next day
4. Browser visits a site where a display ad spot exists and bid requests are made
5. Auction is held for display spot
6. If auction is won display the ad
7. Observe browsers actions after displaying the ad
1. Observe people taking actions and visiting content
![Page 7: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/7.jpg)
7
RESULTSPREVIEW
RETARGETING
M6D
PROSPEC...
RETARGETING
M6D
PROSPEC...
RETARGETING
M6D
PROSPEC...
0%
2%
4%
6%
8%
10%
12%
14%
RELATIVE LIFT:EXPOSED VS. UNEXPOSED USERS
DID NOT SEE AD SAW ADCO
NVER
SIO
N RA
TE
1.05X
2.62X
1.11X
1.31X
0.92X2.26X
TELECOM COMPANY A
TELECOM COMPANY B
TELECOM COMPANY C
![Page 8: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/8.jpg)
8
P
GENERAL APPROACH
?
Ψ(P)
1. State Question
3. Define Parameter
Ψ(Pn)4. Estimate Parameter
2. Define Causal Assumption/Likelihood
![Page 9: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/9.jpg)
9
WHAT IS THE EFFECT OFDISPLAY ADVERTISING ON CUSTOMERCONVERSION?
?1. STATE QUESTION
DISPLAY ADVERTISINGShowing/Not showing a browser a display ad.
CUSTOMER CONVERSIONVisiting the advertisers website in the next 5 days.
![Page 10: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/10.jpg)
10
P2. DEFINE CAUSAL ASSUMPTIONS/LIKELIHOOD
O = (W,A,Y) ~ P0
W – Baseline VariablesA – Binary Treatment (Ad)Y – Binary Outcome (Site Visit)
P0(O) = P(W)P(A|W)P(Y|A,W)QW
QY
gA
![Page 11: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/11.jpg)
11
DATA STRUCTURE: OURVIEWERS
CHARACTERISTICS(W)
TREATMENT(A)
CONVERSION(Y)
Color Sex HeadShape
Ad No Ad
No Yes
![Page 12: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/12.jpg)
12
Ψ(P)3. DEFINE PARAMETER
P0(O) = P(W)P(A|W)P(Y|A,W)
P0,a(O) = P(W)P(Y|A=a,W)
P0,a(O) = P(W)P(Ya|W)=
LIKELIHOOD
DISTRIBUTION UNDER INTERVENTION
![Page 13: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/13.jpg)
13
Ψ(P)3. DEFINE PARAMETER
ΨAI(P)=E[YA=ad] – E[YA=no ad]
ΨRI(P)=E[YA=ad]/E[YA=no ad]
1. ADDITIVE IMPACT
2. RELATIVE IMPACT
P0,a(O) = P(W)P(Ya|W)
![Page 14: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/14.jpg)
14
E[YA=ad] E[YA=no ad]
φn,ad/φn,no ad
1. PARAMETERS ARE COMBINATIONS OF TREATMENT SPECIFIC CONVERSION RATES
2. SO WE CAN COMBINE ESTIMATES OF THESE THESE RATES
Ψ(Pn)4. ESTIMATE PARAMETER
φn,ad-φn,no ad
![Page 15: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/15.jpg)
15
Ψ(Pn)4. ESTIMATE PARAMETER
1. Optimal Experiment
2. A/B Testing
3. MLE Based Substitution Estimator (MLE)
4. Inverse Probability Estimators (IPTW)
5. Double Robust Estimating Equations (DR-IPW)
6. Targeted Maximum Likelihood Estimation (TMLE)
NICE
![Page 16: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/16.jpg)
16
OPTIMALEXPERIMENT:
3.6 per 1,000
1.2 per 1,000
Compare conversion rates of seeing an ad to conversion rate without ad for SAME individuals.
![Page 17: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/17.jpg)
17
OPTIMALEXPERIMENT:
BROWSER
SHOW AD?
OBSERVEOUTCOME
OR
OR
![Page 18: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/18.jpg)
18
REALITY: OR
OR
OR
OR
![Page 19: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/19.jpg)
19
OPTIMAL EXPERIMENT:Compare conversion rates of seeing an ad to conversion rate without ad for SAME individuals.
3.6 per 1,000
1.2 per 1,000
![Page 20: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/20.jpg)
20
COMMON APPROACH:A/B TESTINGSince we can not both treat and not treat SAME individuals. Randomization is used to create “EQUIVALENT” groups to treat and not treat.
3.4 per 1,000
1.6 per 1,000
![Page 21: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/21.jpg)
21
SIGNIFICANT COSTSASSOCIATED WITH DOING A/B TESTING WELL
1. Cost of displaying PSAs to the control(untreated group).
2. Overhead cost of implementing A/B test and ensuring that it is done CORRECTLY (Kohavi et al.)
3. Wait time necessary to evaluate the results.
![Page 22: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/22.jpg)
22
NON INVASIVECAUSAL ESTIMATION (NICE)Estimate The Effects In The Natural Environment (Observed Data)
![Page 23: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/23.jpg)
23
“WHAT IF”CAUSAL ANALYSIS ADJUSTING FOR CONFOUNDING
Need to adjust for the fact that the group that saw the advertisement and the group that didn’t may be very different.
![Page 24: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/24.jpg)
24
TWO MACHINES:QY,n AND gA,n
gA
QY
Pn(A| )
Pn(Y| )Pn(Y| )
gA,n
QY,n
![Page 25: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/25.jpg)
ESTIMATING QY AND gA
1. Many tools exist for estimating binary conditional distributions.
1. Logistic regression, SVM, GAM, Regression Trees, etc.2. Data adaptive estimation methods that use cross
validation.3. SuperLearner (R package, Eric Polley)
2. Causal analyses will benefit from the advances in parallelized routines for data adaptive estimation.
![Page 26: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/26.jpg)
26
INVERSE PROBABILITY WEIGHTED ESTIMATORS (IPTW)
•Adjust For Confounding Through gn.
•Weight individuals that are unlikely to be shown an advertisement more than individuals that are likely to be shown an advertisement
STRATEGY
ESTIMATOR
Pn(A| )=1/2
=
1/10
Pn(A| )
![Page 27: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/27.jpg)
QY
QY
MLE BASED SUBSTITUTION ESTIMATOR (MLE)
•Adjust For Confounding Through Q.•Predict how each observed browser will behave had they been shown an ad and had they not been shown an ad.
STRATEGY
ESTIMATOR
QY,n
QY,n27
![Page 28: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/28.jpg)
28
DOUBLE ROBUST ESTIMATORSWhat if QY,n or gA,n are broken? The MLE based estimator and IPTW rely on consistent estimates of Q and g respectively.
gA
QY
P(A| )
P(Y| )P(Y| )
gA,n
QY,n
![Page 29: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/29.jpg)
29
OR
STRATEGY
ESTIMATOR
AUGMENTED – IPTW (A-IPTW)
•Adjust For Confounding Through Q and g.
•Augments IPTW estimator with information from Q.
•Alternatively adjusts MLE with information from g.
![Page 30: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/30.jpg)
Ψ(QY)
TARGETED MAXIMUM LIKELIHOOD ESTIMATOR (TMLE)
•Adjust For Confounding Through Q and g.
•Predict how each observed browser will behave had they been shown ad and had they not been shown ad.
•The new machine Q* is calibrated with concern for the parameter of interest.
•R package “tmle” at CRAN
STRATEGY
ESTIMATORQY
Ψ(QY)QY
Q*Y,n
Q*Y,n
30
![Page 31: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/31.jpg)
CREATING Q*YN
•QYn is updated to Q*Yn using a clever covariate that is a function of g.
•Update is done through the use of a parametric submodel.
•Update is a univariate regression with the initial QYn as an offset.
QY
gA
Ψ(QY)QY
Q*Y,n
31
![Page 32: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/32.jpg)
32
SAMPLING/ANALYSIS1. Select Prospects that we got a bid request for on day
0.
2. Observe if treated on day 1. For those treated A=1 and those not treated A=0. Collect W.
3. Create outcome window that is the next five days following treatment and observe if event occurs.
4. Estimate parameters using the methods previously described.
![Page 33: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/33.jpg)
33
RESULTS
![Page 34: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/34.jpg)
34
IPTW AND A-IPTW BLOW UP UNDER POSITIVITY VIOLATIONS
![Page 35: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/35.jpg)
35
METHOD VALIDATION: A/B TEST VS TMLE
![Page 36: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/36.jpg)
36
METHOD VALIDATION: NEGATIVE TEST
![Page 37: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/37.jpg)
37
GROSSCONVERSIONRATESARE ONLY PARTOF THE STORY
RETARGETING
M6D
PROSPECTING
RETARGETING
M6D
PROSPECTING
RETARGETING
M6D
PROSPECTING
0.0%
0.1%
0.2%
0.3%
0.4%
0.5%
0.6%
0.7%
0.8%
0.9%
1.0%
0.5%
0.9%
0.1%
0.8%0.7%
ADDITIVE IMPACT: EXPOSED VS. UNEXPOSED USERS
ADDI
TIVE
IMPA
CT IN
CO
NVER
SIO
N RA
TE
-0.2%
TELECOM COMPANY A
TELECOM COMPANY B
TELECOM COMPANY C
![Page 38: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/38.jpg)
38
EFFECTIVENESSVARIES BYMARKETER& CAMPAIGN
RETARGETING PROSPECTING RETARGETING PROSPECTING0%
2%
4%
6%
8%
10%
12%
14%
16%
RELATIVE IMPACTDID NOT SEE AD SAW AD
CONV
ERSI
ON
RATE
B2B COMPANY
A
B2B COMPANY
B
1.08X
1.08X4.23X
3.77X
![Page 39: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/39.jpg)
39
RETARGETING
M6D P
ROSPECTING
RETARGETING
M6D P
ROSPECTING
RETARGETING
M6D P
ROSPECTING
RETARGETING
M6D P
ROSPECTING
0.0%
1.0%
2.0%
3.0%
4.0%
5.0%
6.0%
7.0%
8.0%
RELATIVE IMPACTDID NOT SEE AD SAW AD
CONV
ERSI
ON
RATE
1.71X1.34X
2.57X
1.17X
3.00X
2.33X
1.84X
1.11X
CAR RENTAL RESTAURANTHOTELAIRLINE
Effective creative and
targeting drive lift across
marketers and verticals
![Page 40: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/40.jpg)
TEC
HN
OLO
GY
A
TRA
VE
L A
B2B
A
TRA
VE
L B
TELE
CO
M A
TRA
VE
L C
TELE
CO
M B
TRA
VE
L D
RE
TAIL
A
RE
TAIL
B
TRA
VE
L E
ED
UC
ATI
ON
A
TRA
VE
L F
RE
STA
UR
AN
TA
AU
TO A
RE
TAIL
C
RE
STA
UR
AN
T B
TRA
VE
L G
ED
UC
ATI
ON
B
TEC
HN
OLO
GY
B
RE
TAIL
D
TELE
CO
M C
RE
TAIL
E
AU
TO B
ED
UC
ATI
ON
C
FIN
AN
CE
A
TRA
VE
L H
B2B
B
RE
TAIL
F
TRA
VE
L I
-100%
0%
100%
200%
300%
400%
500%
600%
700%
800%
-100%
0%
100%
200%
300%
400%
500%
600%
700%
800%
M6D PROSPECTING LIFT RETARGETING LIFT
M6D
PRO
SPEC
TING
LIF
T
RETA
RGET
ING
LIF
T
For 25 out of 30 marketers,
relative lift was
higher for prospecting
candidates than for retargeting
candidates .
SUMMARY RESULTSAVERAGE RELATIVE LIFT OF 90% FOR M6D PROSPECTING
![Page 41: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/41.jpg)
41
CONCLUSIONS1. Estimating causal effects allows one to directly estimate the
impact of the display advertisement.
2. Causal effects can be estimated in the observed data.
3. Display advertising works in varied ways. Causal analysis allows us to estimate effects on a case by case basis.
4. All methods for estimating effects are not equal.
5. These methods may be used for assessing other types of causal effects.
6. What is next?
![Page 42: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/42.jpg)
42
VISIT OUR BLOGmedia6degrees.com/blog
![Page 43: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/43.jpg)
43
OTHER RESOURCES FOR TMLE
Enter “Slide Show” view and click image to link to site.
![Page 44: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/44.jpg)
44
ACKNOWLEDGEMENTS
Edward CaprioloBrian DalessandroRod HookBrian MayRyan OttobreClaudia PerlichTom Phillips
Mark van der LaanSusan GruberEric Polley
Foster Provost
![Page 45: TMLE ANALYSIS](https://reader035.vdocuments.site/reader035/viewer/2022062521/5681688e550346895ddf139c/html5/thumbnails/45.jpg)
45
REFERENCES1. O. Stitelman, B. Dalessandro, C. Perlich, and F. Provost. Estimating The Effect Of Online Display
Advertising On Browser Conversion. In Proceedings of KDD, Annual International Workshop on Data Mining and Audience Intelligence for Online Advertising, ADKDD ’11.
2. M. van der Laan and S. Rose. Targeted Learning: Causal Inference for Observational and Experimental Data. New York, NY: Springer Publishing Company, 2011. http://www.targetedlearningbook.com/
3. ‘tmle’ R Package http://cran.r-project.org/web/packages/tmle/index.html
4. R. Kohavi and R. Longbotham. Unexpected results in online controlled experiments. ACM SIGKDD Explorations Newsletter, 12(2):31–35, 2010.
5. R. Lewis and D. Reiley. Does retail advertising work: Measuring the effects of advertising on sales via a controlled experiment on yahoo. Technical report, Working paper, 2010.
6. D. Chan, R. Ge, O. Gershony, T. Hesterberg, and D. Lambert. Evaluating online ad campaigns in a pipeline: causal models at scale. In Proceedings of KDD, KDD ’10, pages 7–16, New York, NY, USA, 2010. ACM.