using copulas to deal with endogeneity - an application to

67
Introduction Methodology Results Conclusion References Using copulas to deal with endogeneity An application to development economics Sanne Blauw and Philip Hans Franses Summer School in Development Economics Alba di Canazei, July 16 2013 Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Upload: others

Post on 19-Oct-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

Using copulas to deal with endogeneityAn application to development economics

Sanne Blauw and Philip Hans Franses

Summer School in Development EconomicsAlba di Canazei, July 16 2013

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 2: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

Overview

Introduction

MethodologyEndogeneityCopulasEstimation

ResultsSimulationsTelephone use in Uganda

Conclusion

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 3: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 4: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

Motivation

I Endogeneity is a common problem in (development)economics

I Simultaneous causalityI Omitted variablesI Measurement error

I E.g. the impact of telephone use on development

I We need exogenous and relevant instruments

I What if we cannot find exogenous instruments?

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 5: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

Motivation

I Endogeneity is a common problem in (development)economics

I Simultaneous causalityI Omitted variablesI Measurement error

I E.g. the impact of telephone use on development

I We need exogenous and relevant instruments

I What if we cannot find exogenous instruments?

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 6: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

Motivation

I Endogeneity is a common problem in (development)economics

I Simultaneous causalityI Omitted variablesI Measurement error

I E.g. the impact of telephone use on development

I We need exogenous and relevant instruments

I What if we cannot find exogenous instruments?

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 7: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Endogeneity

I Regression modely = Xβ + ε (1)

I Endogeneity impliesE (X ′ε) 6= 0 (2)

β is biased and inconsistent.

Key idea: Let’s model the correlation between X and ε anduse this information in the likelihood function to obtainconsistent estimates.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 8: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Endogeneity

I Regression modely = Xβ + ε (1)

I Endogeneity impliesE (X ′ε) 6= 0 (2)

β is biased and inconsistent.

Key idea: Let’s model the correlation between X and ε anduse this information in the likelihood function to obtainconsistent estimates.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 9: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Endogeneity

I Regression modely = Xβ + ε (1)

I Endogeneity impliesE (X ′ε) 6= 0 (2)

β is biased and inconsistent.

Key idea: Let’s model the correlation between X and ε anduse this information in the likelihood function to obtainconsistent estimates.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 10: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Copulas

I Park, S. and Gupta, S. (2012). Handling endogenousregressors by joint estimation using copulas. MarketingScience, 31(4):567-586.

I Use copulas to model correlation between X and ε.

I A copula is a function that maps multiple CDFs to their jointCDF.

Sklar’s theorem

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 11: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Copulas

I Park, S. and Gupta, S. (2012). Handling endogenousregressors by joint estimation using copulas. MarketingScience, 31(4):567-586.

I Use copulas to model correlation between X and ε.

I A copula is a function that maps multiple CDFs to their jointCDF.

Sklar’s theorem

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 12: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Copulas

I Park, S. and Gupta, S. (2012). Handling endogenousregressors by joint estimation using copulas. MarketingScience, 31(4):567-586.

I Use copulas to model correlation between X and ε.

I A copula is a function that maps multiple CDFs to their jointCDF.

Sklar’s theorem

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 13: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 14: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 15: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 16: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

I Regression model

y = x1β1 + X2β2 + ε, (3)

where x1 is endogenous and X2 is exogenous.

I Joint CDF (using Gaussian copula)

G (x1, ε) = N(Φ−1(Fx(x1)),Φ−1(Fε(ε))), (4)

where Φ denotes the standard normal CDF, N is the bivariatestandard normal CDF with correlation coefficient ρ, Fx and Fε

are the marginal CDFs of x1 and ε.

I Joint PDF

g(x1, ε) =δδG (x1, ε)

δx1δεfx fε, (5)

where fx and fε are the marginal PDFs of x1 and ε.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 17: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

I Regression model

y = x1β1 + X2β2 + ε, (3)

where x1 is endogenous and X2 is exogenous.

I Joint CDF (using Gaussian copula)

G (x1, ε) = N(Φ−1(Fx(x1)),Φ−1(Fε(ε))), (4)

where Φ denotes the standard normal CDF, N is the bivariatestandard normal CDF with correlation coefficient ρ, Fx and Fε

are the marginal CDFs of x1 and ε.

I Joint PDF

g(x1, ε) =δδG (x1, ε)

δx1δεfx fε, (5)

where fx and fε are the marginal PDFs of x1 and ε.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 18: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

I Regression model

y = x1β1 + X2β2 + ε, (3)

where x1 is endogenous and X2 is exogenous.

I Joint CDF (using Gaussian copula)

G (x1, ε) = N(Φ−1(Fx(x1)),Φ−1(Fε(ε))), (4)

where Φ denotes the standard normal CDF, N is the bivariatestandard normal CDF with correlation coefficient ρ, Fx and Fε

are the marginal CDFs of x1 and ε.

I Joint PDF

g(x1, ε) =δδG (x1, ε)

δx1δεfx fε, (5)

where fx and fε are the marginal PDFs of x1 and ε.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 19: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Two methods

I Copula method 1: MLEI Assumption: Linearity

I Copula method 2: Including generated regressorx∗1 = Φ−1(Fx(x1)) in OLS Proof

I Assumptions: Linearity, Gaussian copula and ε ∈ N(0, σ2ε )

I We use F̂x , the empirical CDF of x1: x̂∗1 = Φ−1(F̂x(x1)).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 20: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Two methods

I Copula method 1: MLEI Assumption: Linearity

I Copula method 2: Including generated regressorx∗1 = Φ−1(Fx(x1)) in OLS Proof

I Assumptions: Linearity, Gaussian copula and ε ∈ N(0, σ2ε )

I We use F̂x , the empirical CDF of x1: x̂∗1 = Φ−1(F̂x(x1)).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 21: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Two methods

I Copula method 1: MLEI Assumption: Linearity

I Copula method 2: Including generated regressorx∗1 = Φ−1(Fx(x1)) in OLS Proof

I Assumptions: Linearity, Gaussian copula and ε ∈ N(0, σ2ε )

I We use F̂x , the empirical CDF of x1: x̂∗1 = Φ−1(F̂x(x1)).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 22: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

EndogeneityCopulasEstimation

Two methods

I Copula method 1: MLEI Assumption: Linearity

I Copula method 2: Including generated regressorx∗1 = Φ−1(Fx(x1)) in OLS Proof

I Assumptions: Linearity, Gaussian copula and ε ∈ N(0, σ2ε )

I We use F̂x , the empirical CDF of x1: x̂∗1 = Φ−1(F̂x(x1)).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 23: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Simulations

Data generating process

Variable Mean Std.Er.true beta -1

beta ols #200 0.692 0.215beta ols #400 0.685 0.151beta ols #1000 0.690 0.096

beta iv #200 -1.031 0.316beta iv #400 -1.005 0.22beta iv #1000 -1.005 0.14

beta copula #200 -0.885 1.015beta copula #400 -0.906 0.711beta copula #1000 -0.968 0.448

Replications 1000

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 24: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Multicollinearity

I Endogenous and generated regressor can be highly correlated,implying multicollinearity.

I Multicollinearity is an efficiency problem.I Indicators of multicollinearity

I High correlation between endogenous and generated regressorI Joint significance, but separately insignificantI Inflated standard errorsI Variance inflation factor (VIF) > 10

Table 1: Simulation results VIF for copula method

Variable Mean Std. Dev.vif #200 22.622 4.306vif #400 22.42 2.975vif #1000 22.311 1.777Replications 1000

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 25: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Multicollinearity

I Endogenous and generated regressor can be highly correlated,implying multicollinearity.

I Multicollinearity is an efficiency problem.

I Indicators of multicollinearity

I High correlation between endogenous and generated regressorI Joint significance, but separately insignificantI Inflated standard errorsI Variance inflation factor (VIF) > 10

Table 1: Simulation results VIF for copula method

Variable Mean Std. Dev.vif #200 22.622 4.306vif #400 22.42 2.975vif #1000 22.311 1.777Replications 1000

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 26: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Multicollinearity

I Endogenous and generated regressor can be highly correlated,implying multicollinearity.

I Multicollinearity is an efficiency problem.I Indicators of multicollinearity

I High correlation between endogenous and generated regressorI Joint significance, but separately insignificantI Inflated standard errorsI Variance inflation factor (VIF) > 10

Table 1: Simulation results VIF for copula method

Variable Mean Std. Dev.vif #200 22.622 4.306vif #400 22.42 2.975vif #1000 22.311 1.777Replications 1000

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 27: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Multicollinearity

I Endogenous and generated regressor can be highly correlated,implying multicollinearity.

I Multicollinearity is an efficiency problem.I Indicators of multicollinearity

I High correlation between endogenous and generated regressorI Joint significance, but separately insignificantI Inflated standard errorsI Variance inflation factor (VIF) > 10

Table 1: Simulation results VIF for copula method

Variable Mean Std. Dev.vif #200 22.622 4.306vif #400 22.42 2.975vif #1000 22.311 1.777Replications 1000

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 28: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Empirical data

I What is the impact of telephone use on economicdevelopment of households? Literature

I Data from Uganda (N=196), collected March-April 2010I Economic development: Progress out of Poverty Index (PPI)

Scorecard

I Telephone use

I Proportion of mobile phone users in householdI Log years mobile phone ownership (HoH)I Log mobile phone calls per week (HoH)I Log public phone calls per week (HoH)

I Estimation

I Heteroskedasticity robust standard errorsI Nonparametric bootstrap for standard errors of generated

regressor

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 29: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Empirical data

I What is the impact of telephone use on economicdevelopment of households? Literature

I Data from Uganda (N=196), collected March-April 2010I Economic development: Progress out of Poverty Index (PPI)

Scorecard

I Telephone useI Proportion of mobile phone users in householdI Log years mobile phone ownership (HoH)I Log mobile phone calls per week (HoH)I Log public phone calls per week (HoH)

I Estimation

I Heteroskedasticity robust standard errorsI Nonparametric bootstrap for standard errors of generated

regressor

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 30: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Empirical data

I What is the impact of telephone use on economicdevelopment of households? Literature

I Data from Uganda (N=196), collected March-April 2010I Economic development: Progress out of Poverty Index (PPI)

Scorecard

I Telephone useI Proportion of mobile phone users in householdI Log years mobile phone ownership (HoH)I Log mobile phone calls per week (HoH)I Log public phone calls per week (HoH)

I EstimationI Heteroskedasticity robust standard errorsI Nonparametric bootstrap for standard errors of generated

regressor

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 31: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

ols copulaConstant 3.129***

(0.19)Proportion mobile phone users in household 0.843***

(0.13)Education, head of household (years) 0.098**

(0.05)Farmer -0.134**

(0.06)Household size 0.071

(0.06)Area 1 -0.028

(0.07)Area 2 -0.189***

(0.07)Generated regressor 1

Observations 193R-squared 0.354Normality of endogenous variable (p-value)Joint sign. generated and endogenous (p-value)Correlation generated and endogenousVIF

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 32: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

ols copulaConstant 3.129*** 3.293***

(0.19) (0.20)Proportion mobile phone users in household 0.843*** 0.349

(0.13) (0.37)Education, head of household (years) 0.098** 0.091*

(0.05) (0.05)Farmer -0.134** -0.123*

(0.06) (0.06)Household size 0.071 0.046

(0.06) (0.06)Area 1 -0.028 -0.023

(0.07) (0.07)Area 2 -0.189*** -0.175**

(0.07) (0.07)Generated regressor 1 0.132

(0.10)Observations 193 193R-squared 0.354 0.358Normality of endogenous variable (p-value)Joint sign. generated and endogenous (p-value)Correlation generated and endogenousVIF

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 33: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

ols copulaConstant 3.129*** 3.293***

(0.19) (0.20)Proportion mobile phone users in household 0.843*** 0.349

(0.13) (0.37)Education, head of household (years) 0.098** 0.091*

(0.05) (0.05)Farmer -0.134** -0.123*

(0.06) (0.06)Household size 0.071 0.046

(0.06) (0.06)Area 1 -0.028 -0.023

(0.07) (0.07)Area 2 -0.189*** -0.175**

(0.07) (0.07)Generated regressor 1 0.132

(0.10)Observations 193 193R-squared 0.354 0.358Normality of endogenous variable (p-value) 0.000Joint sign. generated and endogenous (p-value) 0.000Correlation generated and endogenous 0.958VIF 15.980

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 34: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

ols copulaConstant 3.555*** 3.296***

(0.16) (0.25)Log years mobile phone ownership 0.181*** 0.456**

(0.04) (0.21)Education, head of household (years) 0.081 0.093*

(0.05) (0.05)Farmer -0.170*** -0.183***

(0.06) (0.06)Household size -0.126** -0.124**

(0.05) (0.05)Area 1 -0.022 -0.011

(0.08) (0.08)Area 2 -0.140* -0.132*

(0.07) (0.07)Generated regressor 1 -0.279

(0.20)Observations 192 192R-squared 0.327 0.335Normality of endogenous variable (p-value) 0.000Joint sign. generated and endogenous (p-value) 0.000Correlation generated and endogenous 0.984VIF 32.813

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 35: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

ols copulaConstant 3.568*** 4.064***

(0.17) (0.37)Log mobile phone calls per week 0.122*** -0.306

(0.04) (0.27)Education, head of household (years) 0.096* 0.098*

(0.05) (0.05)Farmer -0.180*** -0.185***

(0.07) (0.07)Household size -0.134** -0.126**

(0.05) (0.05)Area 1 -0.063 -0.050

(0.08) (0.08)Area 2 -0.143* -0.121

(0.08) (0.08)Generated regressor 1 0.570

(0.35)Observations 193 193R-squared 0.283 0.301Normality of endogenous variable (p-value) 0.001Joint sign. generated and endogenous (p-value) 0.001Correlation generated and endogenous 0.987VIF 44.014

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 36: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

ols copulaConstant 3.660*** 3.552***

(0.17) (0.18)Log public phone calls per week -0.066 0.271*

(0.06) (0.16)Education, head of household (years) 0.152*** 0.157***

(0.06) (0.06)Farmer -0.239*** -0.227***

(0.06) (0.06)Household size -0.114** -0.122**

(0.05) (0.05)Area 1 -0.101 -0.097

(0.08) (0.08)Area 2 -0.234*** -0.216***

(0.07) (0.07)Generated regressor 1 -0.249**

(0.11)Observations 193 193R-squared 0.250 0.271Normality of endogenous variable (p-value) 0.000Joint sign. generated and endogenous (p-value) 0.052Correlation generated and endogenous 0.939VIF 8.716

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 37: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Summary of results

Significant Multicollinearity

Proportion of mobile phone users in household No Yes

Log years mobile phone ownership Yes Yes

Log mobile phone calls per week No Yes

Log public phone calls per week Yes No

Size of impact

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 38: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

SimulationsTelephone use in Uganda

Summary of results

Significant Multicollinearity

Proportion of mobile phone users in household No Yes

Log years mobile phone ownership Yes Yes

Log mobile phone calls per week No Yes

Log public phone calls per week Yes No

Size of impact

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 39: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

Conclusion

I Exploration of copula method, an instrument-free method tohandle endogeneity.

I Mobile and public phone use has a positive causal effect oneconomic development.

I However, multicollinearity poses problems in some cases.

I This method is not the holy grail. It seems like you have to belucky with the distribution of the endogenous regressor and/orthe size of the impact!

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 40: Using copulas to deal with endogeneity - An application to

IntroductionMethodology

ResultsConclusionReferences

THANK YOU!

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 41: Using copulas to deal with endogeneity - An application to

Appendix

Chen, S., Schreiner, M., and Woller, G. (2008). Progress out ofPoverty IndexTM: A Simple Poverty Scorecard for Kenya.Technical report, Grameen Foundation.

Park, S. and Gupta, S. (2012). Handling endogenous regressors byjoint estimation using copulas. Marketing Science,31(4):567–586.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 42: Using copulas to deal with endogeneity - An application to

Appendix

Copulas

Sklar’s theoremLet H be a joint distribution function with margins F and G. Thenthere exists a copula C such that for all x,y in R,

H(x , y) = C (F (x),G (y)), (6)

Conversely, if C is a copula and F and G are distribution functions,then the function H defined by (6) is a joint distribution functionwith margins F and G.

Jump back

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 43: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I x∗1 = Φ−1(Fx(x1))

I ε∗ = Φ−1(Fε(ε))I [x∗1 ε

∗]′ follows bivariate standard normal distribution(Assumption 1: Gaussian copula). Gaussian copula

I (x∗1ε∗

)=

(1 0

ρ√

1− ρ2

)(ν1

ν2

), (7)

where ν1 and ν2 are independent random variables drawnfrom a standard normal distribution.

I Or:ε∗ = ρν1 +

√1− ρ2ν2 = ρx∗1 +

√1− ρ2ν2. (8)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 44: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I x∗1 = Φ−1(Fx(x1))I ε∗ = Φ−1(Fε(ε))

I [x∗1 ε∗]′ follows bivariate standard normal distribution

(Assumption 1: Gaussian copula). Gaussian copula

I (x∗1ε∗

)=

(1 0

ρ√

1− ρ2

)(ν1

ν2

), (7)

where ν1 and ν2 are independent random variables drawnfrom a standard normal distribution.

I Or:ε∗ = ρν1 +

√1− ρ2ν2 = ρx∗1 +

√1− ρ2ν2. (8)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 45: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I x∗1 = Φ−1(Fx(x1))I ε∗ = Φ−1(Fε(ε))I [x∗1 ε

∗]′ follows bivariate standard normal distribution(Assumption 1: Gaussian copula). Gaussian copula

I (x∗1ε∗

)=

(1 0

ρ√

1− ρ2

)(ν1

ν2

), (7)

where ν1 and ν2 are independent random variables drawnfrom a standard normal distribution.

I Or:ε∗ = ρν1 +

√1− ρ2ν2 = ρx∗1 +

√1− ρ2ν2. (8)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 46: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I x∗1 = Φ−1(Fx(x1))I ε∗ = Φ−1(Fε(ε))I [x∗1 ε

∗]′ follows bivariate standard normal distribution(Assumption 1: Gaussian copula). Gaussian copula

I (x∗1ε∗

)=

(1 0

ρ√

1− ρ2

)(ν1

ν2

), (7)

where ν1 and ν2 are independent random variables drawnfrom a standard normal distribution.

I Or:ε∗ = ρν1 +

√1− ρ2ν2 = ρx∗1 +

√1− ρ2ν2. (8)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 47: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I x∗1 = Φ−1(Fx(x1))I ε∗ = Φ−1(Fε(ε))I [x∗1 ε

∗]′ follows bivariate standard normal distribution(Assumption 1: Gaussian copula). Gaussian copula

I (x∗1ε∗

)=

(1 0

ρ√

1− ρ2

)(ν1

ν2

), (7)

where ν1 and ν2 are independent random variables drawnfrom a standard normal distribution.

I Or:ε∗ = ρν1 +

√1− ρ2ν2 = ρx∗1 +

√1− ρ2ν2. (8)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 48: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I ε∗ = ρx∗1 +√

1− ρ2ν2.

I Remember: ε∗ = Φ−1(Fε(ε)).

I By Assumption 2 (normally distributed structural error):

ε = F−1ε (Φ(ε∗)) = Φ−1

σ2ε

(Φ(ε∗)) = σεε∗. (9)

I Including (9) in the regression model:

y = x1β1 + X2β2 + ε (10)

= x1β1 + X2β2 + σε(ρx∗1 + (√

1− ρ2)ν2).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 49: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I ε∗ = ρx∗1 +√

1− ρ2ν2.

I Remember: ε∗ = Φ−1(Fε(ε)).

I By Assumption 2 (normally distributed structural error):

ε = F−1ε (Φ(ε∗)) = Φ−1

σ2ε

(Φ(ε∗)) = σεε∗. (9)

I Including (9) in the regression model:

y = x1β1 + X2β2 + ε (10)

= x1β1 + X2β2 + σε(ρx∗1 + (√

1− ρ2)ν2).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 50: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I ε∗ = ρx∗1 +√

1− ρ2ν2.

I Remember: ε∗ = Φ−1(Fε(ε)).

I By Assumption 2 (normally distributed structural error):

ε = F−1ε (Φ(ε∗)) = Φ−1

σ2ε

(Φ(ε∗)) = σεε∗. (9)

I Including (9) in the regression model:

y = x1β1 + X2β2 + ε (10)

= x1β1 + X2β2 + σε(ρx∗1 + (√

1− ρ2)ν2).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 51: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I ε∗ = ρx∗1 +√

1− ρ2ν2.

I Remember: ε∗ = Φ−1(Fε(ε)).

I By Assumption 2 (normally distributed structural error):

ε = F−1ε (Φ(ε∗)) = Φ−1

σ2ε

(Φ(ε∗)) = σεε∗. (9)

I Including (9) in the regression model:

y = x1β1 + X2β2 + ε (10)

= x1β1 + X2β2 + σε(ρx∗1 + (√

1− ρ2)ν2).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 52: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I Structural error: ε = σε(ρx∗1 + (√

1− ρ2)ν2).

I (1) σερx∗1 (correlated with x1)I (2) σε(

√1− ρ2)ν2 (uncorrelated with x1)

I New regression model:

y = x1β1 + X2β2 + σερx∗1 + σε(√

1− ρ2)ν2. (11)

I Key result: New structural error is not correlated with x1.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 53: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I Structural error: ε = σε(ρx∗1 + (√

1− ρ2)ν2).I (1) σερx∗1 (correlated with x1)I (2) σε(

√1− ρ2)ν2 (uncorrelated with x1)

I New regression model:

y = x1β1 + X2β2 + σερx∗1 + σε(√

1− ρ2)ν2. (11)

I Key result: New structural error is not correlated with x1.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 54: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I Structural error: ε = σε(ρx∗1 + (√

1− ρ2)ν2).I (1) σερx∗1 (correlated with x1)I (2) σε(

√1− ρ2)ν2 (uncorrelated with x1)

I New regression model:

y = x1β1 + X2β2 + σερx∗1 + σε(√

1− ρ2)ν2. (11)

I Key result: New structural error is not correlated with x1.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 55: Using copulas to deal with endogeneity - An application to

Appendix

Proof copula method 2

Jump back

I Structural error: ε = σε(ρx∗1 + (√

1− ρ2)ν2).I (1) σερx∗1 (correlated with x1)I (2) σε(

√1− ρ2)ν2 (uncorrelated with x1)

I New regression model:

y = x1β1 + X2β2 + σερx∗1 + σε(√

1− ρ2)ν2. (11)

I Key result: New structural error is not correlated with x1.

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 56: Using copulas to deal with endogeneity - An application to

Appendix

Data generating process

Jump back

I From Park and Gupta (2012): ε∗

x∗

z∗

∼ N

000

, 1 0.5 0

0.5 1 0.80 0.8 1

(12)

I ε = F−1ε (Φ(ε∗)) = Φ−1(Φ(ε∗)) = ε∗ [error, standard normal]

I x = F−1x (Φ(x∗1 )) = Φ(x∗) [endogenous, uniform]

I z = Φ(z∗) [instrument, uniform]

I Dependent variable

y = β · x + ε = −1 · x + ε (13)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 57: Using copulas to deal with endogeneity - An application to

Appendix

Data generating process

Jump back

I From Park and Gupta (2012): ε∗

x∗

z∗

∼ N

000

, 1 0.5 0

0.5 1 0.80 0.8 1

(12)

I ε = F−1ε (Φ(ε∗)) = Φ−1(Φ(ε∗)) = ε∗ [error, standard normal]

I x = F−1x (Φ(x∗1 )) = Φ(x∗) [endogenous, uniform]

I z = Φ(z∗) [instrument, uniform]

I Dependent variable

y = β · x + ε = −1 · x + ε (13)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 58: Using copulas to deal with endogeneity - An application to

Appendix

Data generating process

Jump back

I From Park and Gupta (2012): ε∗

x∗

z∗

∼ N

000

, 1 0.5 0

0.5 1 0.80 0.8 1

(12)

I ε = F−1ε (Φ(ε∗)) = Φ−1(Φ(ε∗)) = ε∗ [error, standard normal]

I x = F−1x (Φ(x∗1 )) = Φ(x∗) [endogenous, uniform]

I z = Φ(z∗) [instrument, uniform]

I Dependent variable

y = β · x + ε = −1 · x + ε (13)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 59: Using copulas to deal with endogeneity - An application to

Appendix

Data generating process

Jump back

I From Park and Gupta (2012): ε∗

x∗

z∗

∼ N

000

, 1 0.5 0

0.5 1 0.80 0.8 1

(12)

I ε = F−1ε (Φ(ε∗)) = Φ−1(Φ(ε∗)) = ε∗ [error, standard normal]

I x = F−1x (Φ(x∗1 )) = Φ(x∗) [endogenous, uniform]

I z = Φ(z∗) [instrument, uniform]

I Dependent variable

y = β · x + ε = −1 · x + ε (13)

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 60: Using copulas to deal with endogeneity - An application to

Appendix

Distributions of endogenous regressor

Jump back

Jump to table with parameters

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 61: Using copulas to deal with endogeneity - An application to

Appendix

Correlation between endogenous and generated regressor

Jump back

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 62: Using copulas to deal with endogeneity - An application to

Appendix

Table 2: Parameters for simulated distributions

parametersuniform a = 0, b = 1

normal µ = 0, σ2 = 1bimodal P[N(0, 1)] = 0.5, P[N(5, 1)] = 0.5qmodal P[N(0, 1)] = 0.25, P[N(5, 1)] = 0.25, P[N(10, 1)] = 0.25, P[N(15, 1)] = 0.25chi2 df = 2beta1 α = β = 0.5beta2 α = 5, β = 1bernouilli P[X = 0] = 0.5, P[X = 1] = 0.5discrete P[X = 0] = 0.2, P[X = 1] = 0.2, P[X = 2] = 0.2, P[X = 3] = 0.2, P[X = 4] = 0.2poisson λ = 4nbinomial r = 4, p = 0.5

Jump back

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 63: Using copulas to deal with endogeneity - An application to

Appendix

Literature

I Economic growth (Kathuria et al., 2009; Waverman et al.,2005)

I [M]ay be twice as large in developing countries compared todeveloped countries. (Waverman et al., 2005)

I PricesI 20% reduction in grain prices across Nigerian markets (Aker,

2008)I 5-7% increase in price of onions of farmers in Philippines (Lee

and Bellemere, 2012)I [N]ear-perfect adherence to the Law of One Price in the

South-Indian fisheries sector (Jensen, 2007)I Market participation

I Increase in market participation for farmers in Uganda growingperishable crops in remote areas (Muto and Yamano, 2009)

Jump back

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 64: Using copulas to deal with endogeneity - An application to

Appendix

Table 3: Economic development scorecard

Question Answer Points

1. How many household members A: 3 or more 0are aged 25 or younger? B: 0, 1 or 2 82. How many household members A: Not all 0aged 6 to 17 are currently attending school? B: All 8

C: No children aged 6 to 17 213 What is the material of the walls of A: Mud/cow dung/grass/sticks 0the house? B: Other 54. What kind of toilet facility does A: Other 0your household use? B: Flush to sewer; flush to septic tank; 2

pan/bucket; covered pit latrine;or ventilation improved pit latrine

5. Does the household own a TV? A: No 0B: Yes 16

6. Does the household own a sofa? A: No 0B: Yes 14

7. Does the household own a stove? A: No 0B: Yes 12

8. Does the household own a radio? A: No 0B: Yes 8

9. Does the household own a bicycle? A: No 0B: Yes 5

10. How many head of cattle are A: None or unknown 0owned by the household currently? B: 1 or more 9

Note: The scorecard is a reproduction of the scorecard in Chen et al. (2008).

Jump backSanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 65: Using copulas to deal with endogeneity - An application to

Appendix

Table 4: Descriptive statistics of economic development for the fullsample and for the three geographic areas

Total Area 1 Area 2 Area 3

Mean 37.14 40.38 31.69 40.92Median 37.00 37.00 27.00 42.00Maximum 86.00 86.00 67.00 67.00Minimum 9.00 9.00 10.00 10.00Std. Dev. 15.87 17.83 14.40 13.96Observations 196 56 77 63

Average poverty likelihood (%) 35.98 35.62 44.61 35.13County poverty level (%) 32.31 22.60 29.00 45.00

Jump back

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 66: Using copulas to deal with endogeneity - An application to

Appendix

Jump backSanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity

Page 67: Using copulas to deal with endogeneity - An application to

Appendix

Size of impact of years of ownership

Jump back

Roughly...

I Increasing the time of ownership from 2.5 (the average) to 3.5years (40%) increases the PPI score with 20% (coefficient is0.5).

I On an average PPI of 37, this means an increase from 37 to44.

I This corresponds with a drop in poverty likelihood of 2.6%(from 35.4% to 32.8%, see Chen et al., 2008).

Sanne Blauw and Philip Hans Franses Using copulas to deal with endogeneity