multiple equation gmm with common coefficients: panel data · key insight: with panel data, δcan...
TRANSCRIPT
![Page 1: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/1.jpg)
Multiple Equation GMM with Common
Coefficients: Panel Data
Eric Zivot
Winter 2013
![Page 2: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/2.jpg)
Multi-equation GMM with common coefficients
Example (panel wage equation)
69 = + 69 + + 69 + 1
80 = + 80 + + 80 + 2
Note: common coefficients across two equations and (1 2) 6= 0
![Page 3: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/3.jpg)
The general multi-equation model with common coefficients δ is
= z01×
δ×1
+ = 1 ; = 1
[z] 6= 0 for some element of
The model for the equation is
y×1
= Z×
δ×1
+ ε×1
= 1
Note
δ1 = · · · = δ = δ
![Page 4: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/4.jpg)
The stacked system (across equations) is
y1 = Z1δ + ε1...
y = Zδ + ε
⎫⎪⎬⎪⎭⎡⎢⎣ y1
...y
⎤⎥⎦ =⎡⎢⎣ Z1
...Z
⎤⎥⎦ δ +⎡⎢⎣ ε1
...ε
⎤⎥⎦
This is in the form of a giant regression (with common coefficients)
y×1
= Z(×)
δ(×1)
+ ε×1
![Page 5: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/5.jpg)
Assume each equation has instruments x satisfying
[x] = 0
Very often the same instruments x are used for each equation.
The moment conditions are
g(δ)×1
=
⎡⎢⎣ x11...
x
⎤⎥⎦ =⎡⎢⎣ x1(1 − z01δ)...x( − z0δ)
⎤⎥⎦There are a total of =
P=1 moment conditions.
![Page 6: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/6.jpg)
Identification
[g(δ)] = 0 for δ = δ06= 0 for δ 6= δ0
Identification implies that⎡⎢⎣ [x11]...
[x ]
⎤⎥⎦−⎡⎢⎣ [x1z
01]...
[xz0 ]
⎤⎥⎦ δ0 = 0or
σ×1
− Σ(×)
δ0(×1)
= 0×1
For δ0 to be the unique solution requires the rank condition
rank(Σ) =
![Page 7: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/7.jpg)
Remarks:
• In the model with common coefficients, some equations may be individuallyunidentified but identified within the system.
• A sufficient condition for identification is that [xz0] have full rank for some equation
![Page 8: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/8.jpg)
The moments g(δ) are a MDS with
[g(δ0)g(δ0)0] = S =
⎛⎜⎝ S11 · · · S1... . . . ...S1 · · · S
⎞⎟⎠ S = [xx0]
Efficient GMM estimation
min
(δ S−1) = g(δ)0S−1g(δ)
g(δ) = S − Sδ
S =
⎡⎢⎣ S11...S
⎤⎥⎦ S =⎡⎢⎣ S11...S
⎤⎥⎦Straightforward algebra gives
δ(S−1) = (S0S−1S)−1S0S
−1S
![Page 9: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/9.jpg)
Special cases
1. 3SLS with common coefficients
conditional homoskedasticity
x1 = x2 = · · · = x = x
S = S3 = Σ⊗[xx0]
2. SUR with common coefficients (Random effects)
conditional homoskedasticity
x1 = x2 = · · · = x = x
x = union(z1 z)
S = S = Σ⊗[zz0]
![Page 10: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/10.jpg)
Random Effects Estimator (Efficient GMM with common coefficients)
The SUR model with common coefficients is the giant regression
y×1
= Z(×)
δ(×1)
+ ε Z =
⎡⎢⎣ Z1...Z
⎤⎥⎦[εε0] = Σ⊗ I
Σ =
⎛⎜⎝ 11 · · · 1... . . . ...1 · · ·
⎞⎟⎠
The FGLS estimator is the random effects (RE) estimator and has the form
δ = (Z0(Σ
−1 ⊗ I)Z)−1Z0(Σ−1 ⊗ I)y
![Page 11: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/11.jpg)
The elements of Σ are usually estimated using the pooled OLS estimator ofthe giant regression
δ = (Z0Z)−1Z0y
and forming
= (y − Zδ)0(y − Zδ)
Note: Hayashi uses a somewhat non-standard version of the RE estimator. Heassumes that Σ is unstructured. Most other treatments of the RE estimatorassume a special structure for Σ based on an error components representationfor In this case, the FGLS estimator uses a different estimator for Σ
![Page 12: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/12.jpg)
Simplifying the RE estimator
It turns out that the RE estimator may also be derived from an alternativerepresentation of the giant regression.
In the alternative representation, the system is stacked by observations insteadof by equations. We can do this because δ is the same across equations. Define
y×1
=
⎡⎢⎣ 1...
⎤⎥⎦ Z×
=
⎡⎢⎣ z01...z0
⎤⎥⎦ ε×1
=
⎡⎢⎣ 1...
⎤⎥⎦Then, the model for the individual across all equations is
y×1
= Z(×)
δ(×1)
+ ε×1
= 1
[εε0] = Σ
×[εε
0] = 0
![Page 13: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/13.jpg)
Example: 2 period panel wage equation
y =
Ã6980
!Z =
Ã1 69 691 80 80
!δ = ( )0 ε = (1 2)
0
Σ =
Ã11 1212 22
!
![Page 14: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/14.jpg)
The giant regression stacked by observations has the form⎡⎢⎣ y1...y
⎤⎥⎦ =⎡⎢⎣ Z1...Z
⎤⎥⎦ δ +⎡⎢⎣ ε1...ε
⎤⎥⎦or
y×1
= Z(×)
δ(×1)
+ ε×1
[εε0]×
=
⎛⎜⎝ [ε1ε01] · · · [ε1ε
0 ]... . . . ...
[εε01] · · · [εε0 ]
⎞⎟⎠=
⎛⎜⎝ Σ 0 00 . . . 00 0 Σ
⎞⎟⎠ = I ⊗Σ
![Page 15: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/15.jpg)
The FGLS or RE estimator is
δ = (Z0(I ⊗ Σ−1)Z)−1(Z0(I ⊗ Σ−1)y
Now
Z0(I ⊗ Σ−1)Z
=hZ01· · ·Z0
i⎛⎜⎝ Σ−1 0 00 . . . 0
0 0 Σ−1
⎞⎟⎠⎡⎢⎣ Z1...Z
⎤⎥⎦=
X=1
Z0Σ−1Z
![Page 16: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/16.jpg)
and
Z0(I ⊗ Σ−1)y
=hZ01· · ·Z0
i⎛⎜⎝ Σ−1 0 00 . . . 0
0 0 Σ−1
⎞⎟⎠⎡⎢⎣ y1...y
⎤⎥⎦=
X=1
Z0Σ−1y
Therefore,
δ =
⎛⎝ X=1
Z0Σ−1Z
⎞⎠−1 X=1
Z0Σ−1y
![Page 17: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/17.jpg)
Asymptotic Properties of the RE estimator
Consistency of the RE estimator follows from the usual manipulations
δ − δ =
⎛⎝−1 X=1
Z0Σ−1Z
⎞⎠−1 −1 X=1
Z0Σ−1ε
as long as
−1X=1
Z0Σ−1ε
→ 0
the RE estimator will be consistent. This will occur, for example, if Z isuncorrelated with (no endogeneity).
![Page 18: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/18.jpg)
Asymptotic normality also follows from the usual manipulations:
√(δ − δ) =
⎛⎝−1 X=1
Z0Σ−1Z
⎞⎠−1 −12 X=1
Z0Σ−1ε
Under the SUR model assumptions
−1X=1
Z0Σ−1Z
→ [Z0Σ−1Z]
−12X=1
Z0Σ−1ε
→ (0 [Z0Σ−1Z])
so that√(δ − δ)
→ (0 [Z0Σ−1Z]
−1)
![Page 19: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/19.jpg)
Pooled OLS again ⎡⎢⎣ y1...y
⎤⎥⎦ =⎡⎢⎣ Z1...Z
⎤⎥⎦ δ +⎡⎢⎣ ε1...ε
⎤⎥⎦or
y×1
= Z(×)
δ(×1)
+ ε×1
The pooled OLS estimator may also be computed using the giant regressionstacked by observations:
δ = (Z0Z)−1Z0y
=
⎛⎝ X=1
Z0Z
⎞⎠−1 X=1
Z0y
The asymptotic properties of the OLS estimator are straightforward to derive.
![Page 20: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/20.jpg)
For asymptotic normality,
√(δ − δ) =
⎛⎝−1 X=1
Z0Z
⎞⎠−1 −12 X=1
Z0ε
−1X=1
Z0Z→ [Z0Z]
−12X=1
Z0ε→ (0 [Z0εε
0Z])
Note, by iterated expectations and conditional homoskedasticity
[Z0εε0Z] = [[Z0εε
0Z|Z]
= [Z0[εε0]Z]|Z]
= [Z0ΣZ]
![Page 21: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/21.jpg)
Therefore,√(δ − δ)
→ [Z0Z]−1 ×(0 [Z0ΣZ])
≡ (0 [Z0Z]−1[Z0ΣZ][Z
0Z]
−1)
Equivalently,
δ∼ (δ −1 davar(δ))
davar(δ) =
⎛⎝−1 X=1
Z0Z
⎞⎠−1⎛⎝−1 X=1
Z0Σ−1Z
⎞⎠×⎛⎝−1 X
=1
Z0Z
⎞⎠−1
Question: When is pooled OLS efficient?
![Page 22: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/22.jpg)
Introduction to Panel Data
= z0δ +
= 1 (individuals); = 1 (time periods)
y×1
= Z(×)
δ(×1)
+ ε {yZ} is i.i.d.
Main question: Is z uncorrelated with ?
1. If yes, then we have a SUR type model with common coefficients. Underconditional homoskedasticity, the appropriate estimator is the RE estimator.
2. If no, then we have a multi-equation system with endogenous regressors. Weneed to use an estimation procedure to deal with the endogeneity. In the panelset-up, under certain assumptions, we can deal with the endogeneity withoutusing instruments using the so-called fixed effects (FE) estimator.
![Page 23: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/23.jpg)
Error Components Assumption
= + = unobserved fixed effect
[z] = 0
[εε0|x] = Σ x = union(z1 z)
The fixed effect component (which is actually an unobserved random vari-able) captures unobserved heterogeneity across individuals that is fixed overtime.
With the error components assumption, the RE and FE models are defined asfollows:
RE model: [z] = 0
FE model: [z] 6= 0
Note: Hayashi’s treatment of error components model is slightly non-standard.
![Page 24: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/24.jpg)
Example: Panel wage equation
69 = + 69 + + 69 + + 69
80 = + 80 + + 80 + + 80
[] 6= 0
Here captures unobserved ability that is correlated with . It is assumedthat ability does not vary over time.
Note: The existence of guarantees across equation error correlation:
[6980] = [³ + 69
´ ³ + 80
´]
= [2 ] +[80] +[69] +[6980]
6= 0
![Page 25: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/25.jpg)
Remark: With panel data, the endogeneity due to unobserved endogeneity (i.e.,[z] 6= 0) can be eliminated without the use of instruments. To see this,consider the difference in log-wages over time:
80 − 69 = (− ) + (80 − 69)
+(80 −69) + ( − ) +³80 − 69
´= (80 − 69) + (80 −69) +
³80 − 69
´Hence, we can consistently estimate and by using the first differenced data!
![Page 26: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/26.jpg)
Fixed Effects Estimation
Key insight: With panel data, δ can be consistently estimated without usinginstruments.
There are 3 equivalent approaches
1. Within group estimator
2. Least squares dummy variable estimator
3. First difference estimator
![Page 27: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/27.jpg)
Within group estimator
To illustrate the within group estimator consider the simplified panel regressionwith a single regressor
= + + [] 6= 0
[] = 0
Trick to remove fixed effect : First, for each average over time
= + +
=1
X=1
=1
X=1
=1
X=1
![Page 28: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/28.jpg)
Second, form the transformed regression
− = ( − ) + ( − ) + −
or
= +
The FE estimator is pooled OLS on the transformed regression (stacked byobservation)
= (0)−10
=
⎛⎝ X=1
0
⎞⎠−1 X=1
0
![Page 29: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/29.jpg)
Remarks
1. If z does not vary with (e.g. z = z) then z = 0 and we cannotestimate δ
2. Must be careful computing the degrees of freedom for the FE estimator.There are total observations and parameters in δ so it appreas thatthere are− degrees of freedom. However, you lose 1 degree of freedom foreach fixed effect eliminated. So the actual degrees of freedom are−− =( − 1)−
![Page 30: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/30.jpg)
Hayashi Notation for Within Estimator
Consider the general model (assume all variables vary with and )
= z0δ + +
Stack the observations over giving
y×1
= Z(×)
δ(×1)
+ (1×1)
1(×1)
+ η×1
Define
Q×
= I − 1(101)−110= I −P
P = 1(101)
−110 =−1110
![Page 31: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/31.jpg)
Note
P1 = 1 Q1 = 0
Py = 1(101)
−110y= 1
Qy = (I −P)y= y − 1
= y
![Page 32: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/32.jpg)
The transformed error components model is then
Qy = QZδ + Q1 +Qη⇒ y = Zδ + η
The giant regression (stacked by observation) is⎡⎢⎣ y1...y
⎤⎥⎦ =⎡⎢⎣ Z1...Z
⎤⎥⎦ δ +⎡⎢⎣ η1...η
⎤⎥⎦or
y×1
= Z(×)
δ(×1)
+ η×1
Note: Unless [ηη] = σ2I δ = δ is not efficient.
![Page 33: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/33.jpg)
The FE estimator is again pooled OLS on the transformed system
δ =
⎛⎝ X=1
Z0Z
⎞⎠−1 X=1
Z0y
=
⎛⎝ X=1
(QZ)0QZ
⎞⎠−1 X=1
(QZ)0Qy
=
⎛⎝ X=1
Z0QZ
⎞⎠−1 X=1
Z0Qy
since Q is idempotent.
![Page 34: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/34.jpg)
Least Squared Dummy Variable Model
Consider the general model
= z0δ + +
Stack the observations over giving
y×1
= Zδ + 1 + η
Now create the giant regression⎡⎢⎣ y1...y
⎤⎥⎦ =⎡⎢⎣ Z1...Z
⎤⎥⎦ δ +⎡⎢⎣ 1 0 00 . . . 00 0 1
⎤⎥⎦⎡⎢⎣ 1...
⎤⎥⎦+⎡⎢⎣ η1...η
⎤⎥⎦or
y×1
= Z(×)
δ(×1)
+ D(×)
α(×1)
+ η(×1)
= Zδ + (I ⊗ 1)α+ η
D = I ⊗ 1
![Page 35: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/35.jpg)
Aside: Partitioned Regression
Consider the partitioned regression equation
y×1
= X1×1
β11×1
+ X2×2
β22×2
+ ε
The LS estimators for β1 and β2 can be expressed as
β1 = (X01Q2X1)−1X01Q2y Q2 = I −PX2
β2 = (X02Q1X2)−1X02Q1y Q1 = I −PX1
where
PX1 = X1(X01X1)
−1X01 PX2 = X2(X02X2)
−1X02
![Page 36: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/36.jpg)
Result: The FE estimator is the OLS estimator of δ in the giant regression
δ = (Z0QZ)−1Z0Qy
Q = I −P P = D(D0D)−1D0 D = I ⊗ 1Now,
P = (I ⊗ 1)h(I ⊗ 1)0 (I ⊗ 1)
i−1(I ⊗ 1)0
= (I ⊗ 1) [I ⊗ 101 ]−1 (I ⊗ 1)0
= (I ⊗ 1) [I ⊗³101
´−1]³I ⊗ 10
´= I ⊗P
Therefore,
Q = I −P= I − (I ⊗P)= I ⊗Q
![Page 37: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/37.jpg)
As a result
δ = (Z0QZ)−1Z0Qy
= (Z0 (I ⊗Q)Z)−1Z0 (I ⊗Q)y
=
⎛⎝ X=1
Z0Z
⎞⎠−1 X=1
Z0y
which is exactly the result we got when we transformed the model by subtractingoff group means.
![Page 38: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/38.jpg)
Recovering Estimates of α
With the LSDV approach, it is straightforward to deduce estimates for Byexamining the normal equations for α in the Giant Regression, one can deducethat
= − z0δ
=1
X=1
z =1
X=1
z
![Page 39: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/39.jpg)
FE as a First Difference Estimator
Consider the error components model
= z0δ + +
[ηη0] = 2I
Note: The spherical error assumption is made so that GLS estimation is possi-ble.
Take 1st differences over :
∆2 = 2 − 1 = ∆z02δ +∆2...∆ = − (−1) = ∆z0δ +∆
Notice that differencing removes the fixed effect.
![Page 40: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/40.jpg)
In matrix form the model is
C0y(−1)×1
= C0Zδ +C0η
or (in Hayashi notation)
y = Zδ + η
where y = C0y Z = C0Z η = C0η and
C0(−1)×
=
⎛⎜⎜⎜⎝−1 1 0 00 −1 1 0
0 0 0 −1 1
⎞⎟⎟⎟⎠Note that
[ηη0] = [C0ηη
0C] = C
0[ηη0]C =2C
0C
![Page 41: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/41.jpg)
The transformed Giant Regression is⎡⎢⎣ y1...y
⎤⎥⎦ =⎡⎢⎣ Z1...Z
⎤⎥⎦ δ +⎡⎢⎣ η1...η
⎤⎥⎦or
y(−1)×1
= Z((−1)×)
δ(×1)
+ η(−1)×1
Notice that
[ηη0](−1)×(−1)
=
⎛⎜⎜⎜⎜⎝2C
0C 0 · · · 0
0 2C0C · · · 0
... ... . . . ...0 0 · · · 2C
0C
⎞⎟⎟⎟⎟⎠ = 2hI ⊗ (C0C)
i
![Page 42: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/42.jpg)
Pooled GLS on the transformed regression gives
δ =hZ0³I ⊗ (C0C)−1
´Zi−1
Z0³I ⊗ (C0C)−1
´y
=
⎛⎝ X=1
Z0C(C0C)−1C0Z
⎞⎠−1 X=1
Z0C(C0C)−1C0y
=
⎛⎝ X=1
Z0PZ
⎞⎠−1 X=1
Z0Py
=
⎛⎝ X=1
Z0QZ
⎞⎠−1 X=1
Z0Qy
=
⎛⎝ X=1
Z0Z
⎞⎠−1 X=1
Z0y
![Page 43: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/43.jpg)
Here
Z0³I ⊗ (C0C)−1
´Z
=hZ01· · · Z
0
i⎛⎜⎝ (C0C)−1 0 00 . . . 0
0 0 (C0C)−1
⎞⎟⎠⎡⎢⎣ Z1...Z
⎤⎥⎦=
X=1
Z0(C0C)−1Z
=X=1
(C0Z)0(C0C)−1C0Z
=X=1
Z0C(C0C)−1C0Z =
X=1
Z0PZ P = C(C
0C)−1C0
![Page 44: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/44.jpg)
The result
P = C(C0C)−1C0 = Q = I − 1(101)−110
follows from the result that
C01 =
⎛⎜⎜⎜⎝−1 1 0 00 −1 1 0
0 0 0 −1 1
⎞⎟⎟⎟⎠⎛⎜⎜⎜⎝11...1
⎞⎟⎟⎟⎠ =⎛⎜⎜⎜⎝00...0
⎞⎟⎟⎟⎠That is, P = C(C
0C)−1C0 is an idempotent matrix satisfying P1 = 0
It projects onto the space orthogonal to 1 and this is exactly what Q
does.
![Page 45: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/45.jpg)
The FE estimator is a GMM estimator
Using the first-difference transformation of the error components model
y = Zδ + η
the FE estimator can be viewed as a GMM estimator of the form
δ(W) =³S¯0WS
¯´−1
S¯0WS
¯
where
S¯
=1
X=1
Z ⊗ x x = union of (1 )
S¯
=1
X=1
y ⊗ x
W = (C0C)−1 ⊗ S−1
![Page 46: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/46.jpg)
Asymptotic Distribution of FE estimator
Assumptions:
1. y = Zδ + ε = 1
2. ε = 1 + η (error components)
3. {yZ} is i.i.d.
4. [] 6= 0 (Endogeneity)
5. [] = 0 for = 1 2
![Page 47: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/47.jpg)
It follows that
√³δ − δ
´=
⎛⎝1
X=1
Z0Z
⎞⎠−1 1√
X=1
Z0η
→ [Z0Z]−1(0 [Z0ηη
0Z])
≡ (0 avar(δ))
where
avar(δ) = [Z0Z]−1[Z0ηη
0Z][Z
0Z]
−1
Therefore,
δ ∼ (δ −1 davar(δ))Remark: The sandwich form of avar(δ) suggests that it is an inefficientGMM estimator.
![Page 48: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/48.jpg)
General Case: Conditional Heteroskedasticity
1
X=1
Z0Z→ [Z0Z]
1
X=1
Z0ηη0Z→ [Z0ηη
0Z]
where η = y − Zδ Then a consistent estimate for avar(δ) is
davar(δ) =
⎛⎝1
X=1
Z0Z
⎞⎠−1 ×⎛⎝1
X=1
Z0ηη0Z
⎞⎠−1
×⎛⎝1
X=1
Z0Z
⎞⎠−1
![Page 49: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/49.jpg)
Remarks
• The formula for davar(δ) is called the panel robust covariance estimate.It is not the same as the usual White correction for heteroskedasticity in apooled OLS regression. The White correction does not account for serialcorrelation. It is also not the Newey-West correction for heteroskedasticityand autocorrelation.
• The formula for davar(δ) is also known as the cluster robust covarianceestimate when the clustering variable is (each individual is a cluster)
![Page 50: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/50.jpg)
Special Case: Conditional Homoskedasticity
Assume: [εε0|x] = Σ x = union(z1 z)
Then it can be shown that (Hayashi page 332)
[Z0ηη0Z] = [Z0[ηη
0]Z] = [Z0VZ] V = [ηη
0]
and
V =1
X=1
ηη0→ [ηη
0] where η = (y − Zδ)
![Page 51: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/51.jpg)
Then a consistent estimate for avar(δ) is
davar(δ) =
⎛⎝1
X=1
Z0Z
⎞⎠−1 ×⎛⎝1
X=1
Z0VZ
⎞⎠−1
×⎛⎝1
X=1
Z0Z
⎞⎠−1
![Page 52: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/52.jpg)
Special Case: When η is Spherical
[ηη0] = 2I ⇒ [ηη
0] = 2Q
This implies that there is no serial correlation (correlation across time)
Then
[Z0ηη0Z] = [Z0[ηη
0]]Z] = 2[Z
0Z]
and
avar(δ) = 2[Z0Z]
−1
![Page 53: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/53.jpg)
Then a consistent estimate for avar(δ) is
davar(δ) = 2
⎛⎝1
X=1
Z0Z
⎞⎠−1
2 =1
− −
X=1
η0η
η = (y − Zδ)
Remark: In this case the FE estimator is an efficient GMM estimator!
![Page 54: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/54.jpg)
Hausman Specification Test for FE vs. RE
The hypotheses to be tested are
0 : [z] = 0 (RE estimation)
1 : [z] 6= 0 (FE estimation)
Hausman and Taylor (1981, Ecta) considered a test statistic based on
q = δ − δ
in the context of maximum likelihood estimation. Newey (1985, see alsoHayashi Chapter 3, Analytic Exercise 9) studied the test in the context ofGMM.
![Page 55: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/55.jpg)
Intuition: Under 0 both δ and δ are consistent (but δ is efficient)so that
q→ 0
Under 1 δ is not consistent but δ is consistent so that
q9 0
Therefore, consider the test statistic
= q (davar(q))−1 qIf is big then reject 0; otherwise do not reject 0
Q1: What is davar(q)?Q2: What is the asymptotic distribution of ?
![Page 56: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/56.jpg)
To answer these questions consider the general linear GMM model
= z0δ + = 1
[x] = 0 [xx02 ] = S
Let δ1(W1) denote an inefficient GMM estimator with weight matrix W1→
W1 6= S−1
Let δ2(W2) denote another inefficient GMM estimator with weight matrixW2
→W2 6=W1 6= S−1
The following results are based on Newey 1985 (See also Hayashi, Chapter 3,Analytic Exercise 9)
![Page 57: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/57.jpg)
Result 1 ⎡⎣ √ ³δ1(W1)− δ´
√³δ2(W2)− δ
´ ⎤⎦ →
"Ã00
!
"A11 A12A012 A22
##
where
A11 = avar(δ1(W1))
A22 = avar(δ2(W2))
A12 = acov(δ1(W1) δ2(W2))
![Page 58: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/58.jpg)
Define
q = δ1(W1)− δ2(W2)
Result 2:√q
→ (0 avar(q))
avar(q) = A11 +A22 −A12 −A012= avar(δ1(W1)) + avar(δ2(W2))
−acov(δ1(W1) δ2(W2))− acov(δ1(W1) δ2(W2))0
![Page 59: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/59.jpg)
Result 3: Set W2 = S−1 where S
→ S Then
A12 = A012 = A22 = avar(δ2(S
−1))
and
avar(q) = A11 +A22 − 2A22 = A11 −A22= avar(δ1(W1))− avar(δ2(S−1))
Remark: Because δ2(S−1) is the efficient GMM estimator avar(q) ≥ 0
![Page 60: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/60.jpg)
Applying Newey’s result to
q = δ − δ
gives
avar(q) = avar(δ)− avar(δ)
and
= q (davar(q))−1 q ∼ 2()
Therefore, we reject 0 : [z] = 0 at the × (100)% level if
21−()
Remark: Hayashi shows (Appendix to chapter 5) that avar(q) 0 so thatavar(q)−1 is well defined.
![Page 61: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/61.jpg)
Remarks
The form of avar(q) = avar(δ) − avar(δ) depends on assumptionsabout heteroskedasticity.
Typical case: Assume conditional homoskedasticity
√(δ − δ)
→ (0 [Z0Σ−1Z]
−1)√³δ − δ
´→
(0 [Z0Z]−1[Z0[ηη
0]Z][Z
0Z]
−1)
![Page 62: Multiple Equation GMM with Common Coefficients: Panel Data · Key insight: With panel data, δcan be consistently estimated without using instruments. There are 3 equivalent approaches](https://reader034.vdocuments.site/reader034/viewer/2022050405/5f823b2c4872da6b2b35904e/html5/thumbnails/62.jpg)
Then
davar(δ) =
⎛⎝1
X=1
Z0Z
⎞⎠−1 ×⎛⎝1
X=1
Z0VZ
⎞⎠−1
×⎛⎝1
X=1
Z0Z
⎞⎠−1
davar(δ) =
⎛⎝−1 X=1
Z0Σ−1Z
⎞⎠