[part 7] 1/68 stochastic frontiermodels panel data stochastic frontier models william greene stern...
TRANSCRIPT
[Part 7] 1/68
Stochastic FrontierModels
Panel Data
Stochastic Frontier ModelsWilliam Greene
Stern School of Business
New York University
0 Introduction1 Efficiency Measurement2 Frontier Functions3 Stochastic Frontiers4 Production and Cost5 Heterogeneity6 Model Extensions7 Panel Data8 Applications
[Part 7] 2/68
Stochastic FrontierModels
Panel Data
Main Issues in Panel Data Modeling
Issues Capturing time invariant effects Dealing with time variation in inefficiency Separating heterogeneity from Inefficiency Examining technical change and total factor
productivity growth Contrasts – Panel Data vs. Cross Section
[Part 7] 3/68
Stochastic FrontierModels
Panel Data
Technical Change Technical Change
LnOutputit = f(xit,zi,,t) + vit - uit.
LnCostit = c(xit,zi,,t) + vit + uit. Independent of other factors, TC = f(..)/t Change in output not explained by change in factors
or environment – shift in production or cost function Time shift the goal function.
Lnyit = xit + zi + t + vit - uit.
[Part 7] 4/68
Stochastic FrontierModels
Panel Data
Familiar RE and FE Models
Wisdom from the linear model FE: y(i,t) = f[x(i,t)] + a(i) + e(i,t)
What does a(i) capture? Nonorthogonality of a(i) and x(i,t) The LSDV estimator
RE: y(i,t) = f[x(i,t)] + u(i) + e(i,t) How does u(i) differ from a(i)? Generalized least squares and maximum likelihood
What are the time invariant effects?
[Part 7] 5/68
Stochastic FrontierModels
Panel Data
The Cross Section Departure Point: 1977
2
2
2
Aigner et al. (ALS) Stochastic Frontier Model
~ [0, ]
| | and ~ [0, ]
Jondrow et al. (JLMS) Inefficiency Estimator
( )ˆ [ | ]
1 ( )
,
i i i i
i v
i i i u
ii i i i
i
ui i i
y v u
v N
u U U N
u E u
v u
x
2 2, ,
iv u i
v
[Part 7] 6/68
Stochastic FrontierModels
Panel Data
A Frontier Model for Panel Data
y(i,t) = β’x(i,t) – u(i) + v(i,t) Effects model with time invariant inefficiency Same dichotomy between FE and RE –
correlation with x(i,t). FE case is completely unlike the assumption in the
cross section case
[Part 7] 7/68
Stochastic FrontierModels
Panel Data
The Panel Data Models Appear: 1981
2 2
1
Pitt and Lee Random Effects Approach: 1981
~ [0, ], | | and ~ [0, ]
Counterpart to Jondrow et al. (1982)
( / )ˆ [ | ,..., ]
1 ( / )
=
it it it i
it v i i i u
it it i
ii i i iT i
i
ii
y v u
v N u U U N
v u
u E u
T
x
2
2 ,
1 1
u u
vT T
Time fixed
[Part 7] 8/68
Stochastic FrontierModels
Panel Data
Estimating Technical Efficiency
2* 2 2
*2
1, , (1 )( ),
1u
i i i i i i vv iT
[Part 7] 9/68
Stochastic FrontierModels
Panel Data
2
2 2
2
( ) exp , 02
[ ]2
4[ ]
2
exp( )
u u
u
u
ui u
u uf u u
E u
Var u
ih
Stochastic Frontiers with a Rayleigh DistributionGholamreza Hajargasht, Department of Economics, University of Melbourne, 2013
[Part 7] 10/68
Stochastic FrontierModels
Panel Data
Rayleigh vs. Half NormalSwiss Railway Data
2 2 22 *
2 2 2 2
* 2 2 *
* *
Technical Efficiency Estimator in the Rayleigh Model
, ,
( ) ( ) ( )
( ) ( )
u u v ii i i i
v i u v u
i i i ii
i i i
TT
TE
Rayleigh
Half Normal
[Part 7] 11/68
Stochastic FrontierModels
Panel Data
Reinterpreting the Within Estimator: 1984
2
Schmidt and Sickles Fixed Effects Approach: 1984
~ [0, ],
.
Counterpart to Jondrow et al. (198
it i it it
it v i
y v
v N
x
semiparametically specified
fixed mean, constant variance
2)
ˆ ˆˆ max ( )
(The cost of the semiparametric specification is the
location of the inefficiency distribution. The authors
also revisit Pitt and Lee to demonstrate.)
i i i iu Time fixed
[Part 7] 12/68
Stochastic FrontierModels
Panel Data
Schmidt and Sickles FE Model
lnyit = + β’xit + ai + vit
estimated by least squares (‘within’)ˆ ˆˆ > 0 (for production or profit)
ˆ ˆˆ - > 0 (for cost or distance)
Implications: One firm is perfectly efficient.
Either deterministic frontier, or firm
i j j i
i i j j
u =max( a )- a
u =a min( a )
s
are compared to other firms,
not an absolute standard of zero.
[Part 7] 13/68
Stochastic FrontierModels
Panel Data
Misgivings About Time Fixed Inefficiency: 1990-
20 1 2
2 1
Cornwell Schmidt and Sickles (1990)
ˆ ˆˆ, u max( )
Kumbhakar (1990)
[1 exp( )] | |
Battese and Coelli (1992, 1995)
exp[ ( )] | |, exp[ ( , , )] | |
Cuesta (
it i i i it it it
it i
it i it it i
t t
u bt ct U
u t T U u g t T U
z
2000)
exp[ ( )] | |, exp[ ( , , )] | |it i i it i it iu t T U u g t T U z
[Part 7] 14/68
Stochastic FrontierModels
Panel Data
Battese and Coelli Models
it
i
2
In half or truncated normal model
u(i,t) = |U(i)| exp[g(z , t, )]
U(i) ~ normal, mean =
variance =
Commonly used formulation. g(...) is flexible.
Time invariant
ui
random component is still a major
influence on the model. Results are still vastly
different from models in which the random part
varies with time.
[Part 7] 15/68
Stochastic FrontierModels
Panel Data
Variations on Battese and Coelli (There are many) Farsi, M. JPA, 30,2, 2008.
i
20
log (log )
= fixed effects
0~ ,
0
it it i i it it
it i i i
i
i
C t u v
u u t t
N
x z
[Part 7] 16/68
Stochastic FrontierModels
Panel Data
Time Invariant Heterogeneity
[Part 7] 17/68
Stochastic FrontierModels
Panel Data
Observable Heterogeneity
[Part 7] 18/68
Stochastic FrontierModels
Panel Data
Are the time varying inefficiency models more like time fixed or freely time varying?
A Pooled Model
Battese and Coelli (1992) exp[ ( )] | |
Pitt and Lee (1981) | |
Where is Battese and Coelli?
Closer to
it it it it
it i
it it it i
y v u
u t T U
y v U
x
x
the pooled model or to Pitt and Lee?
Greene (2004): Much closer to the Pitt and Lee model
[Part 7] 19/68
Stochastic FrontierModels
Panel Data
2 2
In these models with time varying inefficiency,
( , ) | |
~ [0, ] and ~ [0, ],
where does unobserved time invariant
heterogeneity end up?
In the inefficiency! Even with t
it it it i it i
it v it u
y v g t U
v N U N
x z
he extensions.
[Part 7] 20/68
Stochastic FrontierModels
Panel Data
Greene, W., Distinguishing Between Heterogeneity and Inefficiency: Stochastic Frontier Analysis of the World Health Organization’s Panel Data on National Health Care Systems, Health Economics, 13, 2004, pp. 959-980.
[Part 7] 21/68
Stochastic FrontierModels
Panel Data
True Random and Fixed Effects: 2004
2 2
True Random and Fixed Effects Approach: 2004
~ [0, ], | | and ~ [0, ]
Unobserved time invariant heterogeneity,
not unobserved time invariant inefficiency
Jo
it i it it it
it v it it it u
i
y v u
v N u U U N
x
2
2 2
ndrow et al. (JLMS) Inefficiency Estimator
( )[ | ]
1 ( )
, , ,
itit it it
it
u itit it it v u i
v
E u
v u
Time varying
Time fixed
[Part 7] 22/68
Stochastic FrontierModels
Panel Data
The True RE Model is an RP Model
[Part 7] 23/68
Stochastic FrontierModels
Panel Data
Skepticism About Time Varying Inefficiency Models: Greene (2004)
[Part 7] 24/68
Stochastic FrontierModels
Panel Data
Estimation of TFE and TRE Models: 2004
2 2
True Fixed Effects: MLE
~ [0, ], | | and ~ [0, ]
Unobserved time invariant heterogeneity,
not unobserved time invariant inefficiency
it i it it it
it v it it it u
i
y v u
v N u U U N
x
2 2 2
Just add firm dummy variables to the SF model (!)
True Random Effects: Maximum Simulated Likelihood (RPM)
( )
~ [0, ], | | and ~ [0, ], ~ [0, ]
it i it it it
it v it it it u i w
y w v u
v N u U U N w N
x
Unobserved time invariant heterogeneity,
not unobserved time invariant inefficiency
Random parameters stochastic frontier model
i
[Part 7] 25/68
Stochastic FrontierModels
Panel Data
A True FE Model
[Part 7] 26/68
Stochastic FrontierModels
Panel Data
Schmidt et al. (2011) – Results on TFE Problem of TFE model – incidental parameters problem. Where is the bias? Estimator of u
Is there a solution? Not based on OLS Chen, Schmidt, Wang: MLE for data in group mean deviation
form
[ ] ( ) ( )
Trades fixed effects problem for the problem of obtaining
the distribution of the deviations of the one sided terms.
Derives a "within MLE" estimator.
it i it i it i it iy y v v u u x x
[Part 7] 27/68
Stochastic FrontierModels
Panel Data
1
Log likelihood function for stochastic frontier model
2log log
log ( , , , ) = ( )
log
i i
N
ii i
y
Ly
x
x
[Part 7] 28/68
Stochastic FrontierModels
Panel Data
1 1 1
for stochastic frontier model
with a time invariant random constant term. (TRE model)
2 ( )
1log ( , , , , ) = log
( (
it w ir it
N R TSw i r t
it w
y w
LR y w
Simulated log likelihood fun t
x
c ion
) )
draws from N[0,1].
ir it
irw
x
[Part 7] 29/68
Stochastic FrontierModels
Panel Data
TRE SF Model for 247 Spanish Dairy Farms
u
v
w
[Part 7] 30/68
Stochastic FrontierModels
Panel Data
Moving Heterogeneity Out of Inefficiency
World Health Organization study of life expectancy (DALE) and composite health care delivery (COMP)
[Part 7] 31/68
Stochastic FrontierModels
Panel Data
[Part 7] 32/68
Stochastic FrontierModels
Panel Data
Generalized True Random Effects Stochastic Frontier Model
Short run (transient) random components
Time varying normal - half normal SF
Long run
it i i it it it
it it
y A B v u
v u
x
(permanent) random components
Time fixed normal - half normal SFi iA B
[Part 7] 33/68
Stochastic FrontierModels
Panel Data
2
Generalized True Random Effects Stochastic Frontier Model
( | |)
Short run (time varying, transient) random components
~ [0, ], | | and ~ [0,
it w i i it it it
it v it it it u
y w e v u
v N u U U N
x
2 ],
Long run (time invariant) random components
~ [0,1], ~ [0,1]
The random constant term in this model has a closed skew
normal distribution, instead of the usual normal distribution.
i iw N e N
[Part 7] 34/68
Stochastic FrontierModels
Panel Data
A Stochastic Frontier Model with Short-Run and Long-Run Inefficiency:Colombi, R., Kumbhakar, S., Martini, G., Vittadini, G.University of Bergamo, WP, 2011
[Part 7] 35/68
Stochastic FrontierModels
Panel Data
1
Colombi et al. Classical Maximum Likelihood Estimator
log ( , )log
log ( ( , )) log 2
(...) T-variate normal pdf.
(..., )) ( 1) Multivariate normal int
N T i i T
iq i i T
T
q
Lnq
T
y X 1 AVA
R y X 1
egral.
Too time consuming and complicated. Estimated in two steps.
First step a variant of least squares for .
[Part 7] 36/68
Stochastic FrontierModels
Panel Data
Tsionas, G. and Kumbhakar, S.
Firm Heterogeneity, Persistent and Transient Technical Inefficiency: A Generalized True Random Effects ModelJournal of Applied Econometrics. Published online, November, 2012.Forthcoming.
Extremely involved Bayesian MCMC procedure. Efficiency components estimated by data augmentation.
[Part 7] 37/68
Stochastic FrontierModels
Panel Data
Kumbhakar, Lien, Hardaker
Technical Efficiency in Competing Panel Data Models: A Study of Norwegian Grain Farming, JPA, Published online, September, 2012.
Three steps based on GLS: (1) RE/FGLS to estimate (,) (2) Decompose time varying residuals using MoM and SF. (3) Decompose estimates of time invariant residuals.
[Part 7] 38/68
Stochastic FrontierModels
Panel Data
1 1 1
Maximum Simulated Full Information log likelihood function for the
"generalized true random effects stochastic frontier model"
( | |)2,
1logL , = log
,
it w ir ir
TN RS
i r t
w
y w U
R
( ( | |) )
draws from N[0,1]
|U | absolute values of draws from N[0,1]
it
it w ir ir it
ir
ir
y w U
w
x
x
[Part 7] 39/68
Stochastic FrontierModels
Panel Data
Estimating Efficiency in the CSN Model
1 12
1
Moment Generating Function for the Multivariate CSN Distribution
( , )E[exp( ) | ] exp
( , )
(..., ) Multivariate normal cdf. Parts defined in Colombi et al.
Computed using
T ii i i
T i
Rr tt u y t Rr t t
Rr
1
GHK simulator.
1 0 0
0 1 0, = , , ...,
0 0 1
i
ii
iT
e
u
u
u t
[Part 7] 40/68
Stochastic FrontierModels
Panel Data
247 Farms, 6 years.100 Halton draws.Computation time: 35 seconds including computing efficiencies.
[Part 7] 41/68
Stochastic FrontierModels
Panel Data
Estimated efficiency for farms 1-10 of 247.
[Part 7] 42/68
Stochastic FrontierModels
Panel Data
Kumbhakar, Lien, Hardaker suggest
Time invariant efficiency timesOverall Technical Efficiency =
Time varying efficiency
.8022 .8337 .8691 .8688 .8725 .8745
[Part 7] 43/68
Stochastic FrontierModels
Panel Data
Cost Efficiency of Swiss Railway Companies: Model Specification
C = f ( Y1, Y2, PL , PC , PE , N, DA )
43
C = Total costs
Y1 = Passenger-km
Y2 = Freight ton-km
PL = Price of labor (wage per FTE)
PC = Price of capital (capital costs / total number of seats)
PE = Price of electricity
N = Network length
DA = Dummy variable for companies also operating alpine
lines
[Part 7] 44/68
Stochastic FrontierModels
Panel Data
Data
50 railway companies, Period 1985 to 1997
Unbalanced panel with number of periods (Ti) varying from 1 to 13 and with 45 companies with 12 or 13 years, resulting in 605 observations
Data source: Swiss federal transport office
Data set available at http://people.stern.nyu.edu/wgreene
Data set used in: Farsi, Filippini, Greene (2005), Efficiency and measurement in network industries: application to the Swiss railway companies, Journal of Regulatory Economics
44
[Part 7] 45/68
Stochastic FrontierModels
Panel Data
Model Specifications: Special Cases and Extensions
Pitt and Lee
True Random Effects
Extended True Random Effects
Mundlak correction for the REM, group means of time varying variables
Extended True Random Effects with Heteroscedasticity in vit: v,it=vexp(’zit)
[Part 7] 46/68
Stochastic FrontierModels
Panel Data
Efficiency Estimates
46
TRE Models Move Heterogeneity Out of the Inefficiency Estimate
[Part 7] 47/68
Stochastic FrontierModels
Panel Data
2. Cost Efficiency of Norwegian Electricity Distribution Companies:
Model Specification
C = f ( Y, CU, NL, PL , PC )
47
C = Total costs of the distribution activity
Y = Output (total energy delivered in kWh)
CU = Number of customers
NL = Network length in km
PL = Price of labor (wage per FTE)
PC = Price of capital (capital costs / transformer
capacity)
[Part 7] 48/68
Stochastic FrontierModels
Panel Data
Data
111 Norwegian electricity distribution utilities
Period 1998 – 2002
Balanced panel with 555 observations
Data source: Norwegian electricity regulatory authority (Unpublished)
48
[Part 7] 49/68
Stochastic FrontierModels
Panel Data
Mundlak Specification
Suggests ei or wi may be correlated with the inputs.
[Part 7] 50/68
Stochastic FrontierModels
Panel Data
Efficiency Estimates
50
[Part 7] 51/68
Stochastic FrontierModels
Panel Data
Appendix A: Implementation Customized version of NLOGIT 5/LIMDEP 10.
Both instructions exist in current version. Modifications were:
For the generalized TRE, allow the random constant term in the TRE model to have a second random component that has a half normal distribution.
For the selection model, allow products of groups of observations to appear as the contribution to the simulated log likelihood
Now available from the author as an update to LIMDEP or NLOGIT. To be released with the next version.
[Part 7] 52/68
Stochastic FrontierModels
Panel Data
[Part 7] 53/68
Stochastic FrontierModels
Panel Data
DISTANCE FUNCTION
[Part 7] 54/68
Stochastic FrontierModels
Panel Data
A Distance Function Approach
http://www.young-demography.org/docs/08_kriese_efficiency.pdf
[Part 7] 55/68
Stochastic FrontierModels
Panel Data
Kriese Study of Municipalities
[Part 7] 56/68
Stochastic FrontierModels
Panel Data
[Part 7] 57/68
Stochastic FrontierModels
Panel Data
[Part 7] 58/68
Stochastic FrontierModels
Panel Data
[Part 7] 59/68
Stochastic FrontierModels
Panel Data
[Part 7] 60/68
Stochastic FrontierModels
Panel Data
[Part 7] 61/68
Stochastic FrontierModels
Panel Data
[Part 7] 62/68
Stochastic FrontierModels
Panel Data
[Part 7] 63/68
Stochastic FrontierModels
Panel Data
TOTAL FACTOR PRODUCTIVITY
[Part 7] 64/68
Stochastic FrontierModels
Panel Data
Factor Productivity Growth Change in output attributable to change in factors,
holding the technology constant Malmquist index of change in technical efficiency
TE(t+1|t) = technical efficiency in period t+1 based on factor usage in period t+1 in comparison to firms using factors and producing output in period t.
Index measures the change in productivity
[Part 7] 65/68
Stochastic FrontierModels
Panel Data
TFP measurement using DEA
[Part 7] 66/68
Stochastic FrontierModels
Panel Data
[Part 7] 67/68
Stochastic FrontierModels
Panel Data
Total Factor Productivity GrowthSpanish Dairy Farms
[Part 7] 68/68
Stochastic FrontierModels
Panel Data
Malmquist Index of Factor Productivity Growth Spanish Dairy Farms