regression-based modeling of market option prices: … modeling of market option prices: with...

Forthcoming: Journal of Forecasting

Regression-based Modeling of Market Option Prices: with Application to S&P500 Options

Gurupdesh Pandher∗

Revised Version: September 2006

Abstract

This paper presents a simple empirical approach to modeling and forecasting market option prices using localized option regressions (LOR). LOR projects market option prices over localized regions of their state space and is robust to assumptions regarding the underlying asset dynamics (e.g. log-normality) and volatility structure. Our empirical study using three years of daily S&P500 options shows that LOR yields smaller out-of-sample pricing errors (e.g. 32% one-day-out) relative to an efficient benchmark from the literature and produces option prices free of the volatility smile. In addition to being an efficient and robust option modeling and valuation tool for large option books, LOR provides a simple to implement empirical benchmark for evaluating more complex risk-neutral models. Keywords: option pricing, regression, volatility smile, S&P500. © All rights reserved. Please do not quote without author permission.

∗Address: Department of Finance, DePaul University, 1 East Jackson Boulevard, Chicago, IL 60604. Email: [email protected]; Phone: (312) 362-5915; Webpage: http://mozart.depaul.edu/~gpandher

1

I. INTRODUCTION

In their seminal work on option pricing, Black and Scholes (1973) and Merton

(1973) propose the risk-neutral valuation framework for identifying the equilibrium price

of a contingent claim. This methodology is based on the insight that, if the price risk of

the option can be dynamically hedged by trading in the underlying asset, then, to rule out

arbitrage, the hedged position must earn the same return as the risk-free security. Over

the past three decades, considerable research effort has gone into extending the initial

Black-Scholes-Merton framework by relaxing certain assumptions and incorporating

additional features in the asset return process (e.g. jumps, mean-reversion, stochastic

volatility). Relatively little progress has been reported, however, in the development of

empirical approaches to modeling market option prices.

This paper presents a relatively simple, but effective, empirical method for

modeling and predicting option prices based on localized option regressions (LOR)

where market option prices are projected over localized regions of their state space. The

method does not require assumptions regarding the asset price distribution (e.g. log-

normality), volatility structure and hedging behavior of risk-neutral models and offers a

robust data-driven method for valuing option books. While the LOR methodology may

appear deceptively simple, the empirical model and our surprising findings have not been

reported in the previous literature.

The LOR approach may be motivated by observing that how well risk-neutral

theory and models are able to explain actual market option prices depends on the extent

to which the assumptions and mechanics behind the risk-neutral arbitrage arguments hold

in actual markets. These include assumptions on the dynamics and probability

distribution of the underlying asset prices and the ability to efficiently hedge the option’s

price risk. The wide choice of available models, assumptions and estimation error in key

parameters implies that the relationship between option prices from theoretical models

2

and observed market prices is not unique and is necessarily approximate. LOR attempts

to empirically capture the relationship between market option prices and state variables

found to be important in risk-neutral option pricing theory.

Innovations in the derivatives literature following Black and Scholes (1973) and

Merton (1973) include the discrete risk-neutral framework of Cox, Ross and Rubinstein

(1979), jumps in the asset return process, deterministic volatility functions, stochastic

volatility, non-parametric estimation of option models and generalization using Levy

processes. The deterministic volatility approach adjusts the asset pricing lattice to

incorporate the volatility smile effect in market options (Derman & Kani (1994),

Rubinstein (1994)) while the jump, stochastic volatility and GARCH class of option

models attempt to theoretically address Black-Scholes anomalies like the ‘volatility

smile’ (Hull & White (1987), Heston (1993), Duan (1995), Bates (1996), Ritchen &

Trevor (1999), Duffie, Pan and Singleton (2000), Heston and Nandi (2000), Lehnert

(2003) and others). Carr and Lu (2004) use time-changed Levy processes to provide a

unified treatment of option pricing models where asset returns exhibit jumps, stochastic

volatility and correlations with volatility.

To evaluate the LOR model, we compare its performance with the efficient

Black-Scholes implementation of Dumas, Fleming and Whaley (1998) based on implied

volatility regressions. The literature also refers to their method as the “Practitioner

Black-Scholes” (PBS) model and it has become a common benchmark for evaluating new

models in the options literature. Using S&P500 options, Dumas, Fleming and Whaley

(1998) show that the PBS model compares favorably with binomial option models

calibrated using deterministic volatility functions (Derman and Kani (1994), Dupire

(1994) and Rubinstein (1994)). The PBS benchmark is also used by Heston and Nandi

(2000) to evaluate the performance of a closed-form GARCH option pricing model and

Christoffersen and Jacobs (2004) find that the PBS model with aligned loss functions

3

outperforms more complex models such as the stochastic volatility model of Heston

(1993).

A semi-nonparametric approach is employed by Ait-Sahalia and Lo (1998) where

implied volatility from kernel regressions is used in the Black-Scholes-Merton model to

determine option prices. The proposed LOR approach differs from both Ait-Sahalia and

Lo (1998) and Dumas, Fleming and Whaley (1998) in that it does not use Black-Scholes

or any other risk-neutral model to generate option prices. Instead, LOR is based on direct

projections of option prices onto their localized state space.

Our empirical study using three years of daily S&P500 index options (38,487

daily prices) shows that i) the LOR model yields significantly smaller in-sample and out-

of-sample pricing errors than the PBS implementation of Dumas, Fleming and Whaley

(1998) and ii) LOR options are largely free of the volatility smile effect. For example,

the one-day-out average pricing error (prediction RMSE) for LOR is $0.53 while it is

$0.70 for the PBS benchmark. Moreover, PBS pricing errors exceed LOR by 32.4%,

13.2% and 2.8% at one-day, two-day and three-day out-of-sample horizons, respectively

(see Table IV). Comparison with other studies using the same sample, such as Bakshi,

Cao and Chen (1997), further demonstrates that LOR is very competitive with stochastic

volatility models. We also find that the volatility smile is virtually non-existent in option

prices generated by the LOR approach (see Table VII and Figure 2).

An important practical implication of these results is that a relatively

straightforward econometric approach based on locally projecting option prices onto their

state process offers the potential of serving as an efficient alternative option

modeling/valuation tool, as well as a benchmark for evaluating the performance of more

complex structural risk-neutral models. Another attractive feature of the LOR approach

is its simplicity and ease of implementation. The localized regressions are much easier to

implement and use less data than the neural-networks approach to option valuation

4

proposed in Hutchinson, Lo and Poggio (1994), where two years of simulated daily

Black-Scholes prices on S&P500 futures are used to train the neural-network algorithm.

LOR methodology requires a much shorter estimation window and our empirical analysis

of Section IV provides very good results using a moving estimation window of 50 days.

We are not able to compare the empirical performance of LOR with the neural network

approach as the study of Hutchinson, Lo and Poggio (1994) is based on simulated data

and does not use market-traded option prices.

The paper is organized as follows. Section II introduces four candidate option

regression models and describes essential aspects of the LOR methodology including

localization to the option strike-maturity space, the out-of-sample forecasting procedure

and the benchmark PBS Black-Scholes model. The S&P500 options data is described in

Section III. The in-sample and out-of-sample performance of the models is reported in

Section IV along with an analysis of the volatility smile effect in LOR option prices. The

conclusion follows in Section V.

5

II. LOCALIZED OPTION REGRESSION MODELING

We begin by describing four candidate regression specifications that will serve as

the basic workhorse models for localized option regression (LOR) modeling. These

option regressions are sequentially localized to regions of the option state space based on

their maturity and strike price. The out-of-sample implementation of the LOR is also

described as it represents a natural pricing application where model parameters estimated

from recent options are used to estimate new option values.

A. Structural & Reduced-form Option Regressions

Let V represent the value of a given market-traded option (e.g. call, put) with

underlying asset price S (e.g. index, stock, currency, bond), time of option expiration T

(in trading days), strike price K , asset return volatility σ , and the risk-free interest rate

r . Further, tT −=τ is the option’s remaining time-to-maturity, where ],0[ Tt∈ is any

time up to option expiration.

We consider two classes of localized option regressions - structural and reduced-

form models – which locally project derivative prices on their state process based on the

underlying asset price, strike price, time-to-maturity, implied volatility and the risk-free

rate. The state space includes linear, quadratic and interaction terms arising among the

state variables. The structural specification models the option’s non-linear behavior

around the strike price through the “moneyness” variable KSm /= (ratio of the current

asset value to strike price), while the reduced-form specification models this directly in

terms of the strike price.

A number of specifications and functional forms were initially considered. To

make the presentation manageable, we narrow consideration to the four best performing

classes of reduced-form and structural option regression models. The first two models

project market options onto a linear and quadratic state-space of the state variables

6

( S ,τ , K , r ) using the reduced-form and structural specifications. The other two models

additionally include implied volatility IVσ as an additional predictor. For the purpose of

concreteness, an explicit representation of the four models is given below.

Reduced-form Model (RLOR):

ετααταατααατααααταααα

+++++++++++++++=

rKrKSrSSKrKSrKSV

14131211109

28

27

26

2543210 (1)

Structural Model (SLOR):

ετααταατααατααααταααα

+++++++++++++++=

rKrKmrmmKrKmrKmV

14131211109

28

27

26

2543210 (2)

Reduced-form Volatility Model (RLOR-V):

εσατσασασασασα

τααταατααατααααταααα

+++++++

++++++++++++++=

rKS

rKrKSrSSKrKSrKSV

IVIVIVIVIVIV 201918172

1615

14131211109

28

27

26

2543210

(3)

Structural Volatility Model (SLOR-V):

εσατσασασασασα

τααταατααατααααταααα

+++++++

++++++++++++++=

rKm

rKrKmrmmKrKmrKmV

IVIVIVIVIVIV 201918172

1615

14131211109

28

27

26

2543210

(4)

Initially, the complete models above with all quadratic and interaction terms were

considered1. In the empirical implementation, however, the statistical insignificance of

some coefficients leads to a reduction in model size (see Table III for estimates).

2Other functional forms were also investigated, including variants of (1)-(4) with logs of i) the dependent variable, ii) the independent variables and iii) both. The second set of models frequently lead to multicollinearty problems as the logged quadratic and interaction terms become linearly related to the main effects. Further, models where the dependent variable is the log-option price were found to be inferior to the simpler models with the option price as the dependent variable. For example, our analysis in Section IV.C identifies model (3) as the best specification. Its log-likelihood and deviance in the complete sample is -38,066 and 24,080, respectively, while the same for the log-option price is -51,817 and 63,136. While R-squares are not directly comparable, the adjusted R-square for model (3) is 0.9962 (see Table III) while the same for its log-price variant is 0.9532. Examination of alternative forms leads to the conclusion that the simple linear specification works best in terms of efficiency and pricing errors (these results are not reported in the interest of brevity).

7

For a more compact representation, if Z represents the (row) vector of

explanatory variables in (1)-(4), then the above option regressions may be generically

expressed as

εα += ZV (5)

where α is the parameter vector.

The option regressions with implied volatility (SLOR-V and RLOR-V) are

motivated by the importance of volatility in the Black-Scholes model. Implied volatility

is estimated numerically by inverting (6) on the market option price:

),,,,(1 στσ rKSBSIV−=

where )()(),,,,( στστ τ −−= − dNedSNrKSBS r (6)

is the Black-Scholes call option formula with ( ) σττσ )2/()/ln( 2++= rKSd .

For the volatility LOR models (3)-(4), a mechanism for estimating implied

volatility over the strike-maturity space is also required. This is done by adopting the

implied volatility estimation of Dumas, Fleming and Whaley (1998) and is further

discussed in Section B below.

B. Localized Option Regression (LOR) Modeling

In localized option regression modeling, models (1)-(4) are sequentially estimated

by maturity-moneyness clusters over a rolling estimation window (indexed by q ). This

estimation scheme reflects a natural valuation application of LOR where model

parameters estimated from recent market option data are used to price new options as

predicted values.

8

B.1 Localization

The localization of the option regressions (1)-(4) to sequential maturity-

moneyness clusters is represented generically by

εα += ),(),( cqZcqV (7)

where ),( cqV is the market price of an option with state variables Z in estimation period

q and maturity-moneyness cluster c and ),( cqα is the parameter vector.

Localization is defined by clusters formed from maturity-moneyness groups. Let

{ }piii ,...,1],,( 1 ==Τ +ττ represent option maturity groups with delineation points

pτττ <<< ...21 . Similarly, { }ljmm jj ,...,1],,( 1 ==Μ + represent option moneyness

groups with lmmm <<< ...21 . Thus, there are pl localized clusters formed by the

combinations Μ×Τ∈c .

There is, of course, some flexibility in the determination of the length of the

estimation window and localization clusters and some analysis is required to identify an

efficient delineation by balancing the trade-off between model fit and sample size

(however, this needs to be performed only once for a particular type of option and need

not be repeated in subsequent LOR applications). While increased localization may

improve the fit of the option regression in individual clusters, it also reduces the sample

size for estimating model parameters in each cluster. Since we are primarily interested in

the potential of LOR as a modeling and valuation tool, we focus on its out-of-sample

performance in determining the length of the estimation cycle and the localization

clusters. Here, over-fitting due to increased localization can lead to poor out-of-sample

performance.

Similar issues arise in the nonparametric option-pricing approach of Ait-Sahalia

and Lo (1998) where implied volatility is modeled using kernel regressions. Kernel

regressions provide a weighted average of observed values around a characteristic point

9

(e.g. ( S ,τ , K , r )), where more weight is given to points closer to the characteristic point.

The performance of the estimator is highly sensitive to the bandwidth of the kernel

density function. When the bandwidth is too small (relative to optimal), the estimator is

under-smoothed and exhibits high variance but low bias. Meanwhile, the estimator

becomes over-smoothed if the bandwidth is too high. In this case, its variance is low but

its bias increases.

Applying the LOR methodology of this Section (II) to the S&P500 options used

in our empirical study, we quickly identified an efficient localization based on two broad

moneyness groups ( { }]1.1,1(],1,9[.=Μ ), three maturity groups

( { }]367,100(],100,50(],50,7[T = ) and a moving estimation window of q =50 days (the

maximum maturity of options in our sample is 367 days). The efficient localization is

determined by evaluating the out-of-sample errors of LOR option prices under various

formulations of the moneyness-maturity groups (Μ and Τ ). Call options prices at

maturity exhibit a “hockey-stick” payoff centered at the moneyness value 1/ == KSm .

Our construction of the moneyess groups avoids the non-linearity resulting from this kink

by considering the following two candidate groupings: { }]1.1,1(],1,9[.=Μ and

{ }]1.1,05.1(],05.1,1(],1,95(.],95,.9[.=Μ . The second set leads to more refined localized

clusters for LOR modeling. Similarly, we considered coarser and more refined sets for

the maturity groups given by the following three sets: { }]367,100(],100,7[=Τ ,

{ }]367,100(],100,50(],50,7[=Τ and { }]367,200(],200,150(],150,100(],100,50(],50,7[=Τ .

[Table I about here]

One-day out-of-sample pricing errors generated by the six different combinations

of Μ and Τ are reported in Table 1 (generation of out-of-sample LOR option prices is

10

described in IV.E). Interestingly, greater localization does not necessarily decrease the

over-all pricing error due to over-fitting. For example, we find that the two moneyness

grouping based on 10% intervals for the moneyness variable KSm /= yield better

performance (1, 2 and 3 days out) than refinement to four groupings separated by

intervals of 5%. The lowest pricing errors occur for the localization based on

{ }]1.1,1(],1,9[.=Μ and { }]367,100(],100,50(],50,7[=Τ . We select these moneyess-

maturity sets to define our efficient localization for the empirical study of Section IV.

B.2 Implied Volatility Forecasting

The first step in determining new option prices using the LOR method involves

estimating and identifying the best LOR model from the candidates (1)-(4). If either of

the volatility models RLOR-V (3) or SLOR-V (4) are selected, then the regressor state

space also includes implied volatility IVσ in addition to ),,,( rKS τ . In this case, an

estimate of volatility in the out-of-sample period is required. A method for doing this is

described next before considering the estimation of out-of-sample LOR option prices.

Volatility estimates are obtained from the implied volatility regressions proposed

by Dumas, Fleming and Whaley (1998, DFW). They model the relationship between

implied volatility and the option’s strike and maturity over recent market prices and use

the estimated volatilities to obtain Black-Scholes option values in the out-of-sample

period. Among the various volatility regressions analyzed by DFW, the best candidate is

identified as

ετβτβτββββσ ++++++= KKKIV 52

432

210 . (8)

Let d represent the sample period (e.g. day, week) over which the implied

volatility regression will be estimated (note that d is much smaller than the rolling

11

estimation window q used in LOR modeling). Therefore, the parameters of the implied

volatility regressions may be represented as

ετβτβ

τββββσ

+++

+++=

Kdd

dKdKdddIV

)()(

)()()()()(

52

4

32

210 , qd ∈ , (9)

for each period d . As done in a number of other studies (e.g. Dumas, Fleming and

Whaley (1998), Christoffersen and Jacobs (2004)), we select the estimation period d for

the volatility regression (9) to be one trading day. This means that in determining the n -

day out-of-sample option prices, volatility parameter estimates from day nd − are used

to predict implied volatilities in day d (we used “day” for “period” without any loss of

generality).

B.3 Out-of-Sample LOR Option Pricing

We now consider the use of LOR as an option valuation tool where parameters

estimated over a moving estimation window q are used to generate new option prices as

out-of-sample predictions. LOR option values in the subsequent period 1+q with state

variables ),,,,( IVrKS στ are generated as follows2:

1) No Volatility Case: If the LOR is model (1) or (2)

In this case, an estimate of volatility is not required. LOR option values in period

1+q and maturity-moneyness cluster c are then calculated as

),(),,,(),1;,,,( cqrKSZcqrKSV αττ =+ . (10)

where ),,,( rKSZ τ is the vector of corresponding LOR regressor variables and ),( cqα is

the corresponding parameter vector estimated from market options in the previous period

2Note that the state variables ),,,( rKS τ in the out-of-sample period (e.g. next trading day) are not forecasted and would be known for the option being priced at that moment. For example, at the beginning

12

q and maturity-moneyness cluster c . Similarly, if the LOR model is (2), the LOR

option value is calculated as

),(),,,(),1;,,,( cqrKmZcqrKmV αττ =+ . (11)

2) Volatility Case: If LOR is model (3) or (4)

i) Estimate the next day out-of-sample implied volatility for day nd + as

τβτβτβ

βββτσ

Kddd

KdKddndKIV

)()()(

)()()();,(

52

43

2210

+++

++=+ (12)

where the parameters ))(),(,)(),(),(),(()( 543210 ddddddd ββτβββββ = are estimated

by fitting the volatility regression (9) to implied option volatilities from day d .

ii) LOR out-of-sample option values in period 1+q and maturity-moneyness cluster c

are then obtained as follows:

)1;,( += dKIV τσσ , 11 +∈+ qd ,

RLOR-V model (3): ),(),,,,(),1;,,,,( cqrKSZcqrKSV αστστ =+ (13)

SLOR-V model (4): ),(),,,,(),1;,,,,( cqrKmZcqrKmV αστστ =+ (14)

where ),,,,( στ rKSZ and ),,,,( στ rKmZ are the vectors of LOR regressors according

to (3) and (4), respectively. The corresponding parameters ),( cqα are estimated from

options trading in period q and maturity-moneyness cluster c .

C. The PBS Black-Scholes Benchmark

As discussed earlier, to evaluate the pricing performance of LOR, we use the

efficient Black-Scholes implementation of DFW as a benchmark model. The literature

also refers to their methodology as the “Practitioner Black-Scholes” (PBS) model.

of the trading day in 1+q , the option strike price, stock price and remaining time-to-maturity are all know while the LOR parameter ),( cqα is estimated from the option data in the previous period q .

13

The critical issue for obtaining Black-Scholes option prices is how to infer

volatility across the spectrum of exercise prices and maturities. For this purpose, DFW

identify the best implied volatility regression as

ετβτβτββββσ ++++++= KKKIV 52

432

210 .

Volatility regression parameters estimated from recently observed market option prices

are then used to construct volatility estimates for out-of-sample Black-Scholes option

prices.

As in the case of LOR volatility estimation (9), we apply the DWF volatility

modeling to daily options and use the estimated parameters to predict next-day implied

volatility by strike price and maturity. For any give day d in the sample, the volatility

parameters are estimated from the regression (9).

With q representing the current LOR estimation period, the corresponding PBS

option value with state variables ),,,( rKS τ in the subsequent period 1+q is obtained as

follows:

i) Estimate the out-of-sample implied volatility for day nd + as

τβτβτβ

βββτσ

Kddd

KdKddndKIV

)()()(

)()()();,(

52

43

2210

+++

++=+ (15)

where the parameters ))(),(,)(),(),(),(()( 543210 ddddddd ββτβββββ = are estimated

by fitting the volatility regression (9) to implied option volatilities from day d .

ii) Calculate the Black-Scholes option values for day nd + as

);,( ndKIV += τσσ

)()()();,,,,( 1 στστ τ −−−=+ − dNedNPVDSndrKSPBS r (16)

where ( ) σττσ )2/()/ln( 21 ++= rKSd and PVDS − is the S&P500 index net of the

present value of dividends.

14

III. DATA

Our empirical analysis uses option prices on the S&P500 index options traded on

the Chicago Board of Options Exchange (CBOE). Options written on the S&P500 index

are the most actively traded European-style contracts. This data was selected due to the

high market liquidity of these options and their frequent use in earlier empirical studies.

S&P500 index options have been the focus of many investigations related to the

estimation and performance of option pricing models, risk-neutral densities and implied

volatility analysis (Lamoureux, Christopher and Lastrapes (1993), Heynen (1993), Bates

(1996), Bakshi, Cao and Chen (1997), Aït-Sahalia & Lo (1998), Dumas, Fleming and

Whaley (1998), Das and Sundaram (1999), Jackwerth (2000), Jacquier and Jarrow (2000)

and others).

In particular, Bakshi, Cao and Chen (1997) use a three year sample of daily call

option prices on the S&P500 index from June 1, 1988 to May 31, 1991 to evaluate the

performance of alternative option pricing models including Black-Scholes and extensions

with stochastic volatility, jumps and stochastic interest rates. We are thankful to Gurdip

Bakshi for graciously providing us with this data and its use in evaluating the LOR

methodology facilitates comparison of pricing errors and the volatility smile across

studies. Christoffersen and Jacobs (2004) also use the same sample of S&P500 options

to show that consistency in the choice of loss functions for estimation and evaluation

significantly improves the performance of option models. Their study excludes option

maturities that are less than 60 days and restricts options to be within 5% of being at-the-

money (total of 14,648 options). Our analysis uses a broader range of options with

maturities exceeding 7 days that are within 10% of being at-the-money (28,417 options).

A brief description of the option data is provided below and we refer to Bakshi,

Cao and Chen (1997) for complete details. Panel A of Table II reports the summary

statistics for variables related to daily closing S&P500 call options over the three year

15

sample and Panels B and C give option means and the number of options by moneyness-

maturity combinations.

[Insert Table II about here]

The intra-day bid-ask quotes for S&P500 call options are obtained from the

Berkeley Options Database. For the analysis, option prices are formed by taking the

average of the last reported bid-ask prices (prior to 3:00 pm, Central Standard Time) for

each day in the sample. The corresponding S&P500 index values are synchronous to the

closing option prices and the index series was carefully adjusted for dividend payments.

For the risk-free return, data on daily Treasury-bill bid and ask discounts is used with

maturities up to one year, as reported in the Wall Street Journal. Following convention,

an annualized interest rate was constructed by forming an average of bid-ask Treasury

Bill discounts.

16

IV. EMPIRICAL RESULTS

We now report the results from applying the localized option regression (LOR)

methodology and the efficient PBS Black-Scholes implementation described in Section II

to 28,417 daily market prices on S&P500 call options in the three year sample. Pricing

errors from LOR are also compared with the performance of the stochastic volatility and

jump models studied by Bakshi, Cao and Chen (1997) who use the same data3.

We proceed as follows in the empirical study. First, the best LOR model is

identified from the four structural and reduced-form specifications (1)-(4) described in

Section II. The gains from localization and an in-depth analysis of in-sample and out-of-

sample pricing performance for the selected LOR model are then presented in relation to

the PBS benchmark. Finally, we analyze the presence of the volatility smile effect in

option prices generated by the LOR and PBS models.

A. Main Findings

Before presenting the detailed analysis, the main findings are first summarized

below. Out of the four candidate option regressions, the reduced-form LOR specification

with implied volatility (RLOR-V) is selected as the best LOR model upon localization to

maturity-moneyness clusters. The corresponding LOR models without implied volatility

(RLOR and SLOR) also do relatively well and outperform the PBS benchmark.

RLOR-V yields smaller average pricing errors (RMSEs) both in-sample and out-

of-sample in comparison to the PBS benchmark. One-day out-of-sample pricing errors

are $0.53 and $0.69 for LOR and PBS, respectively (in-sample pricing errors are $0.25

and $0.48, respectively). Further, LOR pricing is more consistent across the whole

spectrum of moneyness and maturity groupings. Over one-day, two-day and three-day

3Christoffersen and Jacobs (2004) also use the same sample to show that the PBS Black-Scholes implementation of Dumas, Fleming and Whaley (1998) outperforms stochastic volatility models.

17

out-of-sample horizons, pricing errors of the PBS benchmark are higher than LOR by

32.4%, 13.2% and 2.8%, respectively. LOR modeling without implied volatility retains

much of its efficiency, providing an error reduction of 19%. Overall pricing errors from

LOR models without implied volatility, namely RLOR and SLOR, are $0.59 and $0.59,

respectively, while the same for the PBS is $0.70.

LOR also compares favorably with more sophisticated models with stochastic

volatility. Its overall pricing error ($0.53) is in the mid-point of error ranges for the

stochastic volatility model ($0.41-0.65) reported by Bakshi, Cao and Chen (1997) using

the same three year sample of S&P500 options. Lastly, we find that out-of-sample option

prices generated by the LOR model are free of the volatility smile/sneer effect while this

effect is present in PBS option prices.

B. Option Regressions without Localization

To study the incremental improvement from localization, we begin by reporting

the performance of the reduced-form and structural option regression models (1)-(4) on

the complete sample without localization to the six moneyness-maturity clusters of Table

I.

[Table III about here]

Panels A and B show that the fit of the four models, as implied by their R-squares,

is extremely high falling in the range 0.9873-0.9962. The average pricing errors of the

reduced-form models with respect to CBOE market prices (as measured by RMSE) are

uniformly lower than their structural counterparts. Pricing errors for the volatility models

are $1.30 and $1.63 for RLOR-V and SLOR-V, respectively. The same for the no-

volatility models rise to $1.91 and $2.20 for RSLOR and SLOR, respectively.

18

It appears from the global fit that option regressions with implied volatility as a

predictor have a distinct advantage. We see below that this continues to hold when

estimation is sequentially localized to maturity-moneyness clusters although the

difference narrows. Lastly, all parameter estimates reported in Tables III are highly

significant (with most significance probability or “p-values” less than .0001)4.

C. The Best LOR Model

We first identify the best LOR model among the volatility and no-volatility

reduced-form and structural candidates: RLOR (1), SLOR (2), RLOR-V (3), SLOR-V

(4). The in-sample and out-of-sample performance of these models is considered over a

moving non-overlapping 50-day estimation window ( q ) of 22 periods spanning June 1,

1988 to May 31, 1991. This leads to a total of 28,417 options for analyzing in-sample

performance in the -10% to +10% moneyness range ( ∈= KSm / [0.9,1.1]). This same

range of options is typically considered in other empirical studies including Dumas,

Fleming and Whaley (1998) and Christoffersen and Jacobs (2004). Daily volatilities in

the in-sample analysis are estimated from the DFW volatility regression (9) where

implied volatility is regressed on linear and quadratic terms of option maturity and

exercise price.

The out-of-sample horizon is taken to be one, two and three days from the end of

each rolling estimation period q . In the out-of-sample analysis, next-day predicted

volatilities from the DFW volatility regression are used in LOR models with implied

volatilities (SLOR-V and RLOR-V). Out-of-sample LOR and PBS option values are

generated with (10)-(14) and (15-(16), respectively, using DWF volatility parameters

4It should be kept in mind that the data used to estimate LOR model parameters has a minimum option maturity of 7 days. Therefore, option prices from LOR models are valid only for maturities beyond 7 days.

19

from the previous n-th day (n=1, 2, 3). Out-of-sample pricing errors from both models

are plotted by moneyness in Figure 1.

[Figure 1 about here]

[Insert Table IV about here]

From the results reported in Table IV (Panel A and B), we identify the reduced-

form volatility model (RLOR-V) as the best LOR candidate, with its structural

counterpart SLOR-V as a close tie. RLOR-V yields in-sample and out-of-sample (1-day

out) root mean square errors (RMSEs) of $0.25 and $0.53, respectively, while the same

for the PBS model are $0.48 and $0.70, respectively. This amounts to a 48%

deterioration in in-sample pricing efficiency for PBS over RLOR-V and a 32% fall in

out-of-sample efficiency. Further, the higher pricing efficiency of RLOR-V continues to

hold at the 2-day and 3-day out horizons (Panels C and D).

Based on the comparative analysis of the four LOR specifications, we select

RLOR-V as the best localized option regression model for the remaining analysis and

will refer to it simply as “LOR”. LOR modeling without implied volatility, however,

retains much of its efficiency, yielding an error reduction of 19% relative to PBS (one-

day out-of-sample). Overall pricing errors from the LOR models without implied

volatility, namely RLOR and SLOR, are both $0.59 while the same for PBS is $0.70.

This specification has the further advantage of reducing modeling complexity as implied

volatilities do not need to be estimated and forecasted (see Section II.B.2).

20

D. Gains from Localization & In-sample Performance

We now consider the gains from localization and the in-sample performance of

LOR and PBS in greater detail. First, we note a dramatic increase in performance over

the global fit of Section A: the overall pricing error shrinks to $0.25 (RLOR-V) from

$1.30 (RLOR-V, Table II). Second, the overall reduction in pricing error (efficiency

gain) of LOR over PBS is 48.4% (EFF). Further, LOR pricing errors disaggregated by

year and quarter fall in the range $0.15-$0.33, representing gains in pricing efficiency of

34.9%-63.2% over PBS. Third, the coefficient of variation (CV) gives the pricing error

as a percentage of mean call price. These are relatively small, falling in the range 1.91%-

3.19%.

Table V gives a tabulation of pricing errors and efficiency gain by maturity-

moneyness categories. The pricing errors for LOR over the 12 categories fall in the range

of $0.16-$0.33 and correspond to efficiency gains of 14.4%-62.5% over the Black-

Scholes benchmark. We also note that the performance of LOR over option moneyness

is more consistent and stable, as pricing errors are similar in magnitude over the four

moneyness (S/X) ranges from 0.9 to 1.1. For example, among the shortest maturity calls

(less than 50 days), the pricing errors are $0.17, $0.22, $0.20 and $0.18, respectively,

over the moneyness categories [.9,.95], (.95,1.0], (1,1.05], (1.05,1.1], while PBS errors

over the same categories are $0.20, $0.30, $0.32 and $0.37.

[Insert Table V about here]

The in-sample empirical results demonstrate the superior performance of

localized option regression modeling over the PBS Black-Scholes benchmark in terms of

pricing precision and stability of estimates. We now examine its out-of-sample

performance in greater detail.

21

E. Out-of-Sample Performance

The out-of-sample performance of LOR is an indication of its usefulness and

quality as a pricing tool. We now focus attention on this feature of LOR in comparison to

the PBS benchmark.

Estimation of out-of-sample LOR regression parameters in (10)-(14) is performed

over a moving 50-day non-overlapping window and uses one-day predicted volatilities

from the DFW volatility regression (9). This generates 22 sequential estimation cycles in

our three-year sample and estimation is localized within each cycle to the maturity-

moneyness clusters defined in Table I. The procedure leads to 763, 776 and 776

observations at the one-day, two-day and three-day out-of-sample horizons and the

results are similar regardless of the starting point (various other starting dates in June

1988, aside from June 1, were tried and yielded similar results). Out-of-sample LOR and

PBS option values are generated using DWF volatility parameters from the previous n-th

day (n=1, 2, 3) in (12)-(14) and (15)-(16), respectively. Daily PBS out-of-sample option

prices are constructed from (15)-(16).

We find that LOR again outperforms the PBS benchmark. Tables IV and VI

show tabulations of pricing errors by year and maturity-moneyness groupings. As

reported earlier, the overall average pricing errors for LOR and PBS (one-day out) are

$0.53 and $0.70, respectively, with an efficiency loss of 32.4% for the benchmark model

(Panel B, Table IV). The LOR pricing error as a percentage of the mean option price

(CV) is 3.99%. With respect to tabulation across maturity-moneyness categories, LOR

dominates PBS in 11 of the 12 groups (Panel A, Table VI). LOR pricing errors fall in the

range $0.33-$0.73 and the same for PBS is $0.34-$0.96. At the longer three-day out

horizon (Panels C, Table VI), LOR pricing remains more efficient at longer option

maturities (>100 days) but underperforms PBS at the mid-maturity (50-100 days).

22

[Insert Table VI about here]

Overall, LOR continues to dominate at the longer two-day and three-day out

horizons, although the difference in pricing errors diminishes (Panels B and C, Table IV).

At the one-day out horizon, overall PBS pricing errors are 32.4% higher than LOR ($0.70

vs. $0.53) while, at the two-day horizon, the overall efficiency loss of PBS relative to

LOR is 13.2% and diminishes to 2.8% at three days out.

The in-sample and out-of-sample results provide confidence in the use of LOR as

an efficient and reliable option valuation tool, and as a robust benchmark for evaluating

the contribution of more complex structural option pricing models. We next examine the

volatility smile characteristics of LOR-based option pricing, after comparing its

performance to stochastic volatility models in the literature.

F. Comparison to Stochastic Volatility Models

As our analysis uses the same sample of S&P500 call options, covering the June

1988 to May 1991 period, as Bakshi, Cao and Chen (1997, BCC), comparison of out-of-

sample pricing errors provides some useful insights into the capability of LOR with

respect to more sophisticated risk-neutral models.

BCC evaluate the performance of alternative option pricing models incorporating

stochastic volatility (SV), stochastic volatility & stochastic interest rates (SVSI) and

stochastic volatility with jumps (SVJ) and compare their performance with Black-

Scholes-Merton (BSM). In their analysis, model parameters and implied volatility are

estimated from previous-day option prices and used to generate next-day prices.

While our analysis employs the DFW method for estimating volatility,

comparison with BCC is useful and informative because it uses the same option data.

BCC do not report overall pricing errors, but tabulate pricing errors by combinations of

23

18 maturity-moneyness categories. Ranges of (one-day out) pricing errors over these

combinations are $0.52-1.89 for BSM and $0.41-0.65 for SV. Their results show that the

BCC implementation of Black-Scholes is dominated by models with stochastic volatility.

The results noted in Section B above (Table IV) show that the overall out-of-

sample pricing error of the selected LOR model ($0.53) is in the mid-point of the range

for the SV model analyzed by BCC using the same sample. This comparison provides

further evidence that LOR modeling is highly competitive with more sophisticated risk-

neutral option models with stochastic volatility in the return process. Moreover, we show

below that the volatility smile problem is largely absent in LOR option prices.

G. The Volatility Smile & Pricing Error Regressions

An important empirical deficiency of the Black-Scholes model is the occurrence

of the volatility smile (or smirk), where the option’s implied volatility depends on the

value of the strike price, usually in a “smile” or “sneer” pattern. To what extent does this

problem occur in LOR option prices?

One way to examine the volatility smile issue is to compute the implied volatility

of option prices across strikes by inverting the Black-Scholes formula. Given the positive

monotonic relationship between volatility and option value, the smile effect in LOR and

PBS option prices may also be alternatively, and directly, analyzed from pricing errors in

price-strike space. We perform this analysis by testing for the following functional

relationship between pricing errors and the option’s moneyness ( SK / ) in the regression

iModel

ii SKSKVV εβββ +++=− 2210 )()( (17)

where Modelii VV − is the pricing error under the respective model (LOR or PBS). The

pricing error regression (17) is analogous to a paired t-test as the price difference

Modelii VV − cancels all common factors effecting option valuation (strike, maturity, index

24

value, interest rate) with the exception of volatility and the regression tests for the

residual’s dependence on the strike price. Further, the monotonic relationship between

option price and volatility ensures that the smile effect is uniquely captured by the error

regression (17).

Results from the LOR and BS models are reported in Table VII and the fitted

regression values are as follows:

PBS: =− Modelii VV -55.81 + 110.31 )( SK - 54.35 2)( SK (18)

LOR: =− Modelii VV -9.02 + 17.85 )( SK - 8.78 2)( SK (19)

The estimates reveal a very strong smile/sneer effect in PBS option prices while this

effect is virtually non-existent in LOR prices. The linear and quadratic smile parameters

( 1β and 2β , respectively) in the PBS regression are large and strongly significant with

significance probabilities less than .0001. For LOR, the same parameters are much

smaller in magnitude and are not statistically significant from zero.

Predicted values based on the LOR and PBS error regressions (18)-(19) are

plotted in Figure 2. The relatively flat error curve for LOR again points to the negligible

volatility smile effect in LOR option prices in contrast to PBS prices. If model prices

track market prices over moneyness (i.e. capture the volatility smile), then the pricing

error Modelii VV − should be flat with respect to moneyness. In Figure 2, the pricing error

Modelii VV − becomes inverted because PBS model prices PBS

iV over-shoot market prices

iV for away from the money options.

[Figure 2 about here]

Finally, results from our empirical analysis show that not only do LOR option

prices provide smaller pricing errors than the ‘Practitioner Black-Scholes’ (both in-

sample and out-of-sample), but LOR option prices are largely free of the volatility smile

effect.

25

V. CONCLUSION

This paper considers a simple empirical approach to modeling and predicting

market option prices using localized option regressions (LOR). LOR projects market

option prices over localized regions of their state space without requiring assumptions

regarding the asset price distribution (e.g. log-normality), volatility structure and hedging

behavior found in risk-neutral models. The model offers an efficient and robust data-

driven approach to valuing option books.

While the LOR model is deceptively simple, the method and our surprising

findings have not been reported in the previous literature. Our empirical study using

three years of daily S&P500 options (28,417 option prices) finds that LOR yields smaller

out-of-sample pricing errors (e.g. 32% one-day-out) than the efficient Black-Scholes

(PBS) implementation of Dumas, Fleming & Whaley (1998) that has become a common

benchmark for evaluating new option models in the literature (for example,

Christoffersen and Jacobs (2004) show that it empirically outperforms stochastic

volatility models). LOR out-of-sample pricing errors are significantly lower than PBS

(e.g. $0.53 vs. $0.70 one-day-out) and exceed LOR by 32.4%, 13.2% and 2.8% over one-

day, two-day and three-day out-of-sample horizons, respectively. Furthermore, we find

that the volatility smile is virtually non-existent in option prices generated by the LOR

model.

We also find that the corresponding LOR model without implied volatility

outperforms the PBS benchmark. Its overall pricing error one-day out is $0.59 while the

same for PBS is $0.70. This shows that LOR modeling without implied volatility retains

much of its efficiency with an error reduction of 19%. This specification has the

advantage of reducing modeling complexity as implied volatilities do not need to be

estimated and forecasted. LOR also compares favorably with stochastic volatility models

26

analyzed by Bakshi, Cao and Chen (1997) using the same S&P500 option data as in this

paper.

In addition to being an efficient and robust option modeling and valuation tool for

large option books, LOR provides a simple to implement empirical benchmark for

evaluating more complex risk-neutral models. Here, it follows that more complex option

pricing models derived from advanced theoretical arguments should be capable of, at the

very least, outperforming a relatively “agnostic” and simple empirical model in

predicting market option prices.

27

REFERENCES

Aït-Sahalia Y. and Lo, A. (1998). Non-parametric Estimation of State-Price Densities

Implicit in Financial Asset Prices. Journal of Finance, 53, 499-547.

Bakshi, G., Cao, C., and Chen, Z. (1997). Empirical Performance of Alternative Option

Pricing Models. Journal of Finance, 52, 2003-2049.

Bates, D. (1996). Jumps and Stochastic Volatility: Exchange Rate Processes Implicit in

Deusche Mark Options. Review of Financial Studies, 9, 68-107.

Black, F. and Scholes, B. (1973). The pricing of Options and Corporate Liabilities.

Journal of Political Economy, 81, 637-659.

Canina, L. and Figlewski, S. (1996). The Informational Content of Implied Volatilities.

Review of Financial Studies, 6, 659-681.

Carr, P. and Wu, L. (2004). Time-Changed Levy Processes and Option Pricing.

Journal of Financial Economics, 71, 113-141.

Christoffersen, P. and Jacobs, K. (2004). The Importance of the Loss Function in Option

Valuation. Journal of Financial Economics,72, 291-318.

Cox, J., S. Ross and Rubenstein, M. (1979). Option Pricing: A Simplified Approach.

Journal of Financial Economics, 7, 229-64.

Derman, E. and Kani, I. (1994). The Volatility Smile and its Implied Volatility Tree.

Quantitative Strategies Research Notes (Goldman Sachs, New York).

Duan, J. (1995). The GARCH Option Pricing Model. Mathematical Finance, 5, 13-32.

Duffie, D., Pan, J. and Singleton, K. (2000). Transform Analysis and Asset Pricing for

Affine Jump Diffusions. Econometrica, 68, 1343-1376.

Dupire, B. (1994). Pricing with a Smile. Risk, 7, 32-39.

Dumas, B. Fleming, J. and Whaley, R. (1998). Implied Volatility Functions: Empirical

Tests. Journal of Finance, 53, 2059-2106.

Heston, S. L. (1993). A Closed Form Solution for Options with Stochastic Volatility

with Applications to Bonds and Currency Options. Review of Financial Studies,

6, 327-43.

Heston, S. and Nandi, S. (2000). A Closed-form GARCH Option Pricing Model. Review

of Financial Studies, 13, 585-626.

Heynen, R. (1993). An Empirical Analysis of Observed Smile Patterns. Review of

28

Futures Markets, 317-353.

Hull, J. and White, A. (1987). The Pricing of Options on Assets with Stochastic

Volatilities. Journal of Finance, 42, 281-300.

Hutchinson, J. Lo, A. and Poggio, T. (1994). A Nonparametric Approach to Pricing and

Hedging Derivative Securities Via Learning Networks. Journal of Finance, 49,

851-889.

Jacquier, E. and Jarrow, R. (2000). Bayesian Analysis of Contingent Claim Model Error.

Journal of Econometrics, 94, 145-180.

Jackwerth, J. (2000). Recovering Risk Aversion from Option Prices and Realized

Returns. Review of Financial Studies 13 (2), 433-451.

Lamoureux, C. and Lastrapes, W. (1993). The Informational Content of Implied

Volatilities. Review of Financial Studies 6, 659-681.

Lehnert, T. (2003). Explaining Smiles: GARCH Option Prices with Conditional

Leptokurtosis and Skewness. Journal of Derivatives 10, 27-39.

Merton, R.C. (1973). Rational Theory of Option Pricing. Bell Journal of Economics and

Management Science, 4, 141-183.

Merton, R.C. (1976). Option Pricing When Underlying Stock Returns are Discontinuous.

Journal of Financial Economics 3, 125-44.

Ritchen, P. and R. Trevor (1999). Pricing Options Under Generalized GARCH and

Stochastic Volatility Processes. Journal of Finance, 54, 1, 377-402.

Rubenstein, M. (1994). Implied Binomial Trees. Journal of Finance, 49, 771-818.

29

Table I Localization to Moneyness-Maturity Groups

One-day out-of-sample pricing errors for S&P500 option prices from the LOR model are reported over different moneyness and maturity groupings. Moneyness is defined as the S&P500 index value divided by the option’s strike price ( KSm /= ) and maturity represents remaining days to option expiration.

Option Maturity Group

Moneyness Group Average Pricing

Error ($) { }]367,100(],100,7[=Τ { }]1.1,1(],1,9[.=Μ 0.57

{ }]1.1,05.1(],05.1,1(],1,95(.],95,.9[.=Μ 0.62 { }]367,100(],100,50(],50,7[=Τ { }]1.1,1(],1,9[.=Μ 0.53

{ }]1.1,05.1(],05.1,1(],1,95(.],95,.9[.=Μ 0.78 { }]367,200(],200,150(],150,100(],100,50(],50,7[=Τ { }]1.1,1(],1,9[.=Μ 0.92

{ }]1.1,05.1(],05.1,1(],1,95(.],95,.9[.=Μ 1.20

Table II Descriptive Statistics

Summary statistics are reported for closing daily S&P500 call options from June 2, 1988 to May 31, 1991. This sample consists of a total of 28,417 option prices and the same data is also used in the studies of Bakshi, Cao and Chen (1997) and Christoffersen and Jacobs (2004). Panel B reports the statistics by groups based on the option’s remaining time-to-maturity (in days) and its moneyness (S&P500 index value divided by option strike price). Panel C gives the corresponding number of options in each combination.

Panel A: Descriptive statistics Variable Units Mean Std Min Median Max Kurtosis Skew

Call Option Price (V ) $ 24.48 21.02 0.66 18.75 100.00 1.033 1.205 SP500 Index ( X ) 323.22 32.34 248.71 327.83 389.59 -0.799 -0.319 Exercise Price ( K ) 316.82 40.02 175.00 320.00 425.00 -0.540 -0.074 Risk-free Rate ( r ) 0.0772 0.0100 0.0252 0.0795 0.1009 0.550 -0.731 Time-to-maturity (τ ) Days 115 86 7 95 367 -0.338 0.759 Volatility (σ ) 0.2073 0.0684 0.0819 0.1917 0.8914 14.091 2.834 Panel B: Average call prices by Moneyness-Maturity categories Days-to-Maturity Moneyness ( KSm /= ) ]50,7[ ]100,50( ]150,100( ]200,150( 200>

]95.0,9.0( $1.52 3.12 6.79 9.77 15.40 ]0.1,95.0( $3.83 8.15 13.99 17.87 24.03 ]05.1,0.1( $12.75 18.19 24.40 28.35 34.45 ]1.1,05.1( $25.29 30.39 35.34 38.68 44.87

Panel C: Number of options ]95.0,9.0( 747 1,272 1,221 1,126 1,147 ]0.1,95.0( 3,376 1,962 1,380 1,231 1,198 ]05.1,0.1( 3,275 1,650 1,146 1,045 909 ]1.1,05.1( 2,426 1,173 910 662 677

30

Table III Estimation of Option Regressions

Estimation of the four reduced-form and structural option regression models (1)-(4) is reported over the complete option sample (28,417 options). Adjusted R-squares, RMSEs, parameter estimates, their standard errors and significance probabilities (p-values) are also reported.

Panel A: Reduced-form Option Regressions - with and without implied volatility (RLOR-V, RLOR) RLOR RLOR-V

Parameter Estimate S.E. p-value Estimate S.E. p-value Intercept -79.84047 1.58482 <.0001 -20.57762 1.29347 <.0001

S 1.04889 0.00959 <.0001 0.53545 0.007960 <.0001 K -0.41694 0.0066 <.0001 -0.47016 0.006480 <.0001 τ 0.12199 0.00201 <.0001 -0.007230 0.001820 <.0001 r -331.01583 10.19839 <.0001 102.80166 8.19313 <.0001 2S 0.00277 0.00001754 <.0001 0.003590 0.00001409 <.0001

K2 0.0034 0.00000957 <.0001 0.003470 0.00000876 <.0001 τ 2 -0.0001469 0.00000132 <.0001 -0.00012701 0.00000092 <.0001

SK -0.00724 0.00002028 <.0001 -0.007110 0.00001724 <.0001 τS -0.0001292 0.00000559 <.0001 0.00020119 0.00000438 <.0001

Sr 1.74005 0.04771 <.0001 0.78183 0.03998 <.0001 Kτ 0.00015371 0.00000399 <.0001 0.00001328 0.00000357 0.0002 Kr -1.00151 0.04131 <.0001 -0.99314 0.03740 <.0001

τr 0.01635 0.013 0.2085 0.39348 0.009740 <.0001 σ 48.84034 2.29227 <.0001

2σ -18.94554 0.87780 <.0001 στ 0.16482 0.002480 <.0001

Sσ -0.23264 0.006710 <.0001 Kσ 0.24567 0.005970 <.0001 rσ -98.5760 12.04575 <.0001

Adj. R-square 0.9917 0.9962 RMSE $1.9143 $1.2985

31

Panel B: Structural Option Regressions – with and without implied volatility (SLOR-V, SLOR)

SLOR SLOR-V Parameter Estimate S.E. p-value Estimate S.E. p-value Intercept 61.08867 2.58377 <.0001 117.79971 2.07702 <.0001

m -0.17671 0.01 <.0001 -0.62653 0.007990 <.0001 K -209.1852 2.49143 <.0001 -231.61593 2.20305 <.0001 τ 0.19747 0.00263 <.0001 0.022780 0.002330 <.0001 r -560.72895 17.41768 <.0001 -117.15934 14.75335 <.0001 2m -0.00109 0.00001342 <.0001 -0.00003114 .00001183 0.0085

K2 65.95335 0.86851 <.0001 101.45768 0.90662 <.0001 τ 2 -0.00014274 0.00000151 <.0001 -0.00012235 .00000116 <.0001

mK 0.82669 0.00442 <.0001 0.66122 0.004310 <.0001 τm 0.00004625 0.0000048 <.0001 0.00026438 .00000392 <.0001

mr 0.79347 0.03633 <.0001 -0.10808 0.02872 0.0002 Kτ -0.079 0.00131 <.0001 -0.048250 0.00135 <.0001 Kr 207.8841 13.83197 <.0001 186.71978 14.14917 <.0001

τr 0.00338 0.01484 0.8200 0.38908 0.01222 <.0001 σ 123.04548 3.00817 <.0001

2σ -11.20498 1.16542 <.0001 στ 0.18677 0.00312 <.0001 mσ 0.08025 0.00640 <.0001 Kσ -98.12362 2.10254 <.0001 rσ -101.3339 15.66606 <.0001

Adj. R-square 0.9891 0.9940 RMSE $2.1981 $1.6338

32

Table IV Average Pricing Errors of LOR and PBS Models

The in-sample and out-of-sample performance of the four candidate localized option regression (LOR) models (1)-(4) and the benchmark ‘Practitioner Black-Scholes’ (PBS) model of Dumas, Fleming and Whaley (1998, DFW) is reported. Estimation of LOR parameters is performed over non-overlapping 50-day windows by the maturity-moneyness clusters of Table I. In the in-sample analysis, daily volatilities are estimated from the DFW volatility regression (9) where implied volatility is regressed on linear and quadratic terms of option maturity and exercise price. In the out-of-sample analysis, next-day predicted volatilities from the DFW volatility regression are used in LOR models with implied volatilities (SLOR-V and RLOR-V). Out-of-sample LOR and PBS option values are generated using equations (10)-(14) and (15-(16), respectively, with DWF volatility parameters from the previous n-th day (n=1, 2, 3). The average pricing error ($PE) is the RMSE based on the difference between the option market price and the model price. The efficiency loss (EFF) is the percentage increase in average PBS pricing errors over LOR.

Panel A: In-Sample

Pricing Error (PE or RMSE)

Efficiency Gain (% EFF)

Year Mean PBS RLOR SLOR SLOR-V RLOR-V RLOR-V SLOR-V All $17.08 $0.48 $0.45 $0.45 $0.25 $0.25 48.40% 47.87% 1988 13.24 0.28 0.30 0.30 0.16 0.16 43.06 42.30 1989 16.23 0.48 0.39 0.39 0.21 0.21 56.06 55.64 1990 18.83 0.50 0.51 0.51 0.28 0.28 44.58 44.03 1991 18.20 0.55 0.50 0.50 0.30 0.30 46.97 46.35

Panel B: 1-Day Out All $17.48 $0.70 $0.59 $0.59 $0.53 $0.53 32.44% 31.98% 1988 13.56 0.31 0.46 0.46 0.37 0.37 -15.86 -15.96 1989 16.80 0.49 0.43 0.43 0.42 0.41 18.25 17.23 1990 19.23 0.87 0.69 0.68 0.63 0.63 38.28 38.15 1991 18.26 0.83 0.69 0.69 0.58 0.58 43.53 42.53

Panel C: 2-Days Out All $17.70 $0.78 $0.74 $0.75 $0.69 $0.69 13.10% 13.17% 1988 13.47 0.39 0.66 0.66 0.43 0.43 -8.79 -9.05 1989 16.34 0.54 0.47 0.48 0.48 0.48 12.28 11.54 1990 19.78 0.82 0.77 0.77 0.76 0.76 7.72 8.46 1991 19.48 1.22 1.11 1.11 0.99 0.99 24.36 23.83

Panel D: 3-Days Out All $13.52 $0.76 $0.65 $0.74 $0.74 $0.75 1.98% 2.87% 1988 16.14 0.43 0.67 0.65 0.54 0.55 -22.24 -21.76 1989 19.66 0.68 0.71 0.67 0.64 0.63 7.81 6.51 1990 18.84 0.76 0.98 0.71 0.85 0.87 -12.57 -10.43 1991 13.52 1.11 0.65 0.98 0.83 0.83 33.98 33.32

33

Table V In-Sample Performance by Maturity-Moneyness Groups

Pricing performance by maturity and moneyness categories is reported for the selected localized option regression model (3, RLOR-V) model and the benchmark ‘Practitioner Black-Scholes’ (PBS) model of Dumas, Fleming and Whaley (1998, DFW). Estimation of LOR parameters is performed over non-overlapping 50-day windows by the maturity-moneyness clusters of Table I. Daily volatilities are estimated from the DFW volatility regression (9) where implied volatility is regressed on linear and quadratic terms of option maturity and exercise price. The average pricing error ($PE) is the RMSE based on the difference between the option market price and the model price. The efficiency loss (EFF) is the percentage increase in average PBS pricing errors over LOR. The correlation of variation (CV) is the percentage pricing error relative to the mean option value ($PE/Call Mean). The data sample is based on daily closing S&P500 call options from June 1988 to May 1991.

Maturity (Days)

Money-ness (m=S/K)

LOR ($PE)

PBS ($PE)

EFF (%)

Call Mean

CV (%)

All All $0.25 $0.48 48.40% $17.08 2.80% ]95.0,9.0[ 0.17 0.20 14.4 1.52 13.26

Less than 50 ]0.1,95.0( 0.22 0.30 27.2 3.84 7.86 ]05.1,0.1( 0.20 0.32 36.8 12.75 2.49 ]1.1,05.1( 0.18 0.37 50.5 25.29 1.45 ]95.0,9.0[ 0.15 0.25 38.7 3.13 7.94

50 to 100 ]0.1,95.0( 0.16 0.29 45.1 8.17 3.58 ]05.1,0.1( 0.16 0.30 44.6 18.28 1.63 ]1.1,05.1( 0.17 0.44 62.5 30.49 1.45 ]95.0,9.0[ 0.33 0.80 58.3 10.58 7.57

More than ]0.1,95.0( 0.31 0.59 46.9 18.40 3.22 100 ]05.1,0.1( 0.28 0.47 39.8 28.68 1.63

]1.1,05.1( 0.27 0.53 48.8 39.19 1.36

34

Table VI Out-of-Sample Performance over Maturity-Moneyness Groups

Out-of-sample pricing performance by maturity and moneyness categories is reported for the best LOR model (3, RLOR-V) and the benchmark ‘Practitioner Black-Scholes’ (PBS) model of Dumas, Fleming and Whaley (1998, DFW). Estimation of LOR parameters is performed over non-overlapping 50-day windows by the maturity-moneyness clusters of Table I and uses next-day predicted volatilities from the DFW volatility regression (9). Out-of-sample LOR and PBS option values are generated with (12)-(14) and (15-(16), respectively, using DWF volatility parameters from the previous n-th day (n=1, 2, 3). The average pricing error ($PE) is the prediction RMSE based on the difference between the option market price and the model price. The data sample includes daily closing S&P500 call options from June 1988 to May 1991.

Maturity (Days)

Moneyness (m=S/K)

LOR ($PE)

PBS ($PE)

EFF (%)

Call Mean

CV (%)

Panel A: 1-Day Out All All $0.53 $0.70 32.44% $17.48 3.99%

]95.0,9.0[ 0.33 0.34 4.57 1.65 20.85 Less than 50 ]0.1,95.0( 0.44 0.44 1.06 4.00 11.05

]05.1,0.1( 0.49 0.47 -4.56 13.22 3.56 ]1.1,05.1( 0.49 0.57 15.38 25.64 2.22 ]95.0,9.0[ 0.33 0.46 40.15 3.25 14.32

50 to 100 ]0.1,95.0( 0.56 0.65 16.34 8.04 8.11 ]05.1,0.1( 0.48 0.63 30.71 18.58 3.38 ]1.1,05.1( 0.51 0.64 25.90 30.50 2.09 ]95.0,9.0[ 0.58 0.96 65.39 10.46 9.19

More than ]0.1,95.0( 0.55 0.89 63.32 18.79 4.75 100 ]05.1,0.1( 0.51 0.65 27.57 28.80 2.26

]1.1,05.1( 0.72 0.88 20.99 38.92 2.25 Panel B: 2-Days Out

All All $0.69 $0.78 13.10% $17.70 4.38% ]95.0,9.0[ 0.45 0.43 -3.82 1.34 32.29

Less than 50 ]0.1,95.0( 0.64 0.56 -12.66 4.04 13.76 ]05.1,0.1( 0.63 0.63 -0.20 13.03 4.85 ]1.1,05.1( 0.52 0.66 27.00 25.72 2.58 ]95.0,9.0[ 0.40 0.35 -10.79 2.99 11.80

50 to 100 ]0.1,95.0( 0.60 0.50 -16.46 8.00 6.21 ]05.1,0.1( 0.92 0.62 -32.98 18.27 3.38 ]1.1,05.1( 1.00 0.85 -15.77 30.58 2.76 ]95.0,9.0[ 0.74 1.04 40.85 10.48 9.94

More than ]0.1,95.0( 0.67 1.05 57.21 18.66 5.64 100 ]05.1,0.1( 0.62 0.71 13.78 28.82 2.45

]1.1,05.1( 0.82 0.88 7.27 39.43 2.23

35

Maturity

(Days) Moneyness

(m=S/K) LOR ($PE)

PBS ($PE)

EFF (%)

Call Mean

CV (%)

Panel C: 3-Days Out All All $0.75 $0.76 1.98% $17.52 4.37%

]95.0,9.0[ 0.39 0.37 -7.02 1.45 25.33 Less than 50 ]0.1,95.0( 0.61 0.48 -21.40 3.87 12.48

]05.1,0.1( 0.57 0.50 -12.59 12.81 3.87 ]1.1,05.1( 0.60 0.58 -4.58 25.03 2.30 ]95.0,9.0[ 0.74 0.30 -59.28 2.87 10.46

50 to 100 ]0.1,95.0( 0.73 0.37 -49.90 7.65 4.79 ]05.1,0.1( 1.26 0.50 -60.18 18.36 2.73 ]1.1,05.1( 1.44 0.67 -53.45 30.61 2.19 ]95.0,9.0[ 0.72 1.24 71.94 10.27 12.10

More than ]0.1,95.0( 0.68 1.07 56.11 18.68 5.72 100 ]05.1,0.1( 0.66 0.73 11.93 28.42 2.58

]1.1,05.1( 0.65 0.73 13.29 39.38 1.86

36

Table VII Regressions of Pricing Errors and the Volatility Smile

The volatility smile effect is LOR and PBS out-of-sample prices is evaluated using the pricing error regression (17). Option pricing errors Model

ii VV − (difference between market and model prices) are regressed on linear and quadratic terms of SK / (strike over price). The regression is analogous to a paired t-test as the price difference Model

ii VV − cancels all common factors effecting option valuation (strike, maturity, index, discount rate) with the exception of volatility, and the residual difference is tested for dependence on strike. The monotonic relationship between option price and volatility ensures that the smile effect is uniquely captured by the price difference Model

ii VV − . The volatility smile is absent if both the linear and quadratic terms of the pricing error regression are not significant.

Model Variable Estimate Standard Error

t-value p-value

PBS Intercept -55.8092 9.1909 -6.07 <.0001 SK / 110.3114 18.3272 6.02 <.0001 ( )2/ SK -54.3526 9.1183 -5.96 <.0001

LOR Intercept -9.0174 7.1079 -1.27 0.2050

SK / 17.8450 14.1737 1.26 0.2084 ( )2/ SK -8.7806 7.0518 -1.25 0.2135

37

Figure 1 Pricing Errors from the LOR and PBS Models by Option Moneyness

Out-of-sample pricing errors from the LOR and PBS models are plotted by option moneyness. The pricing errors are S&P500 market option prices minus model prices (LOR or PBS) and moneyness is defined as the S&P500 index over the option strike price ( KS / ).

38

Figure 2 The Volatility Smile in LOR & PBS Prices

Predicted pricing errors Model

ii VV − are based on estimated parameters of the pricing error regressions (17). They are plotted for both LOR and PBS models against SK / (option strike over price). In the absence of a volatility smile/sneer effect, the differences in market and model option prices ( Model

ii VV − ) would be flat and not vary with the option’s strike (see Table VII for statistical tests). If model prices track market prices over moneyness (i.e. capture the volatility smile), then the pricing error Model

ii VV − should

be flat with respect to moneyness. For the PBS model, the pricing error Modelii VV − becomes inverted

because PBS option prices PBSiV over-shoot market prices iV for away from the money options.

regression-based modeling of market option prices: … modeling of market option prices: with...

Documents