garch vola measurepdf
TRANSCRIPT
-
8/3/2019 Garch Vola Measurepdf
1/26Electronic copy available at: http://ssrn.com/abstract=1941699
Swiss Finance InstituteResearch
Paper Series N11 4
5
Forecasting Financial TimeSeries: Normal GARCH with
Outliers or Heavy TailedDistribution Assumptions?Christoph HARTZUniversity of MunichMarc S. PAOLELLAUniversity of Zurich and Swiss Finance Institute
-
8/3/2019 Garch Vola Measurepdf
2/26Electronic copy available at: http://ssrn.com/abstract=1941699
Established at the initiative of the Swiss Bankers' Association, the Swiss
Finance Institute is a private foundation funded by the Swiss banks and
Swiss Stock Exchange. It merges 3 existing foundations: the International
Center FAME, the Swiss Banking School and the Stiftung "Banking and
Finance" in Zurich. With its university partners, the Swiss Finance Institute
pursues the objective of forming a competence center in banking and
finance commensurate to the importance of the Swiss financial center. It will
be active in research, doctoral training and executive education while also
proposing activities fostering interactions between academia and the
industry. The Swiss Finance Institute supports and promotes promising
research projects in selected subject areas. It develops its activity in
complete symbiosis with the NCCR FinRisk.
The National Centre of Competence in Research Financial Valuation and
Risk Management (FinRisk) was launched in 2001 by the Swiss National
Science Foundation (SNSF). FinRisk constitutes an academic forum that
fosters cutting-edge finance research, education of highly qualified finance
specialists at the doctoral level and knowledge transfer between finance
academics and practitioners. It is managed from the University of Zurich and
includes various academic institutions from Geneva, Lausanne, Lugano,
St.Gallen and Zurich. For more information see www.nccr-finrisk.ch .
This paper can be downloaded without charge from the Swiss Finance
Institute Research Paper Series hosted on the Social Science Research
Network electronic library at:
http://ssrn.com/abstract=1658362
-
8/3/2019 Garch Vola Measurepdf
3/26Electronic copy available at: http://ssrn.com/abstract=1941699
Forecasting Financial Time Series: Normal GARCH with Outliers
or Heavy Tailed Distribution Assumptions?
Christoph Hartza Marc S. Paolellaa,b,
aDepartment of Statistics, University of Munich, Munich, GermanybDepartment of Banking and Finance, University of Zurich, Switzerland
cSwiss Finance Institute
September 2011
Abstract
The use of GARCH models is widely used as an effective method for capturing
the volatility clustering inherent in financial returns series. The residuals from such
models are however often non-Gaussian, and two methods suggest themselves for
dealing with this; outlier removal, or use of non-Gaussian innovation distributions.
While there are benefits to both, we show that the latter method is better if interest
centers on volatility and value-at-risk prediction. New volatility measures based on
OHLC (open high low close) data are derived and used. Use of OHLC measures are
shown to be superior to use of the naive estimator used in other GARCH outlierstudies.
KeywordsOutliers, fat-tailed distributions, GARCH, volatility.
Corresponding author. E-mail address: [email protected]. Part of this research has been carriedout within the National Centre of Competence in Research Financial Valuation and Risk Management (NCCRFINRISK), which is a research program supported by the Swiss National Science Foundation.
-
8/3/2019 Garch Vola Measurepdf
4/26
1 INTRODUCTION 2
1 Introduction
The growth of interest in modeling and forecasting financial time series continues unabated, both
in academic and financial institutions, with a steady proliferation of models being proposed andtested for use in the prediction of volatility and Value-at-Risk, hereafter VaR. It is virtually
undisputed that returns of financial time series sampled at weekly or higher frequencies deviate
markedly from being independently and identically distributed (iid), most notably in the form
of (i) blatant volatility clustering, which is also reflected by high autocorrelation of absolute and
squared returns; (ii) substantial kurtosis, i.e., the density of the unconditional return distribu-
tion is more peaked around the center and possesses much fatter tails than the normal density;
and (iii) mild skewness. With respect to the first of these, use of (generalized) autoregressive
conditional heteroscedasticity (GARCH) models and their numerous extensions constitute byfar the most popular modeling method. While they are indeed quite capable of accounting for
the volatility clustering and, thus, a significant amount of the excess kurtosis (compared to the
normal distribution) andfor asymmetric GARCH generalizationssome of the skewness, the
residuals of fitted GARCH models are most often still leptokurtic and skewed. Because of this, it
is commonplace to allow the innovations process in the GARCH model to be an iid sequence of
random variables from a distributional class which still contains normality as a special case, but
is otherwise fat-tailed and asymmetric.
A different way which is, in some respects, more in line with the traditional approach in
statistics to handle aberrant observations not in accord with the normal distribution, is to
classify them as outliers and remove them from the analysis. Several authors have contributed
to this idea. Frances and Ghijsels (1999) proposed an Additive Outlier GARCH model, in short
AO-GARCH, driven by normal innovations. Their method is based on that of Chen and Liu
(1993) for detecting different types of outliers in time series and is described in more detail in
Section 2 below. Their model was extended to support innovative outliers, also based on the Chen
and Liu (1993) approach, by Charles and Darne (2005a); see also Charles and Darne (2005b) for
further analysis of this model. Alternative approaches, involving outlier removal either before or
after applying a GARCH filter, are discussed in Carnero et al. (2001).
We concentrate herein on the method in Frances and Ghijsels (1999). They fit their model
to the returns on four weekly European stock indices and demonstrated that volatility forecasts
are more accurate with the AO-GARCH model than when using the normal-GARCH ormore
importantlya GARCH model driven by Students t innovations with (jointly) estimated degrees
of freedom parameter, denoted t-GARCH. This finding would have important consequences for
risk management, given the predominance of t-GARCH and other variants for deriving VaR
-
8/3/2019 Garch Vola Measurepdf
5/26
1 INTRODUCTION 3
predictions.
If the true data generating process (DGP) is (or, realistically, is closer to) a normal-GARCH
process with occasional additive outliers, then it is certainly reasonable to expect that an AO-
GARCH model with estimated parameters will result in better out-of-sample forecasts than al-
ternative models such as t-GARCH. However, the validity of concluding the converse, i.e., that
improved forecasting ability of model A over model B implies that model A is a better descrip-
tion of the DGP than model B, strongly depends on the method of forecast measurement. In
the case of Frances and Ghijsels (1999), they compare the predicted volatility of the competing
models with the absolute return (demeaned by a long time average) as a proxy for the realized
volatility. This type of realized volatility has been, unfortunately, shown to be highly inaccu-
rate by Andersen and Bollerslev (1998). This simplistic, faulty measure was also used by Park
(2002) to demonstrate superiority of his outlier robust GARCH model for exchange rate returns.
Ideally, intra-day transactions would be used to construct volatility measures, because they have
been shown to be an excellent method of eliciting the true, unobservable, daily volatility (An-
dersen et al., 2003). For the data used in our study below, intra-day prices are not available,
so we instead follow Parkinson (1980) and Wiggins (1991) and make use of the OHLC-prices
(open-high-low-close) to obtain better predictions of the true volatility.
Another point of contention with Frances and Ghijsels (1999) is that their simulation studies
consist only of estimating and detecting outliers in the GARCH(1,1) model from simulated,
outlier-infected GARCH data and subsequently verifying that the procedure is indeed able to
successfully label and remove most of the actual generated outliers. While this is important
and useful, they do not study the arguably more important issue of the behavior of their model
and detection procedure, and the quality of the ensuing forecasts, assuming a data generating
process which closely mimics the stylized facts of financial time series. In particular, one could
consider data generated from a GARCH-type model with fat-tailed innovations such as from
the generalized asymmetric Students t, or GAt (Mittnik and Paolella, 2000), the asymmetric
stable Paretian (see Doganoglu et al., 2007; Broda et al., 2011; and the references therein), or the
generalized hyperbolic (see Broda and Paolella, 2009; Yang, 2011; Paolella and Polak, 2011; and
the references therein). In such a case, one would expect the outlier-based method to perform
poorly because it will ultimately remove the extreme data pointswhich are precisely those
which become relevant for understanding and modeling the tail behavior of the distribution and,
consequently, for prediction of risk.
The aim of this paper is to show via the use of more appropriate measures of volatility and
the use of several real data sets that the AO-GARCH method is inferior to the use of fat-tailed
-
8/3/2019 Garch Vola Measurepdf
6/26
2 MODELS UNDER INVESTIGATION 4
GARCH models with respect to volatility forecasting and VaR prediction. An added plus to
this conclusion is that the AO-GARCH model is, relatively to other GARCH models, far more
time-intensive to estimate and also involves the use of a user-decided tuning parameter for the
threshold of outlier determination which is decisive for its performance.
The remainder of this paper is as follows. In Section 2, the models under comparison are
briefly introduced, while Section 3 presents the time series used for comparison and discusses
the in-sample results for both models. In Section 4, the out-of-sample performance of both
models with respect to volatility forecasting and VaR prediction is discussed in detail. Section 5
concludes.
2 Models under Investigation
Let the observed time series be denoted by rt, t = 1, . . . , T , where rt = 100(ln Pt ln Pt1) isthe usual definition of the one-period return of the asset with price Pt at time t. The normal-
GARCH(p,q) model assumes
rt = + t, t = ztt, (1)
2t = c0 +
pi=1
ci2ti +
qj=1
dj2tj , (2)
where ztiid
N(0, 1). For the vast majority of applications, p = q = 1 is deemed adequate. For thedata sets considered in Section 3, this choice was also found to be appropriate.
As discussed in the Introduction, a popular method of augmenting the normal-GARCH model
to accommodate the typically observed non-normality of the residuals is simply to replace the
normal distribution with one which can exhibit asymmetry and tails fatter than those of the
normal. We use the GAt distribution, which generalizes Students t distribution by adding an
additional shape and and asymmetry parameter. In particular, its density is given by
fGAt(z; d,,) = K
1 + (z)d
(+
1
d)
, if z < 0,1 + (z/)
d
(+ 1d)
, if z 0,(3)
with K1 = ( + 1)d11/d B(d1, ). The density is asymmetric for = 1 and coincides with
the usual Students t density with m degrees of freedom with m = /2, d = 2 and = 1. The
raw integer moments are given by
E[Zr] =(1)r(r+1) + r+1
1 +
B(
r+1d , rd
)B(1d , ) r/d (4)
for r < d; see Paolella (2007) for derivation and further details of the GAt.
-
8/3/2019 Garch Vola Measurepdf
7/26
2 MODELS UNDER INVESTIGATION 5
We now briefly describe the AO-GARCH model proposed by Frances and Ghijsels (1999),
which is based on the method of Chen and Liu (1993) for detecting additive outliers in ARMA
time series. Let time series yt be given by the ARMA(r, s) model r(L)yt = s(L)t, where t is
a white noise process and the roots of r(L) and s(L) lie outside the unit circle. This series is
not observed, but rather yt, where
yt = yt + It(), (5)
with It() = 1 for t = and zero otherwise. That is, the observed series is an ARMA process
plus an additive outlier. By modeling the observed series, yt, with an ARMA(r, s) model, the
estimated residuals t = (L)yt, = t + (L)It() are obtained, with (L) = (L)/(L), from
which the estimated impact, (), can be elicited from the regression t = xt + t, with xt = 0
for t < , xt = 1 for t = , and xt+k = k for t and k = 1, 2, . . . .Outlier detection involves testing the significance of the (). Chen and Liu (1993) propose
three ways for estimating the residual standard deviation, a, from which Frances and Ghijsels
(1999) use the omit-one method, i.e., use the sample variance and omit the observation at t = .
The test statistic they obtain is given by
= [()/a]
/[T
t=
x2t
]1/2, (6)
where the impact of an AO is deemed significant if exceeds the critical value C, which is setequal to 4 by Frances and Ghijsels (1999). If the impact is significant, then the observation yt
can be adjusted to obtain the AO-corrected yt = yt It().Frances and Ghijsels (1999) apply the aforementioned AO correction for ARMA models to
the normal GARCH(1,1) model by defining t = 2t 2t and rewriting the GARCH equation to
obtain
2t = c0 + (c1 + d1)2t1 + t d1t1. (7)
This corresponds to an ARMA(1,1) model for
2
t , to which the AO correction is applied. Afterestimating a GARCH(1,1) model for the original series t, they calculate t =
2t 2t and
(L) = (1 (c1 + d1)L)/(1 d1L) and perform the regression for the estimated residuals t forevery t = to obtain () and the test statistic . Then, t is replaced by
t for that t = with
the largest value of > C = 4. Next, the series 2t = t +
2t is constructed to calculate the
AO-corrected series, where
t =
t, for t = ,
sign(t)(2t )
1/2, for t = .(8)
-
8/3/2019 Garch Vola Measurepdf
8/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 6
For the corrected series, t , the procedure of estimating the normal GARCH(1,1) model and the
correcting for AO is repeated until no test statistic exceeds the critical value C = 4. The
estimated GARCH(1,1) model for the final AO corrected series t is then used for forecasting
purposes.
In the subsequent empirical analysis, we follow Frances and Ghijsels (1999) and use C = 4.
It is, however, important to realize that estimation and forecasting results for the AO-GARCH
model are highly dependent on the choice of C and no objective criteria for choosing a correct
value of C has been proposed.
3 Data Analysis and In-Sample Results
The data used in our study are based on the daily prices of five major European stock indices:
the German DAX, the Dutch AEX, the French CAC40, the Italian MIBTEL and the British
FTSE100. While the ending periods of each coincide (last trading day in 2001), the starting
dates differ because of data availability and are, respectively, Jan. 1992, Jan. 1993, Jan. 1991,
Jan. 1994 and Jan. 1992.
Figure 1 plots the returns for each of the five series, from which the volatility clustering is quite
blatant. The usual measures of sample skewness and kurtosis were also computed; each series
exhibits negative skewness, i.e., has a longer left tail, and a sample kurtosis which is considerably
larger than the appropriate value of three for the normal distribution.1
With respect to computing time, estimation of the GAt-GARCH model takes about five
times longer than normal-GARCH. Time for estimation of the AO-GARCH model with normal
innovations depends mainly on the number of outliers detected, as each detected outlier invokes a
normal-GARCH estimation and T regressions for detecting subsequent outliers. This is, in turn,
dictated by the choice of tuning parameter C. Roughly speaking, with C = 4 we found about 5%
of the returns under investigation to be outliers. With that one can say that the AO-GARCH
model takes more than 0.05T times longer than estimation of the normal-GARCH model. (All
programs were written in Matlab and compiled to increase execution speed.)
Because of the nature of the AO-GARCH method of fitting a model to the data, it does
not lend itself to likelihood-based comparisons. In particular, after the outliers are chosen and
removed, the resulting fit isessentially by constructionexcellent. As such, for in-sample com-
parisons, we will study the choice of outliers removed by the AO-GARCH method, and argue
1 Standard errors and classical inference methods to assess the significance of the deviation of these statisticsfrom their null values corresponding to normality are not reliable in this context because the data are clearlynot iid, and the existence of low moments is not at all certain. As such, the statistics are not presented.
-
8/3/2019 Garch Vola Measurepdf
9/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 7
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
DAX
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
AEX
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
CAC40
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
MIBTEL
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
FTSE100
Figure 1: Return plots
that it is inappropriate for financial return data.
Table 1 presents the estimated parameters and several test statistics for the AO-GARCH
model applied to each of the five time series under study. For every index there are two columns,
labeled 0 and AO, with 0 referring to no outlier detection and use of just the usual normal-GARCH
model, and AO, which refers to the final iteration after full AO detection and correction. For
all five series, the mean of the standardized residuals as well as the mean parameter is not
-
8/3/2019 Garch Vola Measurepdf
10/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 8
Table 1: Estimation results for the normal-GARCH and the AO-GARCH modelsa
DAX(0/AO)
AEX(0/AO)
CAC40(0/AO)
MIBTEL(0/AO)
FTSE100(0/AO)
0.067(0.020)
/ 0.064(0.019)
0.070(0.019)
/ 0.078(0.018)
0.041(0.022)
/ 0.059(0.019)
0.053(0.027)
/ 0.039(0.025)
0.043(0.016)
/ 0.043(0.015)
c0 0.023(0.006)
/ 0.009(0.004)
0.016(0.005)
/ 0.010(0.004)
0.054(0.015)
/ 0.008(0.005)
0.092(0.023)
/ 0.048(0.017)
0.010(0.004)
/ 0.004(0.002)
c1 0.091(0.012)
/ 0.059(0.009)
0.101(0.013)
/ 0.065(0.013)
0.066(0.012)
/ 0.028(0.007)
0.145(0.020)
/ 0.076(0.015)
0.065(0.010)
/ 0.039(0.009)
d1 0.897(0.012)
/ 0.934(0.011)
0.891(0.013)
/ 0.924(0.016)
0.900(0.019)
/ 0.965(0.009)
0.814(0.024)
/ 0.890(0.024)
0.925(0.012)
/ 0.955(0.011)
mean 0.022/0.013 0.015/0.011 0.010/0.008 0.020/0.006 0.020/0.015var 0.998/ 0.997 0.997/ 0.997 1.003/ 1.002 1.000/ 1.001 0.997/ 0.998
skew 0.242/0.158 0.399/0.130 0.353/0.032 0.249/0.050 0.074/0.050kurt 4.230/ 2.721 3.992/ 2.577 5.057/ 2.480 5.279/ 2.603 4.072/ 2.402
LB10 15.02(0.13)
/ 12.64(0.24)
14.00(0.17)
/ 14.19(0.16)
19.29(0.04)
/ 18.23(0.05)
25.61(0.00)
/ 15.86(0.10)
25.29(0.00)
/ 25.35(0.00)
QLB10 6.25(0.79)
/ 8.53(0.58)
15.12(0.13)
/ 8.76(0.56)
5.66(0.84)
/ 10.14(0.43)
6.65(0.76)
/ 16.94(0.08)
6.41(0.78)
/ 9.90(0.45)
# AO 99(3.9%)
118(5.2%)
128(4.6%)
74(3.7%)
152(6.0%)
# AO 51(2.0%)
62(2.7%)
71(2.6%)
35(1.7%)
77(3.1%)
# AO+ 48(1.9%)
56(2.5%)
57(2.1%)
39(1.9%)
75(3.0%)
R(AO) 0.06 0.00 0.00 0.00 0.00
R(+) 0.92 0.00 0.00 0.00 0.00a For every index there are two columns, labeled 0 for normal-GARCH model without any outlier detectionand AO for final iteration after full outlier detection and correction.Rows 1 to 4 contain the parameter estimates (standard errors given in parenthesis); rows 5 to 10 givestatistics for the standardized residuals, where LB10 (QLB10) is the Ljung-Box test statistic of order 10 forthe (squared) standardized residuals (p-values are given in parentheses); rows 11 to 13 give the number ofoutliers (# AO), the number of negative outliers (# AO-), and the number of positive outliers (# AO+)detected and corrected by the AO-GARCH model; and rows 14 and 15 give the p-values for the runs-teststatistic for the sequence of outliers, R(AO), and the sequence of positive and negative outliers, R(+).
significantly affected by the outlier correction. This and the number of outliers of either sign
(#AO+ and #AO-) indicate that the method is equally likely to detect a negative or a positive
outlier.
For the fitted AO-GARCH model, the constant term of the GARCH equation, c0, and the
ARCH term, c1, are smaller for all five indices in comparison to the normal-GARCH model,
whereas the GARCH parameter d1 is greater after the outlier correction. This finding is in line
with Figure 2, where the corrected return series are plotted. It is obvious that the AO correction
-
8/3/2019 Garch Vola Measurepdf
11/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 9
removes all of the peaks of the original series. By cutting these peaks, any information about
the DGP contained in these extreme returns is lost. If these observations are indeed genuine
outliers resulting from, say, incorrect data entry, or (far more likely) correspond to information
arrivals via political or economic events which are truly exceptional and will not have any relevance
for future modeling, then the procedure apparently succeeds admirably in their removal.
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
DAX
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
AEX
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
CAC40
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
MIBTEL
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
5
0
5
FTSE100
Figure 2: AO corrected return plots
The standardized residuals of the normal-GARCH model still exhibit some of the properties as
-
8/3/2019 Garch Vola Measurepdf
12/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 10
the return series themselves. For all five indices, they have negative skewness and higher kurtosis
than imposed by the normal distribution. After applying the AO-correction, the negative skewness
is reduced and the kurtosis drops down below the anticipated value of three for the normal
distribution, which leads to the conjecture that the critical value C = 4 may be too small and
too many returns are corrected by the procedure. Hence, the AO-correction works as expected:
All observations not in line with the assumed model and error distribution are corrected.
As indicated by the Ljung-Box test statistic for the squared standardized residuals for order
10 (QLB10) and corresponding p-values, there is no heteroscedasticity left in the standardized
residuals of the normal-GARCH model for all five time series. In comparison to the normal-
GARCH model, the QLB10 test statistic for the AO-GARCH model rises for all indices except
for the AEX index. For the MIBTEL index, QLB10 = 16.94, with p-values 0.08, indicating that
there is still an amount of heteroscedasticity remaining in the standardized AO-GARCH model
residuals.
The occurrence of the detected and removed outliers from the AO-GARCH model is illustrated
in Figure 3. From the plot, it is strikingly evident that their positions are not randomly distributed
throughout the sample, but rather occur in clusters. This undermines the presumption that
outliers are sprinkled throughout the data in a random, unpatterned fashion, but rather that
the AO-GARCH model is removing observations occurring in groups of returns with larger than
average heteroscedasticity. It is hardly tenable that these observations are outliers in the sense
that their consideration would only corrupt the underlying signal in the data, but rather that
they are an intimate part of the time-varying volatility phenomena containing information both
about the GARCH parameters and, separate from any particular parametric model, about the
actual probability of extreme events.
To statistically test whether the observations deemed to be outliers are occurring at random
or are clustered, define a new time series {Bt} with Boolean elements, depending on whether anoutlier is detected or not, i.e., Bt = 1 if an outlier is detected, and zero otherwise. The runs test
statistic
R(AO) =s 2T p0p1
2
np0p1
asy N(0, 1) (9)
can then be applied, where p0 and p1 are the proportions of zeros and ones, respectively and s is
the number of runs.2
We also test for the random occurrence of positive and negative outliers. Let {St} be thetime series such that St = sign(rt)Bt. With p1 and p1 the proportion of negative and positive
2 See Paolella (2006, Sec. 6.3) for the exact distribution of this and related runs distributions, original references,further details and applications.
-
8/3/2019 Garch Vola Measurepdf
13/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 11
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
1
0
1
DAX
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
1
0
1
AEX
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
1
0
1
CAC40
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
1
0
1
MIBTEL
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002
1
0
1
FTSE100
Figure 3: Outlier occurrence: a cross (x) marks the occurrence of an outlier of eithersign, whereas a plus (+) marks the same outliers, but differenced by sign. So,the cross series sums the plus series.
outliers, respectively, and p0 and s are defined as before, the test statistic is given by
R(+) =s n
1
1i=1
p2i
n
1
i=1p2i 2
1
i=1p3i +
1
i=1p2i
2 asy N(0, 1). (10)
-
8/3/2019 Garch Vola Measurepdf
14/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 12
With the single exception of the DAX index, the statistical significance of the resulting test
statistics provide evidence to suggest that the occurrence of the outliers is non-random and that
useful information about the process is lost by removing the outliers.
Table 2: Estimation results for the GAt-GARCH modela
DAX AEX CAC40 MIBTEL FTSE100
0.178(0.035)
0.203(0.032)
0.179(0.045)
0.097(0.049)
0.133(0.030)
c0 0.022(0.009)
0.022(0.008)
0.055(0.020)
0.133(0.039)
0.014(0.006)
c1 0.133(0.021)
0.154(0.024)
0.101(0.019)
0.202(0.035)
0.109(0.025)
d1 0.913(0.013)
0.901(0.015)
0.919(0.016)
0.840(0.026)
0.930(0.011)
5.879(0.000)
5.991(0.187)
5.968(0.299)
5.979(0.338)
5.981(3.068)
d 2.019(0.061)
2.060(0.090)
2.029(0.084)
2.016(0.094)
2.100(0.455)
0.922(0.020)
0.903(0.021)
0.924(0.023)
0.966(0.025)
0.926(0.022)
mean(theo.)
0.097(0.098)
0.117(0.122)
0.098(0.094)
0.042(0.041)
0.094(0.091)
var(theo.)
0.610(0.601)
0.591(0.587)
0.607(0.595)
0.606(0.595)
0.574(0.570)
skewness(theo.)
0.235(0.180)
0.398(0.217)
0.387(0.172)
0.249(0.077)
0.074(0.159)
kurtosis(theo.)
4.350(3.762)
4.133(3.671)
5.499(3.720)
5.355(3.725)
4.086(3.576)
LB10 14.96(0.13)
13.21(0.21)
19.21(0.04)
26.46(0.00)
25.61(0.00)
QLB10 4.53(0.92)
13.47(0.20)
5.16(0.88)
6.96(0.73)
6.76(0.75)
a Rows 1 to 7 contain the parameter estimates (standard errors given in parenthesis); rows 8to 14 give statistics for the standardized residuals, where LB10 (QLB10) is the Ljung-Box teststatistic of order 10 for the (squared) standardized residuals (p-values are given parenthesis);
We now turn to the analysis of the fitted GAt-GARCH models. The estimation results are
given in Table 2. Denote by zt the residuals of the estimated model. If the GARCH model is
appropriate, then the zt should be approximately iid with zero location, unit scale, and density
given by the GAt density with the estimated values of the shape and skewness parameters (these
having been jointly estimated with the GARCH parameters). As for the normal-GARCH model
and the AO-GARCH model, the values for the mean and the variance are close to the theoretical
values computed from (4) with the estimated parameter values. But, in addition, the GAt
distribution is able to capture parts of the skewness and the excess kurtosis. Figure 4 shows the
-
8/3/2019 Garch Vola Measurepdf
15/26
3 DATA ANALYSIS AND IN-SAMPLE RESULTS 13
kernel and fitted probability density functions of the zt. It is visually clear that the model fits
the data quite well.
4 3 2 1 0 1 2 3 40
0.1
0.2
0.3
0.4
0.5
0.6
DAX
4 3 2 1 0 1 2 3 40
0.1
0.2
0.3
0.4
0.5
0.6
AEX
4 3 2 1 0 1 2 3 40
0.1
0.2
0.3
0.4
0.5
0.6
CAC40
4 3 2 1 0 1 2 3 40
0.1
0.2
0.3
0.4
0.5
0.6
MIBTEL
4 3 2 1 0 1 2 3 40
0.1
0.2
0.3
0.4
0.5
0.6
FTSE100
Figure 4: Fitted GAt density (solid) vs. empirical kernel density (dashed) for thestandardized residuals.
-
8/3/2019 Garch Vola Measurepdf
16/26
4 OUT-OF-SAMPLE FORECASTING RESULTS 14
4 Out-of-Sample Forecasting Results
4.1 Volatility Forecasts
Judging the quality of a forecast of the volatility obviously necessitates a benchmark value against
which it can be compared. As discussed in detail by Andersen and Bollerslev (1998), the com-
monly used proxy for the true unobservable daily volatility,
vst = |rt r|, r =1
T
rt, (11)
is a poor measure for the true daily volatility owing to its enormous variance (although it is un-
biased). This is, unfortunately, the measure employed by Frances and Ghijsels (1999). Andersen
and Bollerslev investigate and recommend the use of intra-day returns for measuring daily volatil-
ity. However, because of the limited availability of such data, we propose four different volatility
measures, each based on the widely available OHLC (open-high-low-close) data for stock prices
to approximate the realized volatility.
pot
pht
plt
pct
Figure 5: Diagrammatic occurrence of OHLC-Prices: pot (pht , p
lt and p
ct ) denote the
logarithmic open (high, low and close) price at time t.
The first approach is to use the four available prices per day to construct six possible intra-day
returns, i.e. the returns that may occur by intra-day trading and using the limited information
set of the OHLC-prices. These artificial returns are then used in conjunction with the usual
method for calculating a sample standard deviation, i.e.,
vat =
16
(x,y)A
(r(x,y)t E[rt])2, A = {(o, c), (o, h), (o, l), (h, l), (h, c), (l, c)}, (12)
where r(x,y)t = 100 |pyt pxt | for (x, y) A, are the absolute values of the six artificial returns
between two realized prices of one day.
-
8/3/2019 Garch Vola Measurepdf
17/26
4 OUT-OF-SAMPLE FORECASTING RESULTS 15
The second approach also uses the four available prices of one day, as well as the closing price
of the previous day, to derive four artificial returns in terms of an inter-day trading with limited
information, i.e., the returns that may occur when trading takes place between the closing price
the day before, and one of four possible prices today. As before, the standard deviation of these
returns is used as a proxy for todays realized volatility:
vet =
14
(x,y)E
(r(x,y)t E[rt])2, E = {(c, c), (c, o), (c, l), (c, h)}, (13)
where r(x,y)t = 100 |pyt pxt1| with (x, y) E, are the absolute values of the four possible
inter-day returns.
A third way to calculate the daily volatility is a mixture of the two previously mentioned
approaches, which we refer to as the mixed volatility measure :
vmt =
110
(x,y)M
(r(x,y)t E[rt])2, M = A + E, (14)
where r(x,y)t are the ten possible intra- and inter-day returns as defined before.
The three aforementioned approaches are straightforward and obviously simplistic in construc-
tion. The fourth approach we consider was proposed in Garman and Klass (1980) for estimating
the volatility constant of a diffusion process for security prices by the use of OHLC-prices. They
derive six different estimators for , one of which we use, given by
vxt =
a
((pot pct1) 100)2f
+ (1 a) ((pht plt) 100)2
(1 f)4log2 , (15)
with f is the portion of day where no trading takes place; that is, the time between closing at day
t 1 and opening of day t. For the empirical applications below, f = 0.6 is used, which turnedout to be a good approximation for the different values of f for the different indices. Finally,
a = 0.17 is a constant which minimizes vxt , independent of f (this is the 23 estimator of Garman
and Klass, 1980).
The five volatility approximations described above are used to judge the forecasting accuracy
of the two competing GARCH models. For both models, 1000 one-step-ahead volatility forecasts
are constructed for all five indices, with an estimation period of 250 days (about one year of
trading data) for both models. To make the comparison more realistic, the model parameters
were updated for each t = T 1000, . . . , T 1. Note that such an exercise will extremely time-consuming when using the AO-correction method.
-
8/3/2019 Garch Vola Measurepdf
18/26
4 OUT-OF-SAMPLE FORECASTING RESULTS 16
The three popular error criteria, mean squared error (MSE), median squared error (MedSE)
and the mean absolute percentage error (MAPE), were used to judge the performance. In addi-
tion, we ran a regression analogous to that of Andersen and Bollerslev (1998) of the form
vf = 0 + 1 vf, f = 1, . . . , 1000, (16)
with vf being the realized volatility measured by one of the volatility proxies and vf is the
one-step-ahead prediction of volatility of one of the models. Note that vf equals f for the
AO-GARCH model (with normal innovations) and vf = f
E[(Z E[z])2] for the GAt dis-tribution. For a model which provides correct one-step volatility predictions, the parameters 0
and 1 are expected to be 0 and 1, respectively.
The results for the standard volatility proxy (11), as well as for the mixed and the extreme
value approximations are given in Tables 3 to 5. (The results for the intra- and inter-day approx-
imations are similar to the results obtained by the mixed volatility approximation, so we do not
present them here to save space.) Beginning with the standard volatility proxy vst in (11), we see
that, comparatively speaking, it is by far the worst: The low R2 value is in agreement with the
results reported in Andersen and Bollerslev (1998) and is the lowest R2 of all competing proxies
across both models and all indices. Similarly, vst exhibits the largest error criteria values; in the
case of the MAPE, it is on average 10 times higher than for the vmt in (14) and about 15 times
higher as the MAPE for the vxt in (15).
Table 3: Forecasting results for standard volatility approximation (11)a
DAX(AO/GAt)
AEX(AO/GAt)
CAC40(AO/GAt)
MIBTEL(AO/GAt)
FTSE100(AO/GAt)
0 0.341(0.145)
/ 0.083(0.104)
0.210(0.121)
/ 0.015(0.081)
0.283(0.234)
/ 0.113(0.130)
0.331(0.141)
/ 0.074(0.082)
0.073(0.142)
/ 0.005(0.107)
1 1.144(0.102)
/ 0.745(0.063)
1.119(0.099)
/ 0.776(0.053)
1.147(0.185)
/ 0.719(0.087)
1.196(0.112)
/ 0.739(0.053)
0.834(0.131)
/ 0.790(0.085)
R2 0.112/ 0.121 0.113/ 0.174 0.037/ 0.064 0.102/ 0.163 0.039/ 0.080
MSE 1.026/ 1.110 0.896/ 0.934 0.917/ 0.979 1.054/ 1.086 0.624/ 0.659MedSE 0.488/ 0.557 0.324/ 0.409 0.448/ 0.556 0.424/ 0.472 0.343/ 0.399
MAPE 550.1/ 583.0 417.7/ 468.6 458.1/ 530.8 411.3/ 451.9 485.0/ 538.9a For every index there are two columns, labeled AO for the AO-GARCH model and GAt for the GAt-GARCHmodel. Values 0 and 1 are the estimated regression coefficients (standard errors in parentheses).
The most accurate proxy for realized volatility seems to be the extreme value estimator from
Garman and Klass (1980), vxt , in (15). It results in the highest R2, the lowest MAPE and, with
two exceptions, the lowest MedSE of all proxies. For the MSE, it took second place, with a slight
preference being given to the volatility approximations given in equations (12) to (14). Thus, the
-
8/3/2019 Garch Vola Measurepdf
19/26
4 OUT-OF-SAMPLE FORECASTING RESULTS 17
Table 4: Forecasting results for mixed volatility approximation (14)a
DAX(AO/GAt)
AEX(AO/GAt)
CAC40(AO/GAt)
MIBTEL(AO/GAt)
FTSE100(AO/GAt)
0 0.363(0.095)
/ 0.090(0.067)
0.260(0.076)
/ 0.008(0.048)
0.610(0.146)
/ 0.123(0.077)
0.341(0.089)
/ 0.135(0.050)
0.024(0.094)
/ 0.045(0.069)
1 1.249(0.067)
/ 0.820(0.041)
1.209(0.063)
/ 0.820(0.032)
1.494(0.116)
/ 0.955(0.051)
1.241(0.071)
/ 0.727(0.032)
0.985(0.087)
/ 0.840(0.055)
R2 0.260/ 0.286 0.272/ 0.401 0.143/ 0.258 0.235/ 0.337 0.114/ 0.190
MSE 0.436/ 0.459 0.358/ 0.361 0.362/ 0.344 0.423/ 0.452 0.269/ 0.271
MedSE 0.151/ 0.210 0.099/ 0.172 0.119/ 0.180 0.141/ 0.186 0.106/ 0.142
MAPE 42.2/ 49.3 42.5/ 53.1 38.6/ 46.4 49.3/ 57.2 43.0/ 50.7a For every index there are two columns, labeled AO for the AO-GARCH model and GAt for the GAt-GARCHmodel. Values 0 and 1 are the regression coefficients (standard errors in parentheses).
Table 5: Forecasting results for extreme value volatility approximation (15)a
DAX(AO/GAt)
AEX(AO/GAt)
CAC40(AO/GAt)
MIBTEL(AO/GAt)
FTSE100(AO/GAt)
0 0.450(0.116)
/ 0.150(0.082)
0.336(0.093)
/ 0.013(0.058)
0.854(0.181)
/ 0.177(0.094)
0.450(0.104)
/ 0.193(0.058)
0.006(0.112)
/ 0.101(0.082)
1 1.650(0.081)
/ 1.081(0.050)
1.587(0.077)
/ 1.078(0.038)
2.026(0.143)
/ 1.284(0.063)
1.622(0.083)
/ 0.936(0.037)
1.355(0.103)
/ 1.102(0.065)
R2 0.292/ 0.321 0.300/ 0.444 0.168/ 0.297 0.279/ 0.388 0.149/ 0.225
MSE 0.882/ 0.688 0.688/ 0.439 0.759/ 0.523 0.696/ 0.492 0.529/ 0.394
MedSE 0.152/ 0.146 0.091/ 0.095 0.131/ 0.109 0.112/ 0.110 0.125/ 0.106
MAPE 30.3/ 30.5 29.8/ 30.3 28.4/ 27.3 32.1/ 32.5 31.3/ 29.9a For every index there are two columns, labeled AO for the AO-GARCH model and GAt for the GAt-GARCHmodel. Values 0 and 1 are the regression coefficients (standard errors in parentheses).
simple estimators vat , vet and v
mt proposed herein are seen to be competitive with v
xt from Garman
and Klass (1980). (We elected not to report the results based on the most efficient extreme value
estimator proposed by Garman and Klass (1980) because the one used here has a straightforward
interpretation, is related to the volatility estimator of Parkinson (1980), and leads to qualitatively
similar results.)
While interesting in themselves, the different volatility approximations are not our main goal,
but rather to compare the volatility forecasts of the competing GARCH models. The plots for
volatility forecasts for both models as well as the realized volatility, measured by the extreme
value estimator, vxt , are given in Figure 6. We see that the AO-GARCH model is not able
to predict higher fluctuations of the realized volatility. Its volatility forecasts are very smooth
and slowly varying. While this might be appealing from an applied point of view, in the sense
-
8/3/2019 Garch Vola Measurepdf
20/26
4 OUT-OF-SAMPLE FORECASTING RESULTS 18
0 100 200 300 400 500 600 700 800 900 10000
2
4
6
8
10
DAX
0 100 200 300 400 500 600 700 800 900 10000
2
4
6
8
AEX
0 100 200 300 400 500 600 700 800 900 10000
2
4
6
8
CAC40
0 100 200 300 400 500 600 700 800 900 10000
2
4
6
8
10
MIBTEL
0 100 200 300 400 500 600 700 800 900 10000
2
4
6
8
FTSE100
Underlying grey line: realized volatiliy, measured by the extreme value approximation, vx
Dashed (black) line: volatility forecast of the AO-GARCH modelSolid (blue) line: volatility forecast of the GAt-GARCH model
Figure 6: Realized Volatility vs. Volatility Forecasts
that the frequency of portfolio weight adjustment (and the associated costs) are lessened, the
predictions are simply too poor and would not pass the tests required by the Basle Accord.
The behavior of the forecasts is what one would expect based on the results of the runs tests
in Section 3. In particular, as the extreme values are not randomly dispersed throughout the
-
8/3/2019 Garch Vola Measurepdf
21/26
4 OUT-OF-SAMPLE FORECASTING RESULTS 19
data, they are informative for risk prediction, but the method removes all such extraordinary
peaks of the original series and labels them as outliers. In contrast, the volatility forecasts of the
GAt-GARCH model are able to capture the higher fluctuations of the volatility process and are,
thus, considerably more accurate.
With regard to the error criteria, the conclusion as to which model is best in predicting future
volatility depends on the chosen volatility approximation. For the standard approximation in
(11) as well as for the intra-day measure in (12), the inter-day measure in (13) and the mixed
volatility measure in (14), the AO-GARCH model yields smaller error criteria values for all
indices (with two exceptions for the intra and one exception for the mixed approximation). For
the extreme value approximation, vxt in (15), we draw opposite conclusions. The GAt-GARCH
model is preferred for all indices by the MSE, for 4 out of 5 indices by the MedSE, and for
2 out of 5 indices by the MAPE. The conclusion is, however, unequivocal when based on the
regression results: Irrespective of the chosen volatility approximation, the R2 value is highest for
the GAt-GARCH model for all indices.
Furthermore, the GAt-GARCH model not only leads to higher R2 values but also the number
of regression coefficients which are insignificantly different from their expected values, are higher
for all the volatility measures, with the exception of the standard approximation, vst . Based on the
most accurate volatility approximation, vxt , there are 4 (3) of the 0 (1) estimates insignificantly
different for the GAt-GARCH model, whereas for the AO-GARCH model, all parameter estimates
are found to be significantly in violation of the null hypothesis.
4.2 VaR-Forecasts
Besides predicting future volatility, GARCH models can be, and are, used to measure the down-
side risk inherent in a given financial position, i.e., the Value-at-Risk (VaR) of a financial position.
The VaR is defined to be the maximum amount of loss of a financial position that may occur
within a given number of days for a given probability . For our purposes, the VaR is given by
the cut-off point for which the next days predicted return will not fall with probability 1 .It is obtained by inverting the equation
Pr (rt+1 t+1()) = . (17)
When using a GARCH model for predicting the VaR, the calculation is done by using the condi-
tional density ft+1|t(rt+1|; rt, rt1, . . . ), where refers to the vector of estimated model param-eters, and f() is given by the assumed error distribution.
As for the volatility predictions above, 1000 one-step-ahead forecasts of the conditional re-
turn densities for the five indices are evaluated for both models under investigation. Again, an
-
8/3/2019 Garch Vola Measurepdf
22/26
5 CONCLUSIONS 20
estimation period of 250 days for both models is used and the model parameters are updated for
every forecast. To compare the accuracy of the VaR predictions, the empirical tail probabilities
=
1
1000
T1t=T1000
I
[rt+1 < t+1()] (18)are calculated for three different probability levels, = 0.01, = 0.025 and = 0.05, where I
is the indicator function. When based on the true distribution at time t + 1, the empirical tail
probability will coincide, in terms of the long-run average, with the given probability level. If the
observed probability is higher (less) than , then the model tends to underestimate (overestimate)
the risk for the index returns, i.e., the implied absolute -values tend to be too small (large). The
results for the different indices and probabilities are given in Table 6.
As expected from the results of forecasting volatility, the AO-GARCH model is not able
to measure the VaR correctly. By detecting and correcting the time series for anomalous
observations and, thus, removing the information contained in them, the AO-GARCH model
suffers when attempting to predict the tails of the distribution. For all three probability levels,
the AO-GARCH model underestimates the VaR of all time series by a factor of 1 .7 for the 5%
level and a factor of 3.3 for the 1% level. On the other hand, the GAt-GARCH model works quite
well, in agreement with the findings in Mittnik and Paolella (2000). For all probability levels, the
empirical tail probabilities are close to the given probability levels, i.e., the conditional modeling
of volatility, in conjunction with a more flexible return distribution, enhances the ability to draw
conclusions about the future tail probabilities of returns.
Table 6: Empirical tail probabilitiesa
5%(AO/GAt)
2.5%(AO/GAt)
1%(AO/GAt)
DAX 9.0 / 5.4 5.3 / 2.5 2.5 / 0.7
AEX 8.4 / 4.5 4.7 / 2.3 3.0 / 1.0
CAC40 8.3 / 5.2 5.5 / 2.7 3.2 / 0.7
MIBTEL 9.4 / 6.4 5.7 / 3.3 2.7 / 1.2FTSE100 8.8 / 5.2 6.1 / 2.4 3.3 / 1.0
a Observed tail probabilities from (17) multiplied by 100. For acorrectly specified model, we expect .
5 Conclusions
We compare two possible ways to accommodate the well-documented stylized fact of non-normal,
heteroskedastic financial returns for the purpose of risk prediction. One way involves systemati-
-
8/3/2019 Garch Vola Measurepdf
23/26
REFERENCES 21
cally removing aberrant values; the other is to use distributions which allow for explicit modeling
of the observed features of the data. The differences between both ways can be summarized
as follows. The AO-GARCH model is successful in identifying and correcting the original time
series for additive outliers. For all the time series investigated in this paper, the occurrence of the
outliers reveals strong systematic patterns, against the assumption that outliers occur at random.
Instead of labeling them as outliers, such aberrant values appear to be chronic in financial return
series and form an important part of the analysis and prediction of risk.
The volatility forecasts obtained from the two competing models are very different, although
it is hard to judge which model leads to better forecasts when judging is based on the standard
volatility approximation and standard error criteria. The results are far clearer if more efficient
volatility approximations are used, as proposed herein.
References
Andersen, T. G. and Bollerslev, T. (1998). Answering the Skeptics: Yes, Standard Volatility
Models Do Provide Accurate Forecasts. International Economic Review, 39(4), 885905.
Andersen, T. G., Bollerslev, T., Diebold, F. X., and Labys, P. (2003). Modeling and Forecasting
Realized Volatility. Econometrica, 71(2), 579625.
Broda, S. A., Haas, M., Krause, J., Paolella, M. S., and Steude, S.-C. (2011). Stable Mixture
GARCH Models. Swiss Finance Institute Occasional Paper Series No. 11 39.
Broda, S. A. and Paolella, M. S. (2009). CHICAGO: A Fast and Accurate Method for Portfolio
Risk Calculation. Journal of Financial Econometrics, 1, 125.
Carnero, M. A., Pena, D., and Ruiz, E. (2001). Outliers and Conditional Autoregressive Het-
eroscedasticity in Time Series. Revista Estadistica, 53, 143213.
Charles, A. and Darne, O. (2005a). Outliers and GARCH Models in Daily Financial Data.Economics Letters, 86, 347352.
Charles, A. and Darne, O. (2005b). Relevance of Detecting Outliers in GARCH Models for
Modelling and Forecasting Financial Data. Finance: Revue de Lassociation Francaise de
Finance, 26(1), 3371.
Chen, C. and Liu, L.-M. (1993). Joint Estimation of Model Parameters and Outlier Effects in
Time Series. Journal of American Statistical Association, 88(421), 284297.
-
8/3/2019 Garch Vola Measurepdf
24/26
REFERENCES 22
Doganoglu, T., Hartz, C., and Mittnik, S. (2007). Portfolio Optimization when Risk Factors are
Conditionally Varying and Heavy Tailed. Computational Economics, 29(3-4), 333354.
Frances, P. H. and Ghijsels, H. (1999). Additive Outliers, GARCH and Forecasting Volatility.
International Journal of Forecasting, 15, 19.
Garman, M. B. and Klass, M. J. (1980). On the Estimation of Security Price Volatilities from
Historical Data. Journal of Business, 53(1), 6778.
Mittnik, S. and Paolella, M. S. (2000). Conditional Density and Value-at-Risk Prediction of Asian
Currency Exchange Rates. Journal of Forecasting, 19, 313333.
Paolella, M. S. (2006). Fundamental Probability: A Computational Approach. Chichester.
Paolella, M. S. (2007). Intermediate Probability: A Computational Approach. Chichester.
Paolella, M. S. and Polak, P. (2011). MARC-MARS: Modeling Asset Returns via Conditional
Multivariate Asymmetric Regime-Switching. Submitted.
Park, B.-J. (2002). An Outlier Robust GARCH Model and Forecasting Volatility of Exchange
Rate Returns. Journal of Forecasting, 21, 381393.
Parkinson, M. (1980). The Extreme Value Method for Estimating the Variance of the Rate of
Return. Journal of Business, 53(1), 6165.
Wiggins, J. B. (1991). Empirical Tests of the Bias and Efficiency of the Extreme-Value Variance
Estimator for Common Stocks. Journal of Business, 64(3), 417432.
Yang, M. (2011). Volatility feedback and risk premium in garch models with generalized hyper-
bolic distributions. Studies in Nonlinear Dynamics & Econometrics, 15(3).
-
8/3/2019 Garch Vola Measurepdf
25/26
c/o University of Geneva40 bd du Pont d'Arve
1211 Geneva 4Switzerland
T +41 22 3798471F +41 22 379 82 [email protected]
www.SwissFinanceInstitute.ch
-
8/3/2019 Garch Vola Measurepdf
26/26
c/o University of Geneva40 bd du Pont d'Arve
1211 Geneva 4Switzerland
T +41 22 379 84 71