momentum in an icapm world · 2019. 12. 30. · momentum in an icapm world december 30, 2019...
TRANSCRIPT
Momentum in an ICAPM World
December 30, 2019
ABSTRACT
I empirically test the ability of the ICAPM to explain the momentum anomaly in equity markets.
Using the Epstein and Zin formulation of Campbell and Vuolteenaho (2004) and Campbell et
al (2017) as inspiration, this paper derives an ICAPM representation of the Stochastic Discount
Factor (SDF) which implies that the momentum risk premium has three main components: cash
flow, discount rate and volatility risk.Winner stocks are those that have higher cash flow betas and
lower discount rate and volatility betas whereas loser stocks are those for whom the converse is
true. Thus the momentum strategy can be seen as a bet against a stocks cash flow, discount rate
and volatility betas. This no- arbitrage model of momentum fits the data reasonably well, though
the central prediction that winner stocks have higher cash flow betas and lower discount rate and
volatility betas is not empirically supported.
1
1. Introduction
The momentum effect, the tendency of short run returns to predict future returns, has mystified
the asset pricing anomalies literature for over three decades. Traditional factor models such as the
CAPM or the Fama-French 3 factor model (FF) cannot explain the statistically significant alphas
attached to the momentum strategy (Fama and French, 1996). Introducing ad-hoc macroeconomic
variables into this multifactor framework adds little predictive power for these factor models as
momentum alphas are still largely unexplained (Chen, Roll, and Ross 1986; Li, Vassalou, and Xing
2006; Liu and Zhang 2008).
Whilst it is tempting to do so, one should not interpret the failure of traditional factor models
to price momentum as evidence against the efficient markets hypothesis. Rather, the failure can
largely be explained by a misspecification in the factor structure. Traditional reduced form factor
models such as the CAPM and FF assume that equity pricing is symmetric, that is asset prices
respond identically to positive and negative shocks of the same magnitude. Yet recent evidence
suggests that the time varying equity risk premium is largely asymmetric, with downside risk priced
more heavily by investors than upside risk. In fact, Lettau, Maggiori and Weber (2014) find that the
Downside Risk CAPM (DR CAPM), which allows time variation in market betas, has far greater
predictive power than the CAPM or FF for the cross-section of equity returns, especially in bearish
markets. This asymmetric element to equity pricing is especially important for momentum, given
the documented crash risk in momentum strategy returns across bad states of nature (Daniel and
Moskowitz, 2016).
This paper seeks to address this asymmetric pricing issue by considering an ICAPM SDF model
based on Epstein-Zin (EZ) recursive preferences which assumes that the utility function is non-
separable across states of nature. This allows news about future bad states to influence investor’s
utility today and consequently asset prices. Using EZ preferences, our stylised no-arbitrage model
is able to identify three main determinants of the SDF: cash flow, discount rate and volatility risk.
The SDF positively covaries with cash flow risk whereas discount rate and volatility risk attract a
negative risk premium.
This implies that momentum is a bet on the cash flow, discount rate and volatility betas
of a stock . ’Winner’ stocks are those with higher cash flow betas and lower discount rate and
volatility betas whereas ’loser’ stocks are those for whom the converse is true. I follow Campbell
and Vuolteenaho (2004) in making the assumption that cash flow and volatility risk attract greater
risk prices than discount rate risk. I make this assumption because discount rate risk is transitory:
higher discount rates today may reduce contemporaneous wealth but will improve investment op-
portunities in the future. On the other hand, shocks to cash flows and volatility have permanent
effects on investor wealth meaning that risk averse investors demand greater compensation for these
1
components of the long run risk equation (Campbell and Vuolteenaho, 2004).
Intuitively, the ’winner’ portfolio loads heavily on cash flow risk because it is largely comprised
of small and value stocks which are highly sensitive to news about future cash flows (Campbell
and Vuolteenaho, 2004). This is largely because small and value stocks generate cash flows in the
immediate future and are thus less sensitive to discount rates, leaving cash flow risk as the main
source of variation. In contrast, the ’loser’ portfolio loads heavily on discount rate risk because
large and growth stocks generate more distant cash flows that are more sensitive to market-wide
discount rates (Campbell and Vuolteenaho, 2004). Thus momentum is long cash flow risk but short
discount rate risk.
With regards to volatility risk, the momentum strategy can be seen as a strategy that is short
volatility because of the option-like behaviour of the losers. In bear markets, the loser portfolio
becomes highly levered and in accordance to Merton (1974) behaves like a deep out of the money
call option on the market portfolio. When markets correct, the steep convexity of the loser portfolio
results in the losers significantly outperforming the winners, initiating a ’momentum crash’ (Daniel
and Moskowitz, 2016). Thus, momentum is a bet against volatility with the volatility risk price
being negative in our framework.
I find that this long run risk formulation of momentum anomaly has large explanatory power for
the returns to ten momentum sorted portfolios. The adjusted R2 ranges between 60-90% and the
signs of the cash flow, discount rate and volatility risk factors are highly signiifcant at a 1% level
and conform to the basic predictions of our model, with cash flow risk attracting a positive risk pre-
mium and discount rate and volatility risk attracting a negative risk premium. However I find that
the magnitudes of the coefficients are not consistent with our model: the loser portfolio attracts
higher cash flow, discount rate and volatility betas relative to the winner portfolio. One possible
explanation that may explain the failure of our model predictions may be that higher moments
of cash flow, discount rate and volatility risk are important in pricing momentum returns. Kalev,
Saxena and Zolotoy (2017) find that this is certainly the case for the size and the value anomalies,
indicating that an avenue for future research would be to extend this framework to include these
higher moments. Given recent evidence that higher moments of the return distribution do matter
for momentum (Barroso and Santa Clara, 2014), there is great potential for such an avenue of
research to yield critical results for the pricing of the momentum anomaly.
The remainder of this paper is organised as follows. Sections 2 and 3 introduce my theoretical
model and discuss the key implications for the momentum risk premium. Section 4 discusses my
empirical methodology and estimation results. Section 5 concludes this paper.
2
2. Model
My theoretical model assumes a complete markets framework where a unique stochastic discount
factor (SDF) mt+1 exists that prices all assets in the universe. Specifically, I use the SDF framework
of Campbell et al (2017) which assumes that the representative agent has Epstein and Zin (EZ)
preferences that takes the following form:
mt+1 = θ ln δ − θ
ψ∆ct+1 + (θ − 1)rt+1 (1)
∆ct+1 refers to the log value of reinvested wealth, rt+1 refers to the return on the aggregate wealth
portfolio Wt which is the market value of the consumption stream owned by the representative
agent. δ refers to the subjective time preference parameter. ψ refers to the elasticity of intertem-
poral substitution (EIS) and θ= 1−γ1− 1
ψ
where γ is the relative risk aversion parameter.
By introducing separate parameters for relative risk aversion and EIS, long run consumption risk is
allowed to influence investor’s contemporaneous utility as per EZ preferences through the parame-
ter rt+1. So long as γ< 1ψ , the investor is concerned about both short run and long run consumption
growth risk.
To avoid empirical issues associated with consumption data, Campbell (1993) demonstrates that
one can apply a log-linear transformation of equation 1 to substitute out consumption and redefine
the SDF in terms of news to future cash flows Nc,t+1, news to future discount rates Nd,t+1 and news
to future volatility Nv,t+1. Campbell and Vuolteenaho (2004) and Campbell et al (2017) use this
transformation alongside the Campbell-Shiller return decomposition to derive a process for shocks
to mt+1 as follows:
mt+1 − Etmt+1 = −γNc,t+1 +Nd,t+1 + φNv,t+1 (2)
where:
Nc,t+1 = (Et+1 − Et)
∞∑s=1
ρsct+1+s
Nd,t+1 = (Et+1 − Et)∞∑s=1
ρsrt+1+s
Nv,t+1 = (Et+1 − Et)
∞∑s=1
ρsvart+s(mt+1+srt+1+s) (3)
In this stylised framework, the risk price for Nc,t+1 is γ times greater than the risk price for Nd,t+1.
This follows from the fact that cash flow shocks permanently reduce wealth whereas positive dis-
count rate shocks are transitory and are partially offset by higher future returns (Campbell and
3
Vuolteenaho, 2004). In a similar vein, the risk price for Nv,t+1 is φ times greater than the risk price
for Nd,t+1. The parameter ρ is a discount coefficient.
Substituting equation 2 into equation 1 yields a stochastic process for the expected log SDF Etmt+1 :
Etmt+1 = κ+ Et[−γ∞∑s=0
ρs∆ct+1+s +
∞∑s=0
ρsrt+1+s + φ
∞∑s=1
ρsvart+s(mt+1+srt+1+s)] (4)
Finally I arrive at my specification for mt+1 by substituting equation 4 into equation 2 which yields
a stochastic process that depends on both expectations and shocks to cash flows, discount rates
and volatility:
mt+1 = −γNc,t+1 +Nd,t+1 + φNv,t+1 + κ − γEc,t + Ed,t + φEv,t (5)
where:
Ec,t = Et[∞∑s=1
ρsct+1+s]
Ed,t = Et[
∞∑s=1
ρsrt+1+s]
Ev,t = Et[∞∑s=1
ρsvart+s(mt+1+srt+1+s)] (6)
Equation 5 represents our stylised theoretical model of the SDF. In a world of no-arbitrage, mt+1
defined by equation 5 is such that the Asset Pricing Euler Equation must hold for any asset i:
Et(mt+1ri,t+1) = 1 (7)
3. Model Implications for Risk Premia
Equation 5 implies that in a world where investors have EZ preferences, there are three main
drivers of the SDF mt+1: cash flow, discount rate and volatility risk. γ and φ represent the risk
loadings that the SDF attaches to cash flow and volatility risk, whereas the loading for the discount
rate component is normalised to 1. Thus, the SDF attaches greater risk loading to cash flow and
volatility risk by a factor of γ and φ respectively.
Cash flow risk components Nc,t+1 and Ec,t attract a negative coefficient -γ in our SDF model. This
is because when there future cash flows are expected to be low, consumption today decreases which
results in a lower intertemporal marginal rate of substitution (IMRS) and consequently a higher
SDF mt+1. This implies that assets that are positively correlated with Nc,t+1 and Ec,t are
4
risky and hence demand a risk premium .
Conversely, volatility risk components Nv,t+1 and Ev,t attract a positive coefficient φ in the SDF
model. This represents the volatility risk premium: Risk averse agents have a preference for con-
sumption smoothing: that is they would like to moderate their consumption over multiple periods.
This implies that risk averse investors have a preference for assets that are expected to have lower
future volatility, enabling consumption smoothing. Thus the volatility terms attract a positive risk
price φ. This implies that assets that are negatively correlated with Nv,t+1 and Ev,t are
risky and hence demand a risk premium
Finally, the discount rate components Nd,t+1 and Ed,t also attracts a positive coefficient, albeit it
is smaller than the volatility components by a factor of φ. This is because discount rate risk is
transitory: whilst higher discount rates will reduce wealth today, they improve investment oppor-
tunities for the future allowing for a partial offset for that contemporaneous wealth loss. If long
run risk concerns dominate an investors risk aversion, as they should under an EZ framework, then
assets that positively covary with discount rate risk are valuable. In other words assets that are
negatively correlated with Nd,t+1 and Ed,t are risky and hence demand a risk premium .
These 3 observations of our stylised model lead us to Propositions 1-3:
Proposition 1: If a stock has a higher cash flow beta, that is the stock positively co-
varies with Nc,t+1 and Ec,t, they should have higher returns on average than stocks for
which the converse is true
Proposition 2: If a stock has a higher discount rate beta, that is the stock positively
covaries with Nd,t+1 and Ed,t, they should have lower returns on average than stocks
for which the converse is true
Proposition 3: If a stock has a higher volatility beta, that is the stock positively co-
varies with Nv,t+1 and Ev,t, they should have lower returns on average than stocks for
which the converse is true
I discuss the implications of this model for the momentum anomaly in the next section.
3.1 Model Implications for Momentum
Since propositions 1-3 imply that stocks with higher cash flow betas and lower discount rate and
lower volatility betas outperform on average, we can motivate the following propositions about the
momentum portfolio:
5
Proposition 4: If Proposition 1 holds, momentum returns are increasing in cash
flow beta differentials between winner and loser stocks.
Proposition 5: If Proposition 2 holds, momentum returns are decreasing in dis-
count rate beta differentials between winner and loser stocks.
Proposition 6: If Proposition 3 holds, momentum returns are decreasing in volatil-
ity beta differentials between winner and loser stocks.
Propositions 4-6 imply that the momentum strategy invests in stocks that have higher cash flow
betas and lower discount rate and volatility betas and short stocks for whom the converse is true.
Thus momentum is a bet on a stock’s cash flow, discount rate and volatility betas. I will empirically
test our theory of momentum using 10 US momentum sorted portfolios. I outline my empirical
methodology next.
3.2 Empirical Framework
To test the prediction of our model that winner stocks have higher cash flow betas and lower dis-
count rate and volatility betas than loser stocks, it is necessary to construct estimates of these betas
for both winner and loser stocks. I do this using three main measurements. Firstly, I estimate the
betas using first differences multivariate regressions for the 10 momentum sorted portfolios. Sec-
ondly, I estimate a lag adjusted beta estimate that accounts for stale prices. Finally, I estimate the
betas by GMM system, using moment conditions from the Asset Pricing Euler Equation derived
under conditional lognormality.
Multivariate Regression Betas
The multivariate regression approach is the most basic approach to estimating our betas of interest.
I run the follow regression for all 10 momentum sorted portfolios:
∆ri,t+1 = α+βi,NC∆Nc,t+1+βi,ND∆Nd,t+1+βi,NV ∆Nv,t+1+βi,EC∆Ec,t+βi,ED∆Ed,t+βi,EV ∆Ev,t
(8)
Here ri,t+1 is the monthly return on momentum portfolio i at month t+ 1. ∆ is the first difference
operator. Using the coefficients as our estimates of the beta, it is easy to ascertain whether or not
the winner portfolio betas are larger than the loser portfolios.
I use a first differences approach for regression to control for persistence in the news terms Nc,t+1,
Nd,t+1 and Nv,t+1. As we will detail shortly, we estimate these terms using a VAR system com-
6
prising of highly persistent state variables. Thus these news terms are also likely to be persistent.
Hence I take first differences to ensure an estimation model that has valid statistical inference. I
then scale the betas by a factor of 100 to account for the scale differences between Nc,t+1, Nd,t+1
and Nv,t+1 and the dependent variable ri,t+1.
Lag Adjusted Beta
I define our lag adjusted betas in accordance with Campbell and Vuolteenaho (2004). The shock
betas are defined as follows:
βi,ND =ˆCov(ri,t, NDt)
ˆV ar(NCt, NDt)+
ˆCov(ri,t, NDt−1)
ˆV ar(NCt, NDt)
βi,NC =ˆCov(ri,t, NCt)
ˆV ar(NCt, NDt)+
ˆCov(ri,t, NCt−1)
ˆV ar(NCt, NDt)
βi,NV =ˆCov(ri,t, NV t)
ˆV ar(NCt, NDt)+
ˆCov(ri,t, NV t−1)
ˆV ar(NCt, NDt)
(9)
Similarly I define the expectation betas as:
βi,ED =ˆCov(ri,t, EDt−1)
ˆV ar(NCt, NDt)
βi,EC =ˆCov(ri,t, ECt−1)
ˆV ar(NCt, NDt)
βi,EV =ˆCov(ri,t, EV t−1)
ˆV ar(NCt, NDt)
(10)
These beta measurements differ from standard definitions of beta because of very special features
of the data. We are using data from the 10 US momentum sorted portfolios available from the
Kenneth French data library spans from January 1929 to December 2010. Since some of the stocks
in these momentum portfolios were not traded frequently in the early parts of the sample, it is
possible that the presence of stale prices may give the misleading perception that our news terms
are apriori causing portfolio returns (Scholes and Williams 1977; Dimson 1979). To remove this
concern we follow Campbell and Vuolteenaho (2004) in adding a lagged component to the beta
measurement of βi,ND, βi,NC and βi,NV . Since our expectation terms for time t are already condi-
tional on time t− 1, we need not make a similar adjustment for βi,ED, βi,EC and βi,EV .
Finally Campbell and Vuolteenaho (2004) show that under mild assumptions, ˆV ar(NCF,t,−NDR,t)
7
is equivalent to ˆV ar(reM,t − Et−1reM,t), which is the denominator in standard definitions of beta.
Thus the denominator of the beta estimates in equations 9 and 10 are identical to the standard
CAPM formulation.
GMM
The derivation of the empirical GMM moment condition comes from Campbell et al (2017). I start
from the Asset Pricing Euler Equation which is defined as follows:
Et(Mt+1Ri,t+1) = 1 (11)
Here Mt+1 is the Stochastic Discount Factor (SDF) and Ri,t+1 is the return on asset i at time
t+1. Assuming that Mt+1 and Ri,t+1 are jointly conditionally lognormal, we have that Mt+1Ri,t+1
is also conditionally lognormal. Using the property that for any lognormal random variable X
log(E(X))=E(logX) + 12var(logX), we have that:
Et(mt+1 + ri,t+1) +1
2V art(mt+1 + ri,t+1) = 0 (12)
mt+1 and rei,t+1 are the log SDF and log returns respectively. We also know by Jensen’s inequality:
Et(ri,t+1) +1
2V art(ri,t+1) ≈ (EtRi,t+1 − 1) (13)
Substituting equation 15 into 14, we arrive at the moment condition for any asset i:
Et(mt+1) + Et(Ri,t+1) − 1 + Et(mt+1ri,t+1) − Et(mt+1)Et(ri,t+1) (14)
If this moment condition holds for asset i, it must also hold for a reference asset j:
Et(mt+1) + Et(Rj,t+1) − 1 + Et(mt+1rj,t+1) − Et(mt+1)Et(rj,t+1) (15)
Subtracting 17 from 16 leads us to our general GMM moment condition:
Et[Ri,t+1 −Rj,t+1 − (ri,t+1 − rj,t+1)(mt+1 − Etmt+1)] (16)
Plugging equation 2 into equation 17 yields our testable GMM moment condition:
Et[Ri,t+1 −Rj,t+1 − (ri,t+1 − rj,t+1)(−γNc,t+1 +Nd,t+1 + φNv,t+1)] (17)
Again, the risk price for Nd,t+1 is normalised to 1. Using equation 17 as our final moment condition,
I use GMM to estimate our beta estimates γ, φ and τ . I follow Campbell et al (2017) in using the
CRSP value weighted index as the reference asset Rr,t+1. The 10 momentum sorted portfolios are
used as the test assets in the GMM estimation.
8
4. News Estimation
In order to estimate the betas using the approaches outlined in the previous section, the news terms
Nc,t+1, Nd,t+1, Nv,t+1, Ec,t, Ed,t and Ev,t must be estimated. This is done via a VAR system that
I detail below.
4.1. VAR Methodology
I follow Campbell et al (2014) in using the following VAR system to estimate the news terms:
zt+1 = α + Γ(zt − z) + σtut+1 (18)
zt+1 is an m x 1 vector of state variables with excess market returns rt+1 as the first element. α and
ut+1 are m x 1 vectors of constant parameters and residuals respectively. z and Γ are m x 1 the m
x m vector of constant parameter estimates. Finally σt is the conditional variance of market returns.
After estimating the parameters of the VAR system, I linearly map Nc,t+1, Nd,t+1 and Nv,t+1 to
the shock vector ut. Our linear mapping process is captured by the expressions below:
Nc,t+1 = (e1T + e1Tλσt)ut+1
Nd,t+1 = e1Tλσtut+1
Nv,t+1 = e2Tλσtut+1 (19)
e1 is an m×m vector with 1 as the first element and 0 for all other elements. e2 is an m x 1 vector
with 1 tied to the element linked with volatility and 0 for all other elements. The term e1T + e1Tλ
represents the contribution that shocks to the state variables play in driving Nc,t+1. e1Tλ measures
the contribution of state variable shocks to Nd,t+1. Finally, e2λ measures the contribution of state
variable shocks to Nv,t+1. We multiply each of the news terms by the scalar term σt to account for
different error variances across the state variables in the VAR system. I outline how σt is estimated
later in this section.
λ is defined as:
λ = (ρΓ(I − ρΓ)−1 (20)
where ρ = 0.95112 and I is the identity matrix. Following Campbell and Vuolteenaho (2004) I use
this default value for ρ on the assumption that the annual consumption-wealth ratio is 5.2% for
the representative investor.
I adopt a similar linear mapping process to estimate Ec,t, Ed,t and Ev,t, mapping them to the lagged
9
state variables:
Ec,t = (e1T + e1Tλσt)Γzt
Ed,t = (e1Tλσt)Γzt
Ev,t = (e2T + e2Tλσt)Γzt (21)
4.2. VAR State Variables
To operationalise our VAR approach we must choose specific state variables to enter our state
vector. We follow Campbell et al (2017) in choosing the following six state variables: excess mar-
ket returns (ReM,t+1), equity volatility (σt), term spread (TSt+1), risk free rate (RFt+1), price to
earnings (PEt+1) and the small stock value spread (V St+1). I define these variables in accordance
with Campbell et al (2017). Sampling information about these variables are detailed below in table
1:
Table 1: State Variable Sample Information
State Variables Source Sample Period Interval
ReM,t+1 CRSP Database January 1929 - Dec 2010 Monthly
σt Kenneth French Data Library January 1929 - Dec 2010 MonthlyTSt+1 CRSP Database January 1929 - Dec 2010 MonthlyRFt+1 Kenneth French Data Library January 1929 - Dec 2010 MonthlyPEt+1 Robert Shiller Website January 1929 - Dec 2010 MonthlyVSt+1 Kenneth French Data Library January 1929 - Dec 2010 Monthly
In terms of the first state variable, ReM,t+1 is defined as the difference between the log returns on the
CRSP value weighted index and the log risk free rate. The second variable is the predicted market
variance conditional on all state variables at time t. The third variable is the US term spread which
is defined as the difference between the 10 year US government yield and the 3 month US treasury
bill rate. The fourth variable is the 3 month US Treasury Bill Rate. The fifth variable is the
logarithm of the smoothed SP500 price-earnings ratio available via Professor Bob Shiller’s website.
Finally the small stock value spread is the return differential between small value and small growth
stocks. This spread is computed using the return differential between the small growth and small
value portfolios available on the Kenneth French Data Library.
Our choice of these state variables is motivated by the need for variables that predict future cash
flows, discount rates and volatility. Firstly, financial variables such as the price-earnings ratio, small
stock value spread and market excess returns drive the cross-section of equity returns, indicating
that they hold important information for predicting future discount rates. Secondly, macroeco-
nomic variables such as the term spread and the risk free rate capture broad shifts in the economy
and hence are likely to be good predictors for future cash flows. Finally, historical volatility is
10
needed to predict future patterns in expected volatility.
4.3. Predicting Conditional Market Variance
In order to calculate the second state variable-conditional market variance σt, I follow the approach
of Campbell et al (2017). First I construct realized monthly market variance RV ARt+1 from daily
market return data using the standard definition of sample variance:
RV ARt+1 =1
N − 1[
N∑t=1
rt − r] (22)
Define rt+1 as the log daily return on the CRSP value weighted index and r as the average monthly
return for month t+ 1. I then run the following predictive regression:
RV ARt+1 = α+ φ1RV ARt + φ2MRt + φ3TSt + φ4RFt + φ5PEt + φ6V St (23)
To control for the heteroskedasticity of innovations to our state variables, Equation 16 is estimated
as a Weighted Least Squares (WLS) regression where each observation is weighted by the inverse
of the realized variance RV AR−1t . The predictive regression results are presented below:
Table 2: Variance Predictive Regression
Variance Prediction ParametersConstant RVARt MRt TSt RFt PEt VSt R2
(1) (2) (3) (4) (5) (6) (7) (8)0.001 0.70 -0.001 0.001 0.203 0.001 0.002 29.83%(0.88) (16.86) (-0.27) (3.13) (3.37) (-0.19) (1.53)
Table 2 clearly demonstrates that volatility is highly predictable, a result that has been well doc-
umented in the literature (Glosten, Jagannathan and Runke 1993; Ang et al 2006 and Bekaert,
Hodrick, Zhang 2011). This predictability largely arises from the persistence of lagged volatility
but the term spread and the risk free rate are also informative about expected volatility.
Using the above estimates of α and φ1 − φ6, I estimate σt as:
σt+1 = α+ φ1RV ARt + φ2MRt + φ3TSt + φ4RFt + φ5PEt + φ6V St (24)
I use σt+1 as the second state variable in the VAR system.
4.4. State Variable Summary Statistics
I document the mean, standard deviations and correlation matrix of all six state variables on the
11
next page. The early sample is from January 1929 to December 1969 and the later sample is from
January 1970 to December 2010. Note that Market Returns, Conditional Variance, Term Spread
and the Risk Free rate are denoted in percentage units whereas the PE Ratio and the Value Spread
is quoted in log real units.
Table 3. State Variable Summary Statistics
This table shows the summary statistics for the six state variables in the VAR system. The sample period
is from January 1929 to December 2010.
Full Sample Early Sample Later Sample
Variables µ σ µ σ µ σ
ReM 0.41% 5.49% 0.49% 6.17% 0.32% 4.74%σt+1 0.90% 2.07% 0.72% 2.16% 1.26% 1.84%TS 7.55% 8.09% 6.72% 5.85% 8.38% 9.74%RF 0.29% 0.26% 0.14% 0.14% 0.45% 0.25%PE 2.90 0.37 2.77 0.32 3.02 0.37VS 1.64 0.36 1.80 0.43 1.47 0.15
Correlations ReM,t+1 σt+1 TSt+1 RFt+1 PEt+1 VSt+1
ReM,t+1 1 -0.074 0.049 0.057 -0.006 -0.031
σt+1 -0.074 1 0.145 0.1263 -0.1929 0.2116TSt+1 0.049 0.145 1 -0.482 -0.103 0.2827RFt+1 0.057 0.1263 -0.482 1 0.1319 -0.4874PEt+1 -0.006 -0.1929 -0.103 0.1319 1 -0.3529VSt+1 -0.031 0.2116 0.2827 -0.4874 -0.3529 1
ReM,t 0.1147 -0.088 0.042 -0.063 0.070 -0.033
σt -0.066 0.685 0.148 0.118 -0.199 0.222TSt 0.050 0.1407 0.942 -0.489 -0.100 0.278RFt -0.445 0.140 -0.462 0.975 0.129 -0.485PEt -0.091 -0.184 -0.109 0.137 0.992 -0.350VSt -0.023 0.207 0.289 -0.488 -0.354 0.991
The correlation matrix outlines a few noteworthy trends. Firstly, many of the state variables ex-
hibit a strong autocorrelation structure. This is especially true of the term spread, risk free rate,
price-earning ratios and the small value stock spread, whose autocorrelations exceed 90%. Market
returns are far less persistent, though autocorrelations are still significant at 11.45%.
Secondly, the correlation matrix highlights clear economic relationships between the state variables.
The positive correlation between the term spread and the small stock value spread indicates that
long term bonds underperform exactly when growth stock prices are high. One possible explana-
tion, As outlined by Campbell and Vuolteenaho (2004), is that long term bonds are highly sensitive
to changes in inflationary expectations, inducing negative correlation with high duration growth
assets. Another interesting result is that the small stock value spread and the price-earning ratios
12
are negatively correlated. If we take the PE ratio as a proxy for the cost of capital, this indi-
cates that growth stocks are more exposed to external financing conditions than value stocks and
are hence more exposed to discount rate shocks. This point will remain relevant in our later analysis.
4.5. News Estimation
Table 4 on the next page documents the VAR estimation results as well as summary statistics on
the news terms. The results add further weight to the previous observation that our state variables
are highly persistent, with most variables exhibiting a lagged coefficient in excess of 90% . Expected
Volatility also has a large lagged coefficient of approximately 61%.
In addition, the VAR results indicate that many of the state variables have reasonable predictive
power for future market excess returns. The PE ratio negatively predicts future returns at a 1%
significance level, a result that is consistent with a long literature documenting the link between
the PE ratio and discount rates (Campbell and Shiller 1988b, 1998; Campbell 1987; Fama and
French 1989). Volatility also negatively predicts future returns, in line with recent evidence that
investors engage in volatility hedging behaviour (Ang et al 2006; Adrian and Rosenberg 2008;
Lettau, Ludvigson and Wachter 2008)
13
Table 4. VAR Parameter Estimates and News Terms
Variables Constant ReM,t+1 σt TSt RFt PEt VSt R2 F
ReM,t+1 0.082 -0.102 - 0.870 0.004 -0.54 -0.018 -0.011 2.70% 5.538
(-0.352) (3.232) (-2.023) (1.398) (-0.606) (-3.580) (-1.88)σt+1 0.001 -0.003 0.615 0.001 0.292 0.00 0.001 49.80% 1.578 x 103
(0.857) (-1.489) (24.504) (3.629) (5.535) (-1.490) (4.080)TSt+1 -0.0369 -0.001 1.604 0.934 -1.440 0.001 0.047 88.67% 1.280 x 104
(-0.352) (-0.003) (0.748) (73.837) (-0.319) (-0.032) (1.579)RFt+1 0.001 -0.001 0.002 -0.008 0.954 0.001 -0.002 95.22% 3.234 x 104
(1.39) (-1.357) (0.499) (-3.208) (102.08) (0.383) (-1.776)PEt+1 0.024 0.519 -0.243 0.002 0.611 0.991 -0.001 99.03% 1.666 x 104
(1.67) (24.389) (-0.848) (1.396) (1.010) (292.297) (-0.292)VSt+1 0.019 -0.001 1.267 -0.003 -1.479 0.001 0.985 98.24% 9.087 x 104
(1.019) (-0.326) (3.340) (-1.507) (-1.856) (0.120) (183.919)
Summary Statistics µ σ 25% Mean 75% Skewness Kurtosis
NCt+1 0.00% 0.03% - 0.01% 0.00% 0.01% 0.004 19.827NDt+1 0.00% 0.57% -0.02% 0.00% 0.02% -0.01 33.889NVt+1 0.00% 0.19% 0.00% 0.00% 0.00% 0.43 20.903ECt -2.60% 1.29% -2.96% -2.19% -1.81% -2.745 14.795EDt -2.60% 1.28% -2.92% -2.20% -1.80% -2.703 14.369EVt+1 2.60% 0.20% 0.14% 0.20% -1.80% 2.900 14.749
News Correlation NCt+1 NDt+1 NVt+1 ECt EDt EVtNCt+1 1 0.046 - 0.609 0.001 -0.003 -0.016
(1.450) (-24.040) (0.030) (-0.100) (-0.490)NDt+1 0.046 1 -0.057 -0.02 -0.01 0.03
(1.450) (-1.780) (-0.590) (-0.420) (0.940)NVt+1 -0.609 -0.057 1 0.050 0.056 -0.043
(-24.040) (-1.780) (1.560) (1.750) (-1.330)ECt 0.001 -0.02 0.050 1 0.998 -0.897
(0.030) (-0.590) (1.560) (535.640) (-63.430)EDt -0.003 -0.01 0.056 0.998 1 -0.895
(-0.100) (-0.420) (1.750) (535.640) (-62.590)EVt+1 -0.016 0.03 -0.043 -0.897 -0.895 1
(-0.490) (0.940) (-1.330) (-63.430) (-62.590)
14
NC
Tim
e
NC
19401960
19802000
−0.003 −0.001 0.001 0.003
ND
Tim
e
ND
19401960
19802000
−0.006 −0.002 0.002 0.004
NV
Tim
e
NV
19401960
19802000
−0.0015 −0.0005 0.0005 0.0015
EC
Tim
e
EC
19401960
19802000
−0.12 −0.10 −0.08 −0.06 −0.04 −0.02
ED
Tim
e
ED
19401960
19802000
−0.12 −0.10 −0.08 −0.06 −0.04 −0.02
EV
Tim
e
EV
19401960
19802000
0.000 0.005 0.010 0.015
Evolution of N
ews Term
s
The time series graphs of the news terms indicates that two very interesting patterns. Firstly, expecta-
tion terms ECt and EDt exhibit right skewed distributions that experience large negative asymmetric
shocks centred around 1987, 2001 and 2007-2008. EVt on the other hand has a left skewed distribu-
tion with large positive asymmetric shocks occuring around the same dates. These trends also have an
intuitive explanation: investor expectations about the short term future are on average negative and
subject to large negative shocks because risk aversion implies that investors apply greater weight to bad
news than to good news of the same magnitude. Similarly expected volatility is on average positive with
large positive shocks given that negative shocks to cash flows and discount rates have a larger asym-
metric effect on volatility than positive shocks (Black 1976). In addition major systemic events such as
Black Monday (1987), the Technology Bubble (2000-2001) and the Global Financial Crisis (2007-2008)
dramatically shift these short term expectations, resulting in large volatility in the short term which
accounts for the opposite directions of the asymmetric shocks for ECt and EDt and EVt. Thus systemic
risk events dramatically shift short term expectations about cash flows, discount rates and volatility.
Secondly the shock terms NCt+1, NDt+1 and NVt+1 are largely centred around zero, though the dis-
tributions exhibit significant kurtosis as documented by table 4. This is captured by large spikes in
the time series for each of these terms at the beginning (1930s) and end (2007-2008) of the samples.
Since these time intervals correspond with the Great Depression and the recent Global Financial Crisis,
a natural interpretation is that systemic risk events dramatically shift investor expectations about long
term cash flows, discount rates and volatility. This indicates that asset prices are not only affected by
short term expectations but also long run expectations about the future which is entirely consistent with
the Epstein and Zin utility framework that my model is seeking to test.
4.6. Multivariate Regression Estimation
Adding further credence to the relevance of the long run risk framework are the results of the multivari-
ate beta regressions which are documented in table 5. Table 5 documents the multivariate regressions
for all 10 momentum sorted portfolios available on the Kenneth French Data library. Portfolio I is the
’loser’ portfolio and contains the stocks whose previous 12 month market performance ranked in the
bottom 10% of stocks across the NASDAQ and NYSE at any point in time. Portfolio X is the ’winner’
portfolio and represents the top 10% of stock performers across the NASDAQ and NYSE at any point in
time. The reported R2 is the implied measure defined by equation 10. Estimation results are reported
for both the full sample and a later sub-sample from January 1970-December 2010.
The regression results indicate four main findings. Firstly, The no-arbitrage model of momentum outlined
in this paper does a reasonably good job at explaining the cross-section of momentum sorted portfolios.
For all portfolios, the R2 ranges between 50-75% with all the alphas statistically insignificant at the 10%
level. The plot of the predicted momentum return using the regression coefficients against the realised
momentum returns also paint a similar story with the best fitted line for each portfolio approximating
a 45 degree line.
16
Table 5: Panel A: Multivariate Regressions
Model α ∆Nc ∆Nd ∆Nv ∆Ec ∆Ed ∆Ev R2 N
Panel A: 1929-2010
I -0.004 0.926∗∗∗ −1.166∗∗∗ 0.077 0.102∗∗∗ −0.060∗ 0.328∗∗∗ 60.60% 898(-1.59) (15.33) (-32.35) (0.52) (3.18) (-1.94) (4.72)
II −0.002 0.902∗∗∗ −0.988∗∗∗ 0.061 0.030 −0.0134 0.109∗∗ 66.55% 898(-1.17) (19.14) (-35.11) (0.53) (1.20) (-0.55) (2.00)
III -0.002 0.762∗∗∗ −0.898∗∗∗ −0.090% 0.024 0.001 0.179∗∗∗ 70.39% 898(-0.87) (19.46) (-38.46) (-0.95) (1.16) (0.06) (3.97)
IV 0.000 0.717∗∗∗ −0.781∗∗∗ −0.021 0.016 0.011 0.213∗∗∗ 66.45% 898(-0.49) (18.84) (-34.40) (-0.22) (0.81) (0.55) (4.86)
V 0.00 0.650∗∗∗ −0.790∗∗∗ −0.124∗ −0.002 0.020 0.131∗∗∗ 74.25% 898(-0.05) (20.70) (-42.13) (-1.62) (-0.11) (1.22) (4.86)
VI 0.00 0.677∗∗∗ −0.751∗∗∗ −0.068 −0.002 0.024 0.131∗∗∗ 73.46% 898(0.31) (21.74) (-40.41) (-0.90) (-0.09) (1.48) (4.66)
VII 0.00 0.638∗∗∗ −0.701∗∗∗ −0.101 −0.010 0.046∗∗∗ 0.265∗∗∗ 71.57% 898(0.41) (20.39) (-37.58) (-1.32) (-0.63) (2.85) (7.36)
VIII 0.0012 0.593∗∗∗ −0.653∗∗∗ −0.083 0.002 0.021 0.168∗∗∗ 65.13% 898(0.82) (17.91) (-33.01) (-1.02) (0.11) (1.21) (4.41)
IX 0.001 0.582∗∗∗ −0.704∗∗∗ −0.077 −0.022 0.040∗∗ 0.141∗∗∗ 65.09% 898(0.79) (16.53) (-33.54) (-0.90) (-1.19) (2.23) (4.41)
X 0.002 0.569∗∗∗ −0.693∗∗∗ −0.089 −0.038 0.052∗∗ 0.141∗ 49.83% 898(0.93) (11.93) (-24.34) (-0.77) (-1.49) (2.13) (1.93)
Panel B: 1970-2010
I -0.005 1.023∗∗∗ −1.296∗∗∗ 0.354∗ 0.271∗∗∗ −0.247∗∗∗ 0.193∗ 49.90% 451(-1.35) (9.96) (-19.13) (1.78) (4.02) (-3.80) (1.73)
II -0.003 0.894∗∗∗ −1.071∗∗∗ 0.141∗∗∗ −0.136∗∗∗ −0.013 0.027 58.15% 451(-1.11) (11.92) (-21.65) (1.21) (2.86) (-2.87) (0.33)
III -0.002 0.708∗∗∗ −0.919∗∗∗ -0.037 0.080∗ −0.072∗ 0.039% 59.83% 451(-0.88) (11.19) (-22.01) (-0.30) (1.93) (-1.79) (0.56)
IV 0.00 0.728∗∗∗ −0.847∗∗∗ 0.056 0.088∗∗∗ −0.074∗∗ 0.105∗ 64.18% 451(-0.52) (13.53) (-23.86) (0.54) (2.50) (-2.18) (1.80)
V 0.00 0.646∗∗∗ −0.816∗∗∗ −0.050∗ 0.078∗∗∗ −0.072∗∗ 0.034 67.98% 451(-0.16) (13.68) (-26.21) (-0.54) (2.51) (-2.39) (0.66)
VI 0.00 0.729∗∗∗ −0.834∗∗∗ 0.039 0.059∗∗ −0.050∗ 0.070 71.19% 451(-0.05) (15.85) (-27.51) (0.43) (1.96) (-1.72) (1.41)
VII 0.00 0.724∗∗∗ −0.828∗∗∗ −0.025 0.074∗∗ −0.055∗ 0.135 68.75% 451(-0.29) (14.98) (-25.98) (0.26) (2.33) (-1.81) (2.57)
VIII 0.00 0.690∗∗∗ −0.828∗∗∗ 0.007 0.063∗∗% −0.058∗∗ 0.026 69.63% 451(0.13) (14.76) (-26.85) (0.07) (2.06) (-1.97) (4.41)
IX 0.00 0.714∗∗∗ −0.891∗∗∗ −0.010 −0.060∗ −0.056∗ 0.034 67.84% 451(0.21) (13.66) (-25.85) (-0.10) (1.76) (-1.69) (0.61)
X 0.00 0.816∗∗∗ −1.053∗∗∗ 0.057 0.086∗ −0.081∗∗ 0.048∗ 58.33% 451(0.19) (10.95) (-21.44) (0.39) (1.76) (-1.72) (0.19)
17
−0.4
0.00.2
0.40.6
0.8
−0.5 0.0 0.5
M1
PM1
−0.4
0.00.2
0.40.6
0.8
−0.6 −0.2 0.2 0.6
M2
PM2
−0.2
0.00.2
0.40.6
−0.4 0.0 0.4
M3
PM3
−0.2
0.00.2
0.40.6
−0.4 0.0 0.2 0.4
M4
PM4
−0.2
0.00.2
0.40.6
−0.4 0.0 0.2 0.4
M5
PM5
−0.2
0.00.2
0.4
−0.4 0.0 0.2 0.4
M6
PM6
−0.3
−0.1
0.10.2
0.3
−0.4 0.0 0.2 0.4
M7
PM7
−0.2
−0.1
0.00.1
0.20.3
−0.4 −0.2 0.0 0.2 0.4
M8
PM8
−0.2
0.00.1
0.20.3
−0.4 0.0 0.2 0.4
M9
PM9
−0.3
−0.2
−0.1
0.00.1
0.20.3
−0.4 −0.2 0.0 0.2 0.4
M10
PM10P
redicted vs Realised P
lot
Secondly there is a clear declining relationship between the R2 and the decile of our momentum portfolio,
with the R2 maximised for the losers and minimized for the winners. A similar phenomenon was also
evident in the R2 of our multivariate regressions. This indicates that our momentum model has greater
predictive power for the losers than the winners.
Thirdly, the signs of the betas largely conform to the predictions of the model. βNC is positive whereas
βND is negative, confirming that cash flow (discount rate) risk attracts a positive (negative) risk pre-
mium. The expectation terms for cash flow and discount rates βEC and βED are not statistically
significant in the full sample and their signs fluctuate across momentum portfolios. However this is
corrected in the later sample, as the coefficients for βEC and βED turn positive (negative) and are all
statistically significant at the 5-10% level. This result may make sense given well documented evidence
that many of the prevalent asset pricing anomalies today such as size, value and betting against the beta
are not observable in the data before the 1970s (Campbell, Vuolteenaho 2004). Thus state variables
that predict expected cash flows and discount rates are likely to have far less predictive power for the
early sample (1929-1970), explaining the fluctuating sign of the coefficients on Ec,t, Ed,t.
Fourthly, the signs of the volatility betas fluctuate across the momentum portfolios for both the full
sample and the later sample. For both samples, the sign of βNV fluctuates and is largely insignificant at
all significance levels. The sign of βEV is largely significant in the full sample but loses its significance in
the later sample. This indicates that βEV was capturing the noise in the earlier sample resulting from
the reduced predictability of asset returns discussed earlier. Improved predictability of asset returns in
the later sample has therefore largely removed the significance of βEV , indicating that both our volatility
terms Nv,t+1 and Ev,t do not predict momentum returns.
Finally and most significantly, the central prediction of our model, that the winner portfolio should
have higher cash flow betas, is inconsistent with our results. We find that for both samples, the loser
portfolio attracts higher betas than the winner portfolio. Table 8 on the next page documents this
formally with the beta differential between the loser portfolio (I) and the winner portfolio (X) ranging
between 0.3 - 0.5 for all the respective betas. This is robust even when we apply the lag adjusted beta
measurement defined by equations 11 and 12. There is a clear descending relationship between cash
flow, discount rate and volatility betas and the rank of the momentum portfolio, whereas our model
implies the exact opposite: winner stocks attract higher betas as compensation for cash flow, discount
rate and volatility risk. Whilst the magnitude of the R2 still suggests that our model has a lot to say
about the cross-section of momentum returns, the failure of the model prediction indicates that there
are some important ingredients missing from our model.
19
Table 6: Lag Adjusted Betas
Model βNC βND βNV βEC βED βEVPanel A: 1929-2010 (Full Sample)
I 0.557 −1.549 −0.164 3.044 −2.589 −0.078II 0.481 −1.297 −0.142 2.600 −2.268 −0.111III 0.414 −1.134 −0.140 2.358 −2.090 −0.158IV 0.378 −1.030 −0.122 2.283 −2.034 −0.145V 0.361 −0.981 −0.118 1.965 −1.733 −0.151VI 0.341 −0.954 −0.120 2.00 −1.785 −0.139VII 0.300 −0.881 −0.109 1.950 −1.783 −0.165VIII 0.292 −0.836 −0.105 1.953 −1.768 −0.200IX 0.263 −0.843 −0.097 1.757 −1.613 −0.155X 0.284 −0.856 −0.091 2.145 −1.974 −0.279
Panel B: 1929-1969 (Early Sample)
I 0.800 −1.799 −0.218 2.202 −1.637 0.153II 0.603 −1.524 −0.181 3.088 −2.664 −0.081III 0.532 −1.352 −0.177 2.949 −2.593 −0.177IV 0.466 −1.237 −0.146 2.605 −2.263 −0.133V 0.445 −1.176 −0.143 2.480 −2.172 −0.179VI 0.420 −1.119 −0.147 2.789 −2.500 −0.217VII 0.356 −1.029 −0.133 2.703 −2.470 −0.244VIII 0.347 −0.946 −0.130 2.524 −2.274 −0.264IX 0.308 −0.926 −0.126 2.300 −2.111 −0.228X 0.319 −0.843 −0.120 2.837 −2.643 −0.365
Panel B: 1970-2010 (Later Sample)
I 0.311 −1.154 −0.976 4.066 −3.787 −0.443II 0.285 −0.932 −0.080 1.889 −1.705 −0.161III 0.225 −0.779 −0.081 1.809 −1.685 −0.136IV 0.237 −0.692 −0.083 2.013 −1.912 −0.171V 0.227 −0.667 −0.079 1.123 −1.014 −0.107VI 0.213 −0.689 −0.076 0.771 −0.688 −0.016VII 0.210 −0.643 −0.071 0.755 −0.693 −0.038VIII 0.203 −0.659 −0.065 1.092 −1.012 −0.100IX 0.191 −0.710 −0.050 0.825 −0.747 −0.036X 0.228 −0.876 −0.045 1.044 −0.910 −0.141
20
Table 7 Panel A: GMM Betas (Full Sample)
Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)
Portfolio I II III IV V VI VII VIII IX X
βNC 0.919∗∗∗ 0.918∗∗∗ 0.670∗∗∗ 0.700∗∗∗ 0.778∗∗∗ 0.800∗∗∗ 0.743∗∗∗ 0.641∗∗∗ 0.761∗∗∗ 0.869∗∗∗
(4.80) (7.95) (4.64) (7.74) (7.51) (9.18) (9.40) (4.94) (7.25) (3.96)βND −1.14∗∗∗ −0.924∗∗∗ −0.772∗∗∗ −0.650∗∗∗ −0.690∗∗∗ −0.801∗∗∗ −0.723∗∗∗ −0.575∗∗∗ −0.718∗∗∗ −0.828∗∗∗
(-9.67) (-17.20) (-10.59) (-7.93) (-8.24) (-14.94) (-12.43) (-7.79) (-8.04) (-4.19)βNV 0.038 0.45 −0.19 0.250∗ −0.087 0.124 0.200 0.02 0.101 0.010
(1.12) (0.41) (-1.20) (1.87) (-0.57) (1.01) (1.25) 0.08 (0.52) (0.05)
N 467 503 516 538 544 548 564 564 566 550R2 59.40% 69.25% 62.79% 63.94% 65.28% 66.56% 52.88% 48.77% 47.47% 24.97%
Panel B: GMM Betas (Later Sample)
Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)
Portfolio I II III IV V VI VII VIII IX X
βNC 0.880∗∗∗ 0.707∗∗∗ 0.454∗∗∗ 0.841∗∗∗ 0.646∗∗∗ 0.647∗∗∗ 0.697∗∗∗ 0.613∗∗∗ 0.791∗∗∗ 1.272∗∗∗
(2.60) (4.34) (4.08) (7.63) (6.52) (5.97) (6.18) (4.03) (6.40) (5.57)βND −1.07∗∗∗ −0.9793∗∗∗ −0.500∗∗∗ −0.759∗∗∗ −0.498∗∗∗ −0.794∗∗∗ −0.682∗∗∗ −0.431∗∗∗ −0.621∗∗∗ −1.125∗∗∗
(-4.95) (-6.23) (-5.06) (-8.97) (-5.64) (-10.86) (-9.01) (-4.35) (-6.09) (-6.89)βNV 0.176 0.09 −0.27∗ 0.040 −0.1597 0.100 0.020 −0.349 −0.432∗∗ −0.370
(0.39) (1.13) (-1.64) (0.39) (-1.15) (0.72) (0.10) -1.52 (-2.04) (-1.01)
N 226 245 250 264 270 267 280 282 285 274R2 40.00% 46.38% 35.71% 58.61% 44.91% 63.66% 54.66% 39.80% 39.21% 19.15%
Panel C: GMM Betas (Earlier Sample)
Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)
Portfolio I II III IV V VI VII VIII IX X
βNC 1.05∗∗∗ 0.992∗∗∗ 0.888∗∗∗ 0.759∗∗∗ 0.821∗∗∗ 0.945∗∗∗ 0.741∗∗∗ 0.683∗∗∗ 0.707∗∗∗ 0.614∗∗∗
(6.09) (6.470) (6.53) (6.13) (5.18) (7.92) (7.17) (7.54) (4.98) (5.22)βND −1.145∗∗∗ −0.952∗∗∗ −0.864∗∗∗ −0.700∗∗∗ −0.822∗∗∗ −0.858∗∗∗ −0.773∗∗∗ −0.6135∗∗∗ −0.833∗∗∗ −0.600∗∗∗
(-8.20) (-18.35) (-15.46) (-7.00) (-18.03) (-10.35) (-8.52) (-10.17) (-4.64) (-5.48)βNV 0.616 −0.018 −0.120 0.6620∗∗∗ −0.104 0.278 0.550∗∗∗ 0.286∗ 0.481∗∗ 0.106
(1.31) (-0.06) (-0.63) (2.77) (-0.54) (1.37) (2.50) 1.92 (2.23) (0.36)
N 241 258 266 274 274 281 284 282 281 276R2 72.76% 77.98% 77.79% 71.10% 77.51% 70.20% 52.81% 59.82% 55.48% 40.44%
21
Table 8 Panel A: Beta Differentials (Multivariate)
Full Sample βNC βND βNV βEC βED βEVX 0.569 −0.693 0.089 −0.038 −0.052 0.140I 0.926 −1.166 0.072 0.103 −0.060 0.329|D| -0.357 -0.473 0.017 -0.075 -0.008 -0.189
Early Sample βNC βND βNV βEC βED βEVX 0.515 −0.535 −0.339 −0.003 0.007 0.182I 0.904 −1.111 −0.247 0.0599 −0.001 0.366|D| -0.389 -0.576 0.092 -0.063 0.008 -0.184
Late Sample βNC βND βNV βEC βED βEVX 0.815 −1.053 0.057 −0.086 −0.081 0.048I 1.024 −1.296 0.354 0.271 −0.247 0.193|D| -0.204 -0.243 -0.297 -0.357 -0.166 -0.145
Panel B: Beta Differentials (Lag Adjusted Betas)
Portfolio βNC βND βNV βEC βED βEVX 0.284 −0.856 −0.091 2.145 −1.974 −0.279I 0.557 −1.550 −0.164 3.044 −2.589 0.078|D| -0.273 -0.694 -0.074 -0.899 -0.615 -0.357
Early Sample βNC βND βNV βEC βED βEVX 0.319 −0.843 −0.120 2.837 −2.643 0.365I 0.800 −1.799 −0.218 2.202 −1.637 −0.153|D| -0.481 -0.956 -0.098 0.635 1.006 0.518
Late Sample βNC βND βNV βEC βED βEVX 0.228 −0.876 −0.045 1.044 −0.910 −0.141I 0.311 −1.154 −0.976 4.066 −3.787 −0.443|D| -0.083 -0.278 -0.931 -3.022 -2.877 -0.298
Panel C: Beta Differentials (GMM)
Full Sample βNC βND βNVX 0.869 −1.140 0.010I 0.919 −0.828 0.038|D| -0.050 -0.312 -0.028
Early Sample βNC βND βNVX 0.614 −0.600 0.101I 1.050 −1.145 0.616|D| -0.436 -0.545 -0.515
Late Sample βNC βND βNVX 1.272 −1.125 −0.370I 1.050 −1.070 0.176|D| 0.222 0.055 -0.526
22
4.7. Lagged Adjusted Betas and GMM estimation
Tables 6 and 7 document the estimation results of our lag adjusted and GMM beta estimation. Table
6 reports lag adjusted betas computed in accordance with equations 9 and 10. Table 7 documents
my GMM beta estimation using the moment condition specified in equation 17. Table 8 documents the
beta differentials between the winner and loser portfolios using our beta estimates from our 3 approaches.
Table 8 documents a broadly similar result to our multivariate betas: the loser portfolio has higher cash
flow and discount rate betas than the winner portfolio, again inconsistent with our model. This result is
relatively robust across both the full sample and the two sub-samples. The only exception to this result
is the late sample of our GMM estimation which yields higher cash flow and discount rate betas for
the winner portfolio, consistent with our model. Whilst such a development is promising for our model,
in light of all other contrary evidence we must treat this evidence as an anomaly rather any concrete
evidence that the model predictions are correct.
Another interesting result is that the lag adjusted betas reported in table 6 are considerably lower than
the multivariate and GMM betas for both the full sample and the two sub-samples. This indicates that
the illiquidity effect of stale prices is not only biasing upwards our beta estimates in the earlier sample
but also in the later sample. This result is consistent with recent literature documenting the existence of
a liquidity risk premium (Amihud and Mendelson 1986; Brennan and Subrahmanyan (1996) and Easley
et al 2002). Also the signs flip for βEV indicating that after illiquidity is controlled for momentum
returns are decreasing in expected volatility, a finding that is consistent with the observed volatility
hedging motivation amongst investors (Bakshi and Kapadia 2003).
23
5. Concluding Remarks
The primary contribution of this paper has been to test whether or not long run risk models of the
Stochastic Discount Factor can account for the momentum anomaly. Using a model framework that
largely borrows from Campbell and Vuolteenaho (2004) and Campbell et al (2014), I derive a no-arbitrage
model of the momentum anomaly based on a representative investor with Epstein and Zin recursive util-
ity preferences. Whilst I find that this model can explain a large portion of the cross-sectional variation
in momentum returns, I do not find that the main predictions of the model are consistent with the data.
Loser stocks have higher cash flow, discount rate and volatility betas than winner stocks, whereas my
model indicates that the converse should be true.
Whilst the main predictions of the model are not supported by the data, the strong in-sample perfor-
mance of the model indicates that the model contains some informative predictive power for momentum
returns. I interpret the high R2 for our multivariate regressions and the GMM beta estimations for the
loser portfolio as evidence that our model does a good job of explaining return variation in the loser
portfolio. However the lower R2 for the winner portfolio indicates that the model is still missing a key
ingredient in the pricing of the momentum anomaly.
One possible area of future research is to look at higher moments of the news terms Nc,t+1, Nd,t+1 and
Nv,t+1. As pointed out by Kalev, Saxena and Zolotoy (2017), the standard derivation of ICAPM risk
prices in the long run risk framework of Campbell and Vuolteenaho (2004) defined by equation 2 relies
on a first order taylor approximation to the Epstein and Zin pricing kernel. This first order approx-
imation by construction removes higher moments of the news terms from the model set-up. Kalev,
Saxena and Zolotoy (2017) demonstrate that exclusion of these higher order news terms may be costly
for pricing momentum given empirical evidence that the squared news terms N2c,t+1, N
2d,t+1 and the
covariance news term Ncd,t+1 are priced for the 25 size and momentum sorted portfolios available on the
Kenneth French data library. Thus it may very well be possible that if our model were to include these
higher order news moments, the results would become consistent with the main predictions of the model.
Another possible area of future research would be to examine some of the methodological approaches
to estimating the news terms employed by this paper as well as by Campbell and Vuolteenaho (2004)
and Campbell et al (2016). In particular it may be necessary to rethink alternative approaches to the
VAR system which has not been proven to be robust to changes in the state variables used in the VAR
system (Chen and Zhao 2008). In this regard Kozak (2017) provides a novel alternative to the VAR
system which involves using future realized returns as a proxy for future expected returns which yields
consistent estimates of the news terms without requiring an estimate for future returns. Re-examining
the work in this paper using this alternative construction to estimate our news and expectation terms
may very well also yield results more consistent with the predictions of the model.
24
6. References
Ang, A., Hodrick, R.J., Xing, Y., Zhang, X., 2006. The cross-section of volatility and expected returns.
Journal of Financial and Quantitative Analysis 45, 1341-1365
Amihud, Y., Mendelson, J., 1986a. Asset Pricing and the Bid-Ask Spread, Journal of Financial Eco-
nomics, 17, 223249
Asness, C.S., Moscowitz, T.J., Pedersen, L.H., 2013. Value and Momentum Everywhere. Journal of
Finance 68 (3), 929-985
Bakshi, G., Kapadia, N., 2003. Delta-Hedged Gains and the Negative Market Volatility Risk Premium,
Review of Financial Studies, 16 (2), 527-566
Bansal, R., Kiku D., Shaliastovich, I., Yaron, A., 2014. Volatility, the macroeconomic and asset prices.
The Journal of Finance 69 (6), 2471-2511.
Bansal, R., Yaron, A., 2004. Risks for the Long Run: a potential resolution of Asset Pricing Puzzles.
Journal of Finance 59 (4), 1481-1509
Barroso, P., Santa-Clara, P., 2014. Momentum Has Its Moments. Journal of Financial Economics, 116
(1), 111-120
Bekaert, G., Hodrick, R.J., Zhang X., 2012. Aggregate idiosyncratic volatility. Journal of Financial and
Quantitative Analysis 47 (6), 1155-1185
Black, F., 1976. The Pricing of Commodity Contracts, Journal of Financial Economics, 3, 167-179
Brennan, M., Subrahmanyam, A., 1996. Market Microstructure and Asset Pricing: On the Compensa-
tion for Illiquidity in Stock Returns, Journal of Financial Economics, 41 (3), 441-464
Campbell, J.Y., 1993. Intertemporal Asset Pricing without Consumption Data. American Economic
Review 83, 487-512.
Campbell, J.Y., Giglio, S.W., Polk, C., Turley, R., 2017. An intertemporal CAPM with Stochastic
Volatility. Journal of Financial Economics, Forthcoming.
Campbell, J.Y., Shiller, R.J., 1988a. The Dividend-Price Ratio and Expectations of Future Dividends
and Discount Factors. Review of Financial Studies 1, 195-228
25
Campbell, J.Y., Vuolteenaho, T.O., 2004. Bad Beta, Good Beta. American Economic Review 94, 1249-
1275
Chen, N.F., Roll, R., Ross, S.A., 1986. Economic forces and the stock market. Journal of Business,
383-403
Dimson, E., 1979. Risk Measurement When Shares Are Subject to Infrequent Trading, Journal of Fi-
nancial Economics, 7 (2), 197-226
Easley, D., Hvidkjaer, Soeren., O’Hara, M., 2002. Is Information Risk a Determinant of Asset Returns?,
Journal of Finance, 57 (5), 2185-2221
Epstein, L.G., Zin, S.E., 1989. Substitution, risk aversion, and the temporal behaviour of consumption
and asset returns: A theoretical framework. Econometrics, 937-969
Fama, E.F., 1984. Forward and spot exchange rates. Journal of Monetary Economics 14, 319-338
Fama, E.F., French., K.R., 1988. Dividend yields and expected stock returns. Journal of Financial
Economics 22 (1), 3-25
Fama, E.F., French., K.R., 1989. Business conditions and expected returns on stocks and bonds. Jour-
nal of Financial Economics 25 (1), 23-49
Fama, E.F., French., K.R., 1993. Common risk factors in the returns on stocks and bonds. Journal of
Financial Economics 33 (1), 3-56
Fama, E.F., French., K.R., 1996. Multifactor Explanations of Asset Pricing Anomalies. Journal of
Finance 51 (1), 55-84
Glosten, L., Jagannathan, R., Runkle, D., 1993. On the Relation between the Expected Value and the
Volatility of the Nominal Excess Return on Stocks, Journal of Finance, 48 (5), 1779-1801
Jegadeesh, N., Titman, S., 1993. Returns to buying winners and selling losers: Implications for stock
market efficiency. Journal of Finance 48, 65-91
Koijen, R., Moskowitz, T.J., Pederson, L.H., Vrugt, E.B., 2013. Carry. Fama-Miller Working Paper,
Chicago, IL
Kalev, S., Saxena, K., and Zolotoy, L., 2017. Coskewness Risk Decomposition, Covariation Risk, and
Intertemporal Asset Pricing. Journal of Financial and Quantitative Analysis (JFQA), Forthcoming.
26
Lettau, M., Maggiori, M., and Weber., 2013. Conditional Risk Premia in Currency Markets and Other
Asset Classes. Journal of Financial Economics (JFE), 114 (2).
Li, Q., Vassalou, M., Xing., Y., 2006. Sector Investment Growth Rates and the Cross Section of Equity
Returns. Journal of Business, 79 (3), 1637-1665
Liu, L., Zhang, L., 2008. Momentum Profits, Factor Pricing, and Macroeconomic Risk. Review of
Financial Studies, 21 (6), 24172448
Merton, R.C., 1974. On the Pricing of Corporate Debt: The Risk Structure of Interest Rates, Journal
of Finance,29 (2), 449-470
Scholes, W., Williams, J., 1977. Estimating betas from non-synchronous data, Journal of Financial
Economics, 5 (3), 309-327
27