forecasting hierarchical time series - rob j hyndman · pdf fileoutline 1 hierarchical time...

AA AB AC

BA BB BC

CA CB CC

Rob J Hyndman

Forecastinghierarchical time series

Outline

1 Hierarchical time series

2 Forecasting framework

3 Optimal forecasts

4 Approximately optimal forecasts

5 Application to Australian tourism

6 hts package for R

7 References

Forecasting hierarchical time series Hierarchical time series 2

Introduction

AA AB AC

BA BB BC

CA CB CC

Examples

Manufacturing product hierarchiesNet labour turnoverPharmaceutical salesTourism demand by region and purpose

Introduction

AA AB AC

BA BB BC

CA CB CC

Examples

Introduction

AA AB AC

BA BB BC

CA CB CC

Examples

Introduction

AA AB AC

BA BB BC

CA CB CC

Examples

Introduction

AA AB AC

BA BB BC

CA CB CC

Examples

Forecasting the PBS

ATC drug classificationA Alimentary tract and metabolismB Blood and blood forming organsC Cardiovascular systemD DermatologicalsG Genito-urinary system and sex hormonesH Systemic hormonal preparations, excluding sex hor-

mones and insulinsJ Anti-infectives for systemic useL Antineoplastic and immunomodulating agentsM Musculo-skeletal systemN Nervous systemP Antiparasitic products, insecticides and repellentsR Respiratory systemS Sensory organsV Various

ATC drug classification

A Alimentary tract and metabolism14 classes

A10 Drugs used in diabetes84 classes

A10B Blood glucose lowering drugs

A10BA Biguanides

A10BA02 Metformin

Australian tourism

Also split by purpose of travel:

Holiday

Visits to friends and relatives

Business

Hierarchical/grouped time seriesA hierarchical time series is a collection ofseveral time series that are linked together in ahierarchical structure.

Example: Pharmaceutical products are organized ina hierarchy under the Anatomical TherapeuticChemical (ATC) Classification System.

A grouped time series is a collection of timeseries that are aggregated in a number ofnon-hierarchical ways.

Example: Australian tourism demand is grouped byregion and purpose of travel.

Hierarchical/grouped time seriesForecasts should be “aggregateconsistent”, unbiased, minimum variance.

Existing methods:ã Bottom-upã Top-downã Middle-out

How to compute forecast intervals?

Most research is concerned about relativeperformance of existing methods.

There is no research on how to deal withforecasting grouped time series.

Top-down method

Advantages

Works well inpresence of lowcounts.

Single forecastingmodel easy tobuild

Provides reliableforecasts foraggregate levels.

Disadvantages

Loss of information,especiallyindividual seriesdynamics.

Distribution offorecasts to lowerlevels can bedifficult

No predictionintervals

Top-down method

Advantages

Disadvantages

Top-down method

Advantages

Disadvantages

Top-down method

Advantages

Disadvantages

Top-down method

Advantages

Disadvantages

Top-down method

Advantages

Disadvantages

Bottom-up method

Advantages

No loss ofinformation.

Better capturesdynamics ofindividual series.

Disadvantages

Large number ofseries to beforecast.

Constructingforecasting modelsis harder becauseof noisy data atbottom level.

Bottom-up method

Advantages

Disadvantages

Bottom-up method

Advantages

Disadvantages

Bottom-up method

Advantages

Disadvantages

Bottom-up method

Advantages

Disadvantages

A new approach

We propose a new statistical framework forforecasting hierarchical time series which:

1 provides point forecasts that areconsistent across the hierarchy;

2 allows for correlations and interactionbetween series at each level;

3 provides estimates of forecast uncertaintywhich are consistent across the hierarchy;

4 allows for ad hoc adjustments andinclusion of covariates at any level.

A new approach

Hierarchical data

Yt : observed aggregate of allseries at time t.

YX,t : observation on series X attime t.

Bt : vector of all series atbottom level in time t.

Hierarchical data

Y t = [Yt, YA,t, YB,t, YC,t]′ =

1 1 11 0 00 1 00 0 1

YA,tYB,tYC,t

Hierarchical data

1 1 11 0 00 1 00 0 1

︸︷︷︸

YA,tYB,tYC,t

Hierarchical data

1 1 11 0 00 1 00 0 1

︸︷︷︸

YA,tYB,tYC,t

︸︷︷︸

Hierarchical data

1 1 11 0 00 1 00 0 1

︸︷︷︸

YA,tYB,tYC,t

︸︷︷︸

BtY t = SBt

Hierarchical dataTotal

AX AY AZ

BX BY BZ

CX CY CZ

YtYA,tYB,tYC,tYAX,tYAY,tYAZ,tYBX,tYBY,tYBZ,tYCX,tYCY,tYCZ,t

1 1 1 1 1 1 1 1 11 1 1 0 0 0 0 0 00 0 0 1 1 1 0 0 00 0 0 0 0 0 1 1 11 0 0 0 0 0 0 0 00 1 0 0 0 0 0 0 00 0 1 0 0 0 0 0 00 0 0 1 0 0 0 0 00 0 0 0 1 0 0 0 00 0 0 0 0 1 0 0 00 0 0 0 0 0 1 0 00 0 0 0 0 0 0 1 00 0 0 0 0 0 0 0 1

︸︷︷︸

YAX,tYAY,tYAZ,tYBX,tYBY,tYBZ,tYCX,tYCY,tYCZ,t

︸︷︷︸

AX AY AZ

BX BY BZ

CX CY CZ

1 1 1 1 1 1 1 1 11 1 1 0 0 0 0 0 00 0 0 1 1 1 0 0 00 0 0 0 0 0 1 1 11 0 0 0 0 0 0 0 00 1 0 0 0 0 0 0 00 0 1 0 0 0 0 0 00 0 0 1 0 0 0 0 00 0 0 0 1 0 0 0 00 0 0 0 0 1 0 0 00 0 0 0 0 0 1 0 00 0 0 0 0 0 0 1 00 0 0 0 0 0 0 0 1

︸︷︷︸

AX AY AZ

BX BY BZ

CX CY CZ

1 1 1 1 1 1 1 1 11 1 1 0 0 0 0 0 00 0 0 1 1 1 0 0 00 0 0 0 0 0 1 1 11 0 0 0 0 0 0 0 00 1 0 0 0 0 0 0 00 0 1 0 0 0 0 0 00 0 0 1 0 0 0 0 00 0 0 0 1 0 0 0 00 0 0 0 0 1 0 0 00 0 0 0 0 0 1 0 00 0 0 0 0 0 0 1 00 0 0 0 0 0 0 0 1

︸︷︷︸

Y t = SBt

Grouped dataTotal

YtYA,tYB,tYX,tYY,tYAX,tYAY,tYBX,tYBY,t

1 1 1 11 1 0 00 0 1 11 0 1 00 1 0 11 0 0 00 1 0 00 0 1 00 0 0 1

︸︷︷︸

YAX,tYAY,tYBX,tYBY,t

︸︷︷︸

Grouped dataTotal

1 1 1 11 1 0 00 0 1 11 0 1 00 1 0 11 0 0 00 1 0 00 0 1 00 0 0 1

︸︷︷︸

Grouped dataTotal

1 1 1 11 1 0 00 0 1 11 0 1 00 1 0 11 0 0 00 1 0 00 0 1 00 0 0 1

︸︷︷︸

Y t = SBt

Outline

3 Optimal forecasts

6 hts package for R

7 References

Forecasting hierarchical time series Forecasting framework 16

Forecasting notation

Let Yn(h) be vector of initial h-step forecasts,made at time n, stacked in same order as Y t.(They may not add up.)

Hierarchical forecasting methods of the form:Yn(h) = SPYn(h)

for some matrix P.

P extracts and combines base forecastsYn(h) to get bottom-level forecasts.S adds them upRevised reconciled forecasts: Yn(h).

for some matrix P.

Bottom-up forecasts

Yn(h) = SPYn(h)

Bottom-up forecasts are obtained using

P = [0 | I] ,

where 0 is null matrix and I is identity matrix.

P matrix extracts only bottom-levelforecasts from Yn(h)

S adds them up to give the bottom-upforecasts.

Bottom-up forecasts

Yn(h) = SPYn(h)

P = [0 | I] ,

Bottom-up forecasts

Yn(h) = SPYn(h)

P = [0 | I] ,

Top-down forecasts

Yn(h) = SPYn(h)

Top-down forecasts are obtained using

P = [p | 0]

where p = [p1, p2, . . . , pmK]′ is a vector of

proportions that sum to one.

P distributes forecasts of the aggregate tothe lowest level series.

Different methods of top-down forecastinglead to different proportionality vectors p.

Top-down forecasts

Yn(h) = SPYn(h)

P = [p | 0]

Top-down forecasts

Yn(h) = SPYn(h)

P = [p | 0]

General properties: bias

Yn(h) = SPYn(h)

Assume: base forecasts Yn(h) are unbiased:E[Yn(h)|Y1, . . . ,Yn] = E[Yn+h|Y1, . . . ,Yn]

Let Bn(h) be bottom level base forecastswith βn(h) = E[Bn(h)|Y1, . . . ,Yn].Then E[Yn(h)] = Sβn(h).We want the revised forecasts to be unbiased:E[Yn(h)] = SPSβn(h) = Sβn(h).Result will hold provided SPS = S.True for bottom-up, but not for any top-downmethod or middle-out method.

Yn(h) = SPYn(h)

General properties: variance

Yn(h) = SPYn(h)

Let variance of base forecasts Yn(h) be givenby

Σh = V[Yn(h)|Y1, . . . ,Yn]

Then the variance of the revised forecasts isgiven by

V[Yn(h)|Y1, . . . ,Yn] = SPΣhP′S′.

This is a general result for all existing methods.Forecasting hierarchical time series Forecasting framework 21

Yn(h) = SPYn(h)

Σh = V[Yn(h)|Y1, . . . ,Yn]

V[Yn(h)|Y1, . . . ,Yn] = SPΣhP′S′.

Yn(h) = SPYn(h)

Σh = V[Yn(h)|Y1, . . . ,Yn]

V[Yn(h)|Y1, . . . ,Yn] = SPΣhP′S′.

Outline

3 Optimal forecasts

6 hts package for R

7 References

Forecasting hierarchical time series Optimal forecasts 22

Forecasts

Key idea: forecast reconciliationå Ignore structural constraints and forecast

every series of interest independently.

å Adjust forecasts to impose constraints.

Let Yn(h) be vector of initial h-step forecasts,made at time n, stacked in same order as Y t.

Y t = SBt . So Yn(h) = Sβn(h) + εh .

βn(h) = E[Bn+h | Y1, . . . ,Yn].εh has zero mean and covariance Σh.Estimate βn(h) using GLS?

Forecasts

Optimal combination forecasts

Yn(h) = Sβn(h) = S(S′Σ†hS)−1S′Σ†hYn(h)

Σ†h is generalized inverse of Σh.

Optimal P = (S′Σ†hS)−1S′Σ†h

Revised forecasts unbiased: SPS = S.Revised forecasts minimum variance:

V[Yn(h)|Y1, . . . ,Yn] = SPΣhP′S′

= S(S′Σ†hS)−1S′

Problem: Σh hard to estimate.Forecasting hierarchical time series Optimal forecasts 24

Initial forecasts

V[Yn(h)|Y1, . . . ,Yn] = SPΣhP′S′

Revised forecasts Initial forecasts

V[Yn(h)|Y1, . . . ,Yn] = SPΣhP′S′

Outline

3 Optimal forecasts

6 hts package for R

7 References

Forecasting hierarchical time series Approximately optimal forecasts 25

Yn(h) = S(S′Σ†hS)−1S′Σ†hYn(h)

Revised forecasts Base forecasts

Solution 1: OLSAssume εh ≈ SεB,h where εB,h is theforecast error at bottom level.

Then Σh ≈ SΩhS′ where Ωh = V(εB,h).

If Moore-Penrose generalized inverse used,then (S′Σ†hS)

−1S′Σ†h = (S′S)−1S′.

Yn(h) = S(S′S)−1S′Yn(h)Forecasting hierarchical time series Approximately optimal forecasts 26

−1S′Σ†h = (S′S)−1S′.

Yn(h) = S(S′S)−1S′Yn(h)

GLS = OLS.

Optimal weighted average of initialforecasts.

Optimal reconciliation weights areS(S′S)−1S′.

Weights are independent of the data andof the covariance structure of thehierarchy!

GLS = OLS.

Yn(h) = S(S′S)−1S′Yn(h)Total

Weights:

S(S′S)−1S′ =

0.75 0.25 0.25 0.250.25 0.75 −0.25 −0.250.25 −0.25 0.75 −0.250.25 −0.25 −0.25 0.75

AA AB AC

BA BB BC

CA CB CC

Weights: S(S′S)−1S′ =

0.69 0.23 0.23 0.23 0.08 0.08 0.08 0.08 0.08 0.08 0.08 0.08 0.080.23 0.58 −0.17 −0.17 0.19 0.19 0.19 −0.06 −0.06 −0.06 −0.06 −0.06 −0.060.23 −0.17 0.58 −0.17 −0.06 −0.06 −0.06 0.19 0.19 0.19 −0.06 −0.06 −0.060.23 −0.17 −0.17 0.58 −0.06 −0.06 −0.06 −0.06 −0.06 −0.06 0.19 0.19 0.190.08 0.19 −0.06 −0.06 0.73 −0.27 −0.27 −0.02 −0.02 −0.02 −0.02 −0.02 −0.020.08 0.19 −0.06 −0.06 −0.27 0.73 −0.27 −0.02 −0.02 −0.02 −0.02 −0.02 −0.020.08 0.19 −0.06 −0.06 −0.27 −0.27 0.73 −0.02 −0.02 −0.02 −0.02 −0.02 −0.020.08 −0.06 0.19 −0.06 −0.02 −0.02 −0.02 0.73 −0.27 −0.27 −0.02 −0.02 −0.020.08 −0.06 0.19 −0.06 −0.02 −0.02 −0.02 −0.27 0.73 −0.27 −0.02 −0.02 −0.020.08 −0.06 0.19 −0.06 −0.02 −0.02 −0.02 −0.27 −0.27 0.73 −0.02 −0.02 −0.020.08 −0.06 −0.06 0.19 −0.02 −0.02 −0.02 −0.02 −0.02 −0.02 0.73 −0.27 −0.270.08 −0.06 −0.06 0.19 −0.02 −0.02 −0.02 −0.02 −0.02 −0.02 −0.27 0.73 −0.270.08 −0.06 −0.06 0.19 −0.02 −0.02 −0.02 −0.02 −0.02 −0.02 −0.27 −0.27 0.73

AA AB AC

BA BB BC

CA CB CC

Weights: S(S′S)−1S′ =

0.69 0.23 0.23 0.23 0.08 0.08 0.08 0.08 0.08 0.08 0.08 0.08 0.080.23 0.58 −0.17 −0.17 0.19 0.19 0.19 −0.06 −0.06 −0.06 −0.06 −0.06 −0.060.23 −0.17 0.58 −0.17 −0.06 −0.06 −0.06 0.19 0.19 0.19 −0.06 −0.06 −0.060.23 −0.17 −0.17 0.58 −0.06 −0.06 −0.06 −0.06 −0.06 −0.06 0.19 0.19 0.190.08 0.19 −0.06 −0.06 0.73 −0.27 −0.27 −0.02 −0.02 −0.02 −0.02 −0.02 −0.020.08 0.19 −0.06 −0.06 −0.27 0.73 −0.27 −0.02 −0.02 −0.02 −0.02 −0.02 −0.020.08 0.19 −0.06 −0.06 −0.27 −0.27 0.73 −0.02 −0.02 −0.02 −0.02 −0.02 −0.020.08 −0.06 0.19 −0.06 −0.02 −0.02 −0.02 0.73 −0.27 −0.27 −0.02 −0.02 −0.020.08 −0.06 0.19 −0.06 −0.02 −0.02 −0.02 −0.27 0.73 −0.27 −0.02 −0.02 −0.020.08 −0.06 0.19 −0.06 −0.02 −0.02 −0.02 −0.27 −0.27 0.73 −0.02 −0.02 −0.020.08 −0.06 −0.06 0.19 −0.02 −0.02 −0.02 −0.02 −0.02 −0.02 0.73 −0.27 −0.270.08 −0.06 −0.06 0.19 −0.02 −0.02 −0.02 −0.02 −0.02 −0.02 −0.27 0.73 −0.270.08 −0.06 −0.06 0.19 −0.02 −0.02 −0.02 −0.02 −0.02 −0.02 −0.27 −0.27 0.73

Features

Forget “bottom up” or “top down”. Thisapproach combines all forecasts optimally.

Method outperforms bottom-up andtop-down, especially for middle levels.

Covariates can be included in initial forecasts.

Adjustments can be made to initial forecastsat any level.

Very simple and flexible method. Can workwith any hierarchical or grouped time series.

Conceptually easy to implement: OLS onbase forecasts.

Features

Challenges

Computational difficulties in bighierarchies due to size of the S matrix andnon-singular behavior of (S′S).Need to estimate covariance matrix toproduce prediction intervals.Assumption might be unrealistic.Ignores covariance matrix in computingpoint forecasts.

Challenges

Solution 2: RescalingSuppose we rescale the original forecastsby Λ, reconcile using OLS, and backscale:

Y∗n(h) = S(S′Λ2S)−1S′Λ2Yn(h).

If Λ =(Σ†h)1/2

, we get the GLS solution.

Approximately optimal solution:

Λ = diagonal(Σ†1)1/2

That is, Λ contains inverse one-stepforecast standard deviations.

If Λ =(Σ†h)1/2

Solution 3: AveragingIf the bottom level error series areapproximately uncorrelated and havesimilar variances, then Λ is inverselyproportional to the number of seriesmaking up each element of Y.

So set Λ to be the inverse row sums of S.

Then ΛYn(h) is the average at each noderather than the sum at each node.

Y∗n(h) = S(S′Λ2S)−1S′Λ2Yn(h)

Outline

3 Optimal forecasts

6 hts package for R

7 References

Forecasting hierarchical time series Application to Australian tourism 34

Application to Australian tourism

Quarterly data on visitor nightsDomestic visitor nightsfrom 1998 – 2006Data from: National Visitor Survey,based on annual interviews of 120,000Australians aged 15+, collected byTourism Research Australia.

Also split by purpose of travel:

Holiday

Visits to friends and relatives

Business

Exponential smoothing methods

Seasonal ComponentTrend N A M

Component (None) (Additive) (Multiplicative)

N (None) N,N N,A N,M

A (Additive) A,N A,A A,M

Ad (Additive damped) Ad,N Ad,A Ad,M

M (Multiplicative) M,N M,A M,M

Md (Multiplicative damped) Md,N Md,A Md,M

N,N: Simple exponential smoothing

N,N: Simple exponential smoothingA,N: Holt’s linear method

N,N: Simple exponential smoothingA,N: Holt’s linear methodAd,N: Additive damped trend method

N,N: Simple exponential smoothingA,N: Holt’s linear methodAd,N: Additive damped trend methodM,N: Exponential trend method

N,N: Simple exponential smoothingA,N: Holt’s linear methodAd,N: Additive damped trend methodM,N: Exponential trend methodMd,N: Multiplicative damped trend method

N,N: Simple exponential smoothingA,N: Holt’s linear methodAd,N: Additive damped trend methodM,N: Exponential trend methodMd,N: Multiplicative damped trend methodA,A: Additive Holt-Winters’ method

N,N: Simple exponential smoothingA,N: Holt’s linear methodAd,N: Additive damped trend methodM,N: Exponential trend methodMd,N: Multiplicative damped trend methodA,A: Additive Holt-Winters’ methodA,M: Multiplicative Holt-Winters’ method

There are 15 separate exponentialsmoothing methods.

There are 15 separate exponentialsmoothing methods.Each can have an additive or multiplicativeerror, giving 30 separate models.

General notation E T S : ExponenTial Smoothing

Examples:A,N,N: Simple exponential smoothing with additive errorsA,A,N: Holt’s linear method with additive errorsM,A,M: Multiplicative Holt-Winters’ method with multiplicative errorsForecasting hierarchical time series Application to Australian tourism 37

General notation E T S : ExponenTial Smoothing

Examples:A,N,N: Simple exponential smoothing with additive errorsA,A,N: Holt’s linear method with additive errorsM,A,M: Multiplicative Holt-Winters’ method with multiplicative errorsForecasting hierarchical time series Application to Australian tourism 37

General notation E T S : ExponenTial Smoothing↑

TrendExamples:

A,N,N: Simple exponential smoothing with additive errorsA,A,N: Holt’s linear method with additive errorsM,A,M: Multiplicative Holt-Winters’ method with multiplicative errorsForecasting hierarchical time series Application to Australian tourism 37

General notation E T S : ExponenTial Smoothing↑

Trend SeasonalExamples:

General notation E T S : ExponenTial Smoothing ↑

Error Trend SeasonalExamples:

Innovations state space models

å All ETS models can be written ininnovations state space form (IJF, 2002).

å Additive and multiplicative versions givethe same point forecasts but differentprediction intervals.

Automatic forecasting

From Hyndman et al. (IJF, 2002):

Apply each of 30 models that areappropriate to the data. Optimizeparameters and initial values using MLE(or some other criterion).Select best method using AIC:

AIC = −2 log(Likelihood) + 2pwhere p = # parameters.Produce forecasts using best method.Obtain prediction intervals usingunderlying state space model.

Base forecasts

Domestic tourism forecasts: Total

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: NSW

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: VIC

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: Nth.Coast.NSW

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: Metro.QLD

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: Sth.WA

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: X201.Melbourne

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: X402.Murraylands

1998 2000 2002 2004 2006 2008

Base forecasts

Domestic tourism forecasts: X809.Daly

1998 2000 2002 2004 2006 2008

Hierarchy: states, zones, regions

Forecast Horizon (h)MAPE 1 2 4 6 8 Average

Top Level: Australia

Bottom-up 3.79 3.58 4.01 4.55 4.24 4.06OLS 3.83 3.66 3.88 4.19 4.25 3.94Scaling 3.68 3.56 3.97 4.57 4.25 4.04Averaging 3.76 3.60 4.01 4.58 4.22 4.06

Level 1: States

Based on a rolling forecast origin with at least 12 observations in thetraining set.

Hierarchy: states, zones, regions

Level 2: Zones

Bottom Level: Regions

Groups: Purpose, states, capital

Top Level: Australia

Level 1: Purpose of travel

Groups: Purpose, states, capital

Level 2: States

Bottom Level: Capital city versus other

Outline

3 Optimal forecasts

6 hts package for R

7 References

Forecasting hierarchical time series hts package for R 44

hts package for R

hts: Hierarchical and grouped time seriesMethods for analysing and forecasting hierarchical and groupedtime series

Version: 3.01Depends: forecastImports: SparseMPublished: 2013-05-07Author: Rob J Hyndman, Roman A Ahmed, and Han Lin ShangMaintainer: Rob J Hyndman <Rob.Hyndman at monash.edu>License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]

Example using Rlibrary(hts)

# bts is a matrix containing the bottom level time series# g describes the grouping/hierarchical structurey <- hts(bts, g=c(1,1,2,2))

# Forecast 10-step-ahead using optimal combination method# ETS used for each series by defaultfc <- forecast(y, h=10)

# Forecast 10-step-ahead using OLS combination method# ETS used for each series by defaultfc <- forecast(y, h=10)

# Select your own methodsally <- allts(y)allf <- matrix(, nrow=10, ncol=ncol(ally))for(i in 1:ncol(ally))

allf[,i] <- mymethod(ally[,i], h=10)allf <- ts(allf, start=2004)# Reconcile forecasts so they add upfc2 <- combinef(allf, Smatrix(y))

hts functionUsagehts(y, g)gts(y, g, hierarchical=FALSE)

Argumentsy Multivariate time series containing the bot-

tom level seriesg Group matrix indicating the group structure,

with one column for each series when com-pletely disaggregated, and one row for eachgrouping of the time series.

hierarchical Indicates if the grouping matrix should betreated as hierarchical.

Detailshts is simply a wrapper for gts(y,g,TRUE). Both return anobject of class gts.

forecast.gts functionUsageforecast(object, h,method = c("comb", "bu", "mo", "tdgsf", "tdgsa", "tdfp", "all"),fmethod = c("ets", "rw", "arima"), level, positive = FALSE,xreg = NULL, newxreg = NULL, ...)

Argumentsobject Hierarchical time series object of class gts.h Forecast horizonmethod Method for distributing forecasts within the hierarchy.fmethod Forecasting method to uselevel Level used for "middle-out" method (when method="mo")positive If TRUE, forecasts are forced to be strictly positivexreg When fmethod = "arima", a vector or matrix of external re-

gressors, which must have the same number of rows as theoriginal univariate time series

newxreg When fmethod = "arima", a vector or matrix of external re-gressors, which must have the same number of rows as theoriginal univariate time series

... Other arguments passing to ets or auto.arima

Utility functions

allts(y) Returns all series in thehierarchy

Smatrix(y) Returns the summing matrix

combinef(f) Combines initial forecastsoptimally.

More information

Vignette on CRAN

Outline

3 Optimal forecasts

6 hts package for R

7 References

Forecasting hierarchical time series References 53

References

RJ Hyndman, RA Ahmed, G Athanasopoulos, andHL Shang (2011). “Optimal combinationforecasts for hierarchical time series”.Computational Statistics and Data Analysis55(9), 2579–2589

RJ Hyndman, RA Ahmed, and HL Shang (2013).hts: Hierarchical time series.cran.r-project.org/package=hts.

RJ Hyndman and G Athanasopoulos (2013).Forecasting: principles and practice. OTexts.OTexts.org/fpp/.

References

RJ Hyndman, RA Ahmed, G Athanasopoulos, andHL Shang (2011). “Optimal combinationforecasts for hierarchical time series”.Computational Statistics and Data Analysis55(9), 2579–2589

RJ Hyndman, RA Ahmed, and HL Shang (2013).hts: Hierarchical time series.cran.r-project.org/package=hts.

RJ Hyndman and G Athanasopoulos (2013).Forecasting: principles and practice. OTexts.OTexts.org/fpp/.

å Papers and R code:

robjhyndman.com

å Email: [email protected]

forecasting hierarchical time series - rob j hyndman · pdf fileoutline 1 hierarchical time...

Documents