group a report

97
Group Assignment Questions 1 and 2 Name: Lee Rock. Jazmen Boone, Bas Süß, Henry Doku Student Number: 20038860, 20070746, 20069935, 20044757 Department: Department of Graduate Business Course: MSc in Global Financial Information Systems Module: Econometrics Presented To: Dr Tom Egan Assignment 1 of 1

Upload: lee-rock

Post on 26-Jan-2017

67 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Group A Report

Group Assignment

Questions 1 and 2

Name: Lee Rock. Jazmen Boone, Bas Süß, Henry Doku

Student Number: 20038860, 20070746, 20069935, 20044757

Department: Department of Graduate Business

Course: MSc in Global Financial Information Systems

Module: Econometrics

Presented To: Dr Tom Egan

Assignment 1 of 1

Signature:______________________________ Date:__________________________________

Signature:______________________________ Date:__________________________________

Signature:______________________________ Date:__________________________________

Signature:______________________________ Date:__________________________________

Page 2: Group A Report

We firmly declare that this is assignment was completed by our own accord and to the best of my abilities in accordance with the plagiarism regulations and in line with the standards of academia set out by Waterford Institute of Technology.

Table of Contents1.0 Introduction.....................................................................................................................................1

1.1 Definition of Terms......................................................................................................................1

2.0 Question 1.......................................................................................................................................2

2.1 Part A...........................................................................................................................................2

2.1.1 USA Descriptive Statistics Analysis........................................................................................5

2.1.2 Netherlands Descriptive Statistics Analysis...........................................................................6

2.2 Part B...........................................................................................................................................7

2.2.1 Consumption........................................................................................................................7

2.2.2 Investment............................................................................................................................8

2.2.3 Government Spending..........................................................................................................8

2.2.4 Net Exports...........................................................................................................................8

2.3 Part C...........................................................................................................................................8

2.3.1 USA OLS Regressions............................................................................................................8

2.3.2 USA Multiple Regression.....................................................................................................13

2.3.3 Netherlands OLS Regressions.............................................................................................15

2.3.4 Netherlands Multiple Regression........................................................................................20

2.4 Part D.........................................................................................................................................22

2.4.1 Autocorrelation Testing for USA.........................................................................................22

2.4.2 Autocorrelation Testing Netherlands..................................................................................24

2.4.3 Normal Distribution for the USA Multiple Regression Model Residuals.............................26

2.4.4 Normal Distribution for the Netherlands Multiple Regression Model Residuals................27

2.5 Part E.........................................................................................................................................29

2.5.1 Specification Error of the US model....................................................................................29

2.6 Part F.........................................................................................................................................30

2.6.1 Adding the Dummy Variable...............................................................................................30

2.6.2 Interaction Effect................................................................................................................31

2.7 Part G.........................................................................................................................................32

2.7.1 Lag of Government Spending for the USA..........................................................................32

2.7.2 Lag of Government Spending for the Netherlands.............................................................34

2.8 Part H.........................................................................................................................................36

Page 3: Group A Report

2.8.1 Forecasting GDP for the USA..............................................................................................36

2.8.2 Forecasting GDP for the Netherlands.................................................................................39

3.0 Question 2.....................................................................................................................................42

Equation 1 Open Economy GDP Theoretical Function...........................................................................7Equation 2 Standard Format for USA FDI OLS Regression Output.........................................................9Equation 3 Standard Format for USA Government Expenditure OLS Regression Output....................10Equation 4 Standard Format for USA Household Consumption OLS Regression Output.....................12Equation 5 Standard Format for USA Net Exports OLS Regression Output.........................................13Equation 6 Standard Format for USA Multiple Regression Output......................................................15Equation 7 Standard Format for Netherlands FDI OLS Regression Output..........................................16Equation 8 Standard Format for Netherlands Government Expenditure OLS Regression Output.......17Equation 9 Standard Format for Netherlands Household Consumption OLS Regression Output........18Equation 10 Standard Format for Netherlands Net Exports OLS Regression Output..........................20Equation 11 Standard Format for Netherlands Multiple Regression Output.......................................22Equation 12 Jarque Bera Equation for the USA Multiple Regression Model.......................................27Equation 13 Jarque Bera Equation for the Netherlands Multiple Regression Model..........................29Equation 14 Ramsey Reset Equation for USA......................................................................................30Equation 15 Standard Format for USA Two Period Distributive Lag of Government Spending Regression on GDP (Current US$)........................................................................................................34Equation 16 Standard Format for Netherlands Two Period Distributive Lag of Government Spending Regression on GDP (Current US$)........................................................................................................35

Figure 1 Bar Chart of USA GDP (Current US$)........................................................................................3Figure 2 Bar Chart of Netherland GDP (Current US$)............................................................................3Figure 3 Boxplot of USA GDP (Current US$)..........................................................................................4Figure 4 Boxplot of Netherlands GDP (Current US$).............................................................................4Figure 5 Overall Residual Plot for the USA Model...............................................................................23Figure 6 Overall Residual Plot for the Netherlands Model..................................................................24Figure 7 Histogram of USA Multiple Regression Residuals..................................................................26Figure 8 Histogram of the Netherlands Multiple Regression Residuals...............................................28Figure 9 Line Graph of US GDP (Current US$) from 1970-2013...........................................................37Figure 10 Line Graph of the Natural Log US GDP (Current US$) from 1970-2013...............................37Figure 11 Line Graph of the Change in US GDP (Current US$) from 1970-2013..................................38Figure 12 Line Graph of the change in the Natural Log of US GDP (Current US$) from 1970-2013.....38Figure 13 Line Graph of the Netherlands GDP (Current US$) from 1970-2013...................................40Figure 14 Line Graph of the Natural Log of the Netherlands GDP (Current US$) from 1970-2013......40Figure 15 Line Graph of the Change in the Netherlands GDP (Current US$) from 1970-2013.............41Figure 16 Line Graph of the Change in the Natural Log of the Netherlands GDP (Current US$) from 1970-2013...........................................................................................................................................41

Table 1 Descriptive Statistics of the USA and Netherlands GDP (Current US$).....................................2

Page 4: Group A Report

Table 2 Normal Distribution for USA GDP (Current US$).......................................................................5Table 3 Normal Distribution for Netherlands GDP (Current US$)..........................................................5Table 4 Correlation Matrix for the USA Multiple Regression Model....................................................14Table 5 Collinearity Diagnostics for the USA Multiple Regression Model............................................15Table 6 Correlation Matrix for the Netherlands Multiple Regression Model.......................................21Table 7 Collinearity Diagnostics for the Netherland Multiple Regression Model................................22Table 8 Model Summary for the USA Multiple Regression Model.......................................................23Table 9 Model Summary for the Netherlands Multiple Regression Model..........................................24Table 10 USA Multiple Regression Residual Descriptive Statistics.......................................................26Table 11 Netherlands Multiple Regression Residual Descriptive Statistics..........................................28Table 12 Coefficients and Significance Values for the USA Government Spending Lag Model............33Table 13 Collinearity Diagnostics for the USA Government Spending Lag Model...............................33Table 14 Coefficients and Significance Values for the Netherlands Government Spending Lag Model.............................................................................................................................................................35Table 15 Collinearity Diagnostics for the Netherlands Government Spending Lag Model..................35Table 16 Autocorrelation Functions for USA GDP (Current US$).........................................................36Table 17 Autocorrelation Functions for Netherlands GDP (Current US$)............................................39Table 18 Cumulative Abnormal Returns for Individual Companies.....................................................45Table 19 Cumulative Abnormal Returns for Portfolio..........................................................................46

Page 5: Group A Report

1.0 IntroductionThe following assignment has been completed Lee Rock, Jazmen Boone, Henry Doku, and

Bas Süß, consisting of two questions. The first question is regarding a regression analysis of

two chosen countries (United States of America and the Netherlands) to analyse, compare

and interpret these results. The second question is an event study based on analysing merger

and acquisition data for the year 2011. Data used for question one has been sourced from the

World Bank’s database and the lecturer has provided data for question two internally.

Appendix B contains the power point presentation concerning question 1 that the group

delivered on the 1st of December 2015.

1.1 Definition of TermsUS or USA: United States of America

NETH or Neth: Netherlands

STD Dev: Standard Deviation

GDP: Gross Domestic Product at Current Prices Denominated in US dollars

Max: Maximum

Min: Minimum

FDI: Foreign Direct Investment

Y or y: Dependent Variable

X or x: Independent Variable

Dum: Dummy Variable

GDP: Gross domestic product at current US dollars

C: Consumption (Household final consumption expenditure, etc. at current US dollars)

I: Investment (Foreign direct investment, net inflows, balance of payments at current US dollars)

G: Government Expenditure (General government final consumption expenditure at current US dollars)

1

Page 6: Group A Report

NX: Net Exports (Exports less imports at current US dollars)

X: Exports (Exports of goods and services at current US dollars)

M: Imports (Imports of goods and services at current US dollars)

2.0 Question 1

2.1 Part A USA GDP (current US$) NETH GDP (current US$)

Mean 7.38E+12 Mean 3.79E+11Standard Error 7.41E+11 Standard Error 4.1E+10Median 6.36E+12 Median 3.35E+11Mode #N/A Mode #N/AStandard Deviation 4.91E+12 Standard Deviation 2.72E+11Sample Variance 2.41E+25 Sample Variance 7.41E+22Kurtosis -1.12598 Kurtosis -0.7363Skewness 0.43018 Skewness 0.686606Range 1.57E+13 Range 8.94E+11Minimum 1.08E+12 Minimum 3.75E+10Maximum 1.68E+13 Maximum 9.31E+11Sum 3.25E+14 Sum 1.67E+13Count 44 Count 44Largest(1) 1.68E+13 Largest(1) 9.31E+11Smallest(1) 1.08E+12 Smallest(1) 3.75E+10Confidence Level (95.0%) 1.49E+12 Confidence Level (95.0%) 8.28E+10Table 1 Descriptive Statistics of the USA and Netherlands GDP (Current US$)

2

Page 7: Group A Report

Figure 1 Bar Chart of USA GDP (Current US$)

Figure 2 Bar Chart of Netherland GDP (Current US$)

3

Page 8: Group A Report

Figure 3 Boxplot of USA GDP (Current US$)

Figure 4 Boxplot of Netherlands GDP (Current US$)

4

Page 9: Group A Report

Normal Distribution for US GDPDeviations 1 Deviation 2 Deviations 3 DeviationsTails Tail 1 Tail 2 Tail 1 Tail 2 Tail 1 Tail 2Max/Min 1.23E+13 -2.5E+12 1.72E+13 -2.5E+12 2.21E+13 -7.4E+12Count 35 44 44Sample 45 45 45STD Dev 78% 98% 98%Table 2 Normal Distribution for USA GDP (Current US$)

Normal Distribution of Netherland GDPDeviations 1 Deviation 2 Deviations 3 DeviationsTails Tail 1 Tail 2 Tail 1 Tail 2 Tail 1 Tail 2Max/Min 6.51E+11 -1.1E+11 9.23E+11 -1.7E+11 1.2E+12 -4.4E+11Count 35 43 44Sample 45 45 45STD Dev 78% 96% 98%Table 3 Normal Distribution for Netherlands GDP (Current US$)

2.1.1 USA Descriptive Statistics Analysis The mean is the standard arithmetic average of all the variables added up and divided by the

amount, in this case the mean GDP at current prices in US dollars for the USA between1970-

2013 is 7.3775E+12.

The median is the middle number in a given data set when arranged in chronological order;

the median for this particular data set is 6.35668E+12.

The mode is the most common number to appear in the dataset, the mode cannot be

calculated in this case as no GDP-Value at current prices in US dollars has replicated itself

between 1970 -2013.

The mean and median are not equal with a difference of 1.02083E+12 which suggests

skewness in the normal distribution of the data. The skewness figure of 0.430179959

suggests a significant skewness to the right due to the value being positive. This is consistent

with the boxplot diagram as it can be seen the upper whisker is longer than the bottom

whisker indicating skewness to the right. The boxplot also identifies the median which

appears to be lower than the centre of the box. This indicates that more of the data is on the

upper side of the median rather than below it which may further indicate skewness to the

right. The boxplot does not show any outliers "o" or extreme outliers "*". The histogram

further backs up this significant skewness with 9 bars to the right of the highest bar compared

5

Page 10: Group A Report

to one on the left. No bars appear to pass the normal distribution line when it collapses

suggesting there are no outliers in the data.

For a dataset to fit a normal distribution 68% of the data occurs within one standard deviation

from the mean, a standard deviation is a measure of the average variance of data from the

mean itself. Also, 95% of the data should be within 2 standard deviations and practically

100% should rest within 3 standard deviations.

As can be seen from the table the USA GDP at current prices in US dollars, 78% occurs

within one deviation, 98% within two and 98% within three suggesting the data is normally

distributed but skewed to the right in lieu of previous arguments made.

The heavy significant skewness to the right suggests that the USA GDP at current prices in

US dollars generally increases year on year without any large sigma or unusual events that

cause an abnormal increase or decrease in the variable.

2.1.2 Netherlands Descriptive Statistics AnalysisThe mean is the standard arithmetic average of all the variables added up and divided by the

amount, in this case the mean GDP at current prices in US dollars for the Netherland

between1970-2013 is 3.78828E+11.

The median is the middle number in a given data set when arranged in chronological order;

the median for this particular data set is 3.34864E+11.

The mode is the most common number to appear in the dataset, the mode cannot be

calculated in this case as no GDP-Value at current prices in US dollars has replicated itself

twice between 1970 -2013.

The mean and median are not equal with a difference of 43964145552, which suggests

skewness in the normal distribution of the data. The skewness figure of 0.686605536

suggests a heavy skewness to the right due to the value being positive and quite close to 1.

This is consistent with the boxplot diagram as it can be seen the upper whisker is longer than

the bottom whisker indicating skewness to the right. The boxplot also identifies the median

which appears to be centred in the box indicating an even spread of data both above and

below the median. The boxplot does not show any outliers "o" or extreme outliers "*". This

histogram further back up this heavy skewness with 8 bars to the right of the highest bar

6

Page 11: Group A Report

compared to one on the left. No bars appear to pass the normal distribution line when it

collapses suggesting there are no outliers in the data.

For a dataset to fit a normal distribution 68% of the data occurs within one standard deviation

from the mean, a standard deviation is a measure of the average variance of data from the

mean itself. 95% should be within 2 standard deviations and practically 100% should rest

within 3 standard deviations.

As can be seen from the table the Netherlands GDP at current prices in US dollars, 78%

occurs within one deviation, 96% within two and 98% within three suggesting the data is

normally distributed but skewed to the right in lieu of previous arguments made.

The heavy skewness to the right suggests that the Netherlands GDP at current prices in US

dollars generally increases year on year without any large sigma or unusual events that cause

an abnormal increase or decrease in the variable.

2.2 Part BAll of the variables selected for the model have be denominated in US dollars to allow for

greater consistency during interpretation between the two datasets. The chosen variables have

been selected at current prices for the year 2014. Current prices can be defined as “prices of

the current reporting period” (OECD.com). Again this will allow for greater consistency

during interpretation between the two datasets.

GDP at current prices denominated in US dollars (GDP) has been chosen for the Y variable.

This GDP type was chosen for the reasons outlined above and GDP has been identified as the

mandatory variable in answering question 1.

Standard economic theory presents the GDP Equation as:

GDP=C+ I+G+NX(X−M )

Equation 1 Open Economy GDP Theoretical Function

We decided to test this theory by regressing GDP (Y) on consumption (X1), investment (X2),

government spending (X3) and net exports i.e. exports less imports (X4). As the equation

suggests we expect all variables will have a positive slope against GDP.

2.2.1 ConsumptionHousehold final consumption expenditure i.e. consumption (C) covers all purchase made by

resident households (home and abroad) to meet their everyday needs such as food, clothing,

7

Page 12: Group A Report

housing service (rents), energy, transport, durables goods, health, leisure and miscellaneous.

It also includes a number of imputed expenditures, for example agricultural products

produced for own-consumption.

2.2.2 InvestmentForeign direct investment, net inflows i.e. investment (I) has been used as a proxy for

investment as we were unable to source a more accurate investment variable for general

investment. FDI net inflows are inward direct investment made investors who do not reside in

the respective economy.

2.2.3 Government Spending General government final consumption expenditure i.e. government spending (G) includes all

government consumption, investment, and transfer payments. The acquisition by

governments of goods and services are used to directly satisfy the collective needs of the

respective population.

2.2.4 Net Exports Net exports (NX) have been derived by taking imports of goods and services from exports of

goods and services.

2.3 Part C

2.3.1 USA OLS RegressionsFull regression results for part C regarding the USA can be found in appendix A.

2.3.1.1 FDI The sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

the simple linear regression model had a high r-squared of 0.747 or 74.7% telling us that FDI

explains 74.7% of the variation in GDP. The adjusted r-square (which considers sample size)

is extremely close to the r square with a value of 0.741, which shows that this model is so far

useful. More importantly, the r-squared value is statistically significant, using the p-value

approach, as it has a p-value of 3.94E-14. This p-value surpasses the 1% standard so we can

conclude that the overall regression model is statistically significant at the 1% level.

After regressing GDP (y) on FDI (x), FDI's slope coefficient was 38.74517. This is

interpreted as GDP (US$) is expected to increase by 38.74517 for each dollar rise of FDI.

8

Page 13: Group A Report

We assessed FDI's significance level through the three statistical tests where it passed all

three evaluations: p-value, confidence interval, and t-test. The slope coefficient is statistically

significant with a p-value of 3.94E-14, so the model is very significant at a 99% level of

confidence (1% level of significance). The confidence intervals of the slope coefficient

further justifies significance of the variable, FDI, because zero is not in the range; thus, the

regression output shows that we can be 95% confident that the unknown population slope lies

between 31.73295 and 45.75738. Since zero is not in the confidence interval, we can

therefore be 95% confident that the real population slope is not zero and that there is a

positive relationship between the variables GDP and FDI. Lastly with a critical value of 1.98

with a 5% assumed level of confidence (two tailed), the regression output creates a t-stat of

11.15067 for the slope coefficient of FDI. The t stat of 11.15067 is greater than the critical

value 1.98 (11.15067 > 1.98), so there is a 5% level of significance as we are 95% confident

to pass the t-test. This adds more support that the slope coefficient is statistically significant

overall.

Equation 2 Standard Format for USA FDI OLS Regression Output

2.3.1.2 Government ExpenditureThe sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

the simple linear regression model had a very high r-squared of 0.990 or 99.0% telling us that

government consumption expenditure explains 99.0% of the variation in GDP. The high r-

squared tells us that it is a good model because it is close to an ideal value of 100%. The

adjusted r-square (which considers sample size) is extremely close to the r square with a

value of 0.990 demonstrating that this model is so far useful. More importantly, the r-squared

value is statistically significant, using the p-value approach, as the model’s p-value is 1.75E-

44. This p-value surpasses the 1% standard so we can conclude that the overall regression

model is statistically significant at the 1% level. Although the r-square is significantly high

for this model, it is important to note that the presence of outliers may have reduced the r-

9

Page 14: Group A Report

square value (-2.69 and -2.62 is outside of the +/- 2.5 outlier rule). There is an option to

regress the output of the model without the outliers (observation 43 and 44) to see its impact

on the overall regression model and its r-square value to help make the regression a better

one.

After regressing GDP (y) on government consumption expenditure (x), government

consumption expenditure's slope coefficient was 6.403442. This is interpreted as GDP (US$)

is expected to increase by 6.403442 for each dollar rise of government consumption

expenditure.

We assessed government consumption expenditure's significance level through the three

statistical tests where it passed all three evaluations: p-value, confidence interval, and t-test.

The slope coefficient is statistically significant with a p-value of 1.75E-44, so the model is

very significant at a 99% level of confidence (1% level of significance). The confidence

intervals of the slope coefficient further justifies significance of the variable, government

consumption expenditure, because zero is not in the range; thus, the regression output shows

that we can be 95% confident that the unknown population slope lies between 6.212212 and

6.594671. Since zero is not in the confidence interval, we can therefore be 95% confident that

the real population slope is not zero and that there is a positive relationship between the

variables GDP and government consumption expenditure. Lastly with a critical value of 1.98

with a 5% assumed level of confidence (two tailed), the regression output creates a t-stat of

67.57683 for the slope coefficient of government consumption expenditure. The t stat of

67.57683 is greater than the critical value 1.98 (67.57683 > 1.98), so there is a 5% level of

significance or a 95% confidence level to pass the t-test. This adds more support that the

slope coefficient is statistically significant overall.

Equation 3 Standard Format for USA Government Expenditure OLS Regression Output

10

Page 15: Group A Report

2.3.1.3 Household Consumption The sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

the simple linear regression model had a very high r-squared of 0.999 or 99.9% telling us that

household consumption expenditure explains 99.9% of the variation in GDP. The high r-

squared tells us that it is a good model because it is close to an ideal value of 100%. The

adjusted r-square (which considers sample size) is extremely close to the r-square with a

value of 0.999, which shows that this model is so far useful. More importantly, the r-squared

value is statistically significant, using the p-value approach, as its p-value is 2.13E-68. This

p-value surpasses the 1% standard so we can conclude that the overall regression model is

statistically significant at the 1% level.

After regressing GDP (y) on net exports (x), house consumption expenditure's slope

coefficient was 1.443829. This is interpreted as GDP (US$) is expected to increase by

1.443829 for each dollar rise of house consumption expenditure.

We assessed house consumption expenditure's significance level through the three statistical

tests where it passed all three evaluations: p-value, confidence interval, and t-test. The slope

coefficient is statistically significant with a p-value of 2.49E-30, so the model is very

significant at a 99% level of confidence (1% level of significance). The confidence intervals

of the slope coefficient further justifies significance of the variable, house consumption

expenditure, because zero is not in the range; thus, the regression output shows that we can be

95% confident that the unknown population slope lies between 1.432256 and 1.455403. Since

zero is not in the confidence interval, we can therefore be 95% confident that the real

population slope is not zero and that there is a positive relationship between the variables

GDP and house consumption expenditure. Lastly with a critical value of 1.98 with a 5%

assumed level of confidence (two tailed), the regression output creates a t-stat of 251.763 for

the slope coefficient of household consumption expenditure. The t-stat of`251.763 is greater

than the critical value 1.98 (251.763 > 1.98), so there is a 5% level of significance or a 95%

confidence level of passing the t-test. This adds more support that the slope coefficient is

statistically significant overall.

11

Page 16: Group A Report

Equation 4 Standard Format for USA Household Consumption OLS Regression Output

2.3.1.4 Net exportsThe sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

the simple linear regression model had a very high r-squared of 0.823 or 82.3% telling us that

net exports explains 82.3% of the variation in GDP. The adjusted r-square (which considers

sample size) is extremely close to the r-square with a value of 0.818 showing that this model

is so far useful. More importantly, the r-squared value is statistically significant, using the p-

value approach, which its p-value of 2.13E-17. This p-value surpasses the 1% standard so we

can conclude that the overall regression model is statistically significant at the 1% level.

After regressing GDP (y) on net exports (x), net export's slope coefficient was -17.7806. This

is interpreted as GDP (US$) is expected to decrease by 17.7806 for each dollar rise of net

exports.

We assessed net export's significance level through the three statistical tests where it passed

all three evaluations: p-value, confidence interval, and t-test. The slope coefficient is

statistically significant with a p-value of 2.13E-17, so the model is very significant at a 99%

level of confidence (1% level of significance). The confidence intervals of the slope

coefficient further justifies significance of the variable, net exports, because zero is not in the

range; thus, the regression output shows that we can be 95% confident that the unknown

population slope lies between -20.3472 and -15.2139. Since zero is not in the confidence

interval, we can therefore be 95% confident that the real population slope is not zero and that

there is a negative relationship between the variables GDP and net exports. Lastly with a

critical value of 1.98 with a 5% assumed level of confidence (two tailed), the regression

output creates a t-stat of -13.9806 for the slope coefficient of net exports. The t-stat of -

13.9806 (absolute value) is greater than the critical value 1.98 (13.9806 > 1.98), so there is a

5% level of significance or a 95% confidence level of passing the t-test. This adds more

support that the slope coefficient is statistically significant overall.

12

Page 17: Group A Report

Equation 5 Standard Format for USA Net Exports OLS Regression Output

2.3.2 USA Multiple RegressionWith a sample size of 44 observations in this multiple regression, it is acceptable to make

generalizations about the population because it is greater than 30. Along with a good sample

size, the model has a high r-squared of 0.9995 or 99.95% telling us that the independent

variables (FDI, Government Consumption Expenditure, House Consumption Expenditure and

Net Exports) explain 99.95% of the variation in GDP. The adjusted r-square (which considers

sample size) is extremely close to the r-square with a value of 0.99945 illustrating that this

model is so far useful. More importantly, the r-squared value is statistically significant, using

the p-value approach, which its p-value of 8.8397E-64. This p-value surpasses the 1%

standard so we can conclude that the overall regression model is statistically significant at a

99% confidence interval. It is important to note that the presence of the one outlier in the

standard residuals column may have reduced the r-square value (-2.55891 is outside of the +/-

2.5 outlier rule). There is an option to regress the output of the model without the outlier

(observation 44) to see its impact on the overall regression model and its r-square value to

help make the regression a better one. Despite one outlier, this multiple regression model is

good and useful thus far.

It is common in a multiple regression equation output to not be too concerned with the

intercept term as shown in the standard format, but interpreting the slope coefficients,

deciding if they are relevant (which all are theoretically relevant) and if they are statistically

significant is very important when determining the goodness of the regression model. An

example in interpreting the regression slope coefficients for FDI would be: GDP (US$) is

expected to increase by 38.74517 for each dollar rise of FDI ceteris paribus (keeping all other

independent variables constant). This interpretation goes for the other three independent

variables as well.

Testing if the slope coefficients are statistically significant through the three tests (p-value, t-

test, and confidence intervals) help determine if this model is useful and good. Government

13

Page 18: Group A Report

consumption expenditure, house consumption expenditure, and net exports are the only three

independent variables of the four that pass the hypothesis testing for the regression model.

FDI had a p-value exceeding 0.05 (specifically 0.637), had the presence of zero in the

confidence intervals (-0.5339888, 0.86206896) and has a t value (0.475342) that is lower than

the critical value (two sided) of 1.98. It also important to note that the government spending

variable for the USA had a negative sign when modern theory indicates that government

spending should have a positive correlation with GDP which may indicate the variable is not

relevant to the model, however the variable remains statistically significant.

The correlation matrix revealed very high correlations among the independent variables

ranging from 0.84 to 0.996, which indicates the presence of multicollinearity.

Multicollinearity may be caused by the presence of outliers, an irrelevant variable, an omitted

relevant variable or using data that basically measures the same thing.

Detail USA GDP

(current US$)

USA Foreign direct

investment, net inflows (BoP,

current US$)

USA General

government final

consumption

expenditure

(current US$)

USA Househol

d final consump

tion expenditu

re, etc. (current

US$)

USA Net

Exports

(Current

US$)

USA GDP (current US$) 1USA Foreign direct investment, net inflows (BoP, current US$)

0.864582

1

USA General government final consumption expenditure (current US$)

0.995433

0.846306 1

USA Household final consumption expenditure, etc. (current US$)

0.999669

0.863119 0.996601 1

USA Net Exports (Current US$) -0.9072

6

-0.83613 -0.90267 -0.91009 1

Table 4 Correlation Matrix for the USA Multiple Regression Model

Analysing the condition index and VIF for the US model, there is some discrepancy in the

results. For multicollinearity to be present the condition index must be greater than 20 and the

VIF must be greater than 10. The condition index of US net exports is 71.129 but the VIF is

less than 10 for the corresponding variable. Likewise the VIF for US consumption and US

government spending is greater than the 10 threshold but the condition index is less than 20

as can be seen in the table below. Thus we rule out the presence of multicollinearity.

14

Page 19: Group A Report

Detail VIF Condition Index

(Constant) 1.000

USFDI 4.630 3.302

USCON 167.942 6.783

USGOV 191.660 9.614

USEXP 6.181 71.129

Table 5 Collinearity Diagnostics for the USA Multiple Regression Model

Equation 6 Standard Format for USA Multiple Regression Output

Due to the possible presence of multicollinearity and presence of autocorrelation (examined

in part D) along with the government spending variable not sloping in the direction of

modern theory it can be concluded that this model of the USA is not very useful.

2.3.3 Netherlands OLS RegressionsFull regression results for part C regarding the Netherlands can be found in appendix B.

2.3.3.1 FDIThe sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

the simple linear regression model has a very low r-squared of 0.469 or 46.9% telling us that

FDI explains 46.9% of the variation in GDP. The adjusted r-square (which considers sample

size) is extremely close to the r-square with a value of 0.456, which is a good thing. More

importantly, the r-squared value is statistically significant, using the p-value approach, as its p

-value is 2.94E-07. This p-value surpasses the 1% standard so we can conclude that the

overall regression model is statistically significant at the 1% level.

After regressing GDP (y) on foreign direct investment (x), FDI's slope coefficient was

1.228757. This is interpreted as GDP (in US$) is expected to increase by 1.228757 for each

dollar rise of FDI.

15

Page 20: Group A Report

We assessed FDI's significance level through the three statistical tests where it passed all

three evaluations: p-value, confidence interval, and t-test. The slope coefficient is statistically

significant with a p-value of 2.94E-07, so the model is very significant at a 99% level of

confidence (1% level of significance). The confidence intervals of the slope coefficient

further justifies significance of the variable, FDI, because zero is not in the range; thus, the

regression output shows that we can be 95% confident that the unknown population slope lies

between 0.821652 and 1.635862. Since zero is not in the confidence interval, we can

therefore be 95% confident that the real population slope is not zero and that there is a

positive relationship between the variables GDP and FDI. Lastly with a critical value of 1.98

with a 5% assumed level of confidence (two tailed), the regression output creates a t-stat of

6.091136 for the slope coefficient of FDI. The t stat of 6.091136 is greater than the critical

value 1.98 (6.091136 > 1.98), so there is a 5% level of significance or a 95% confidence

passing the t-test. This adds more support that the slope coefficient is statistically significant

overall.

Equation 7 Standard Format for Netherlands FDI OLS Regression Output

2.3.3.2 Government ExpenditureThe sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

the simple linear regression model has a very low r-squared of 0.986 or 98.6% telling us that

government consumption expenditure explains 98.6% of the variation in GDP. The high r-

squared tells us that it is a good model because it is close to an ideal value of 100%. The

adjusted r square (which considers sample size) is extremely close to the r square with a value

of 0.985 as this is a good thing. More importantly, the r squared value is statistically

significant, using the p-value approach, which its p-value of1.22E-40. This p-value surpasses

the 1% standard so we can conclude that the overall regression model is statistically

significant at the 1% level.

16

Page 21: Group A Report

After regressing GDP (y) on government consumption expenditure (x), government

consumption expenditure's slope coefficient was 3.881761. This is interpreted as GDP (US$)

is expected to increase by 3.881761 for each dollar rise in government consumption

expenditure.

We assessed government consumption expenditure's significance level through the three

statistical tests where it passed all three evaluations: p-value, confidence interval, and t-test.

The slope coefficient is statistically significant with a p-value of 1.22E-40, so the model is

very significant at a 99% level of confidence (1% level of significance). The confidence

intervals of the slope coefficient further justifies significance of the variable, government

consumption expenditure, because zero is not in the range; thus, the regression output shows

that we can be 95% confident that the unknown population slope lies between 3.738303 and

4.025219. Since zero is not in the confidence interval, we can therefore be 95% confident that

the real population slope is not zero and that there is a positive relationship between the

variables GDP and government consumption expenditure. Lastly with a critical value of 1.98

with a 5% assumed level of confidence (two tailed), the regression output creates a t-stat of

54.60619 for the slope coefficient of government consumption expenditure. The t-stat of

54.60619 is greater than the critical value 1.98 (54.60619 > 1.98), so there is a 5% level of

significance with 95% confidence level to pass the t-test. This adds more support that the

slope coefficient is statistically significant overall.

Equation 8 Standard Format for Netherlands Government Expenditure OLS Regression Output

2.3.3.3 ConsumptionThe sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

the simple linear regression model has a very low r-squared of 0.995 or 99.5% telling us that

household consumption expenditure explains 99.5% of the variation in GDP. The high r-

squared tells us that it is a good model because it is close to an ideal value of 100%. The

adjusted r-square (which considers sample size) is extremely close to the r-square with a

17

Page 22: Group A Report

value of 0.995 exemplifying that this model is so far useful. More importantly, the r-squared

value is statistically significant, using the p-value approach, which its p-value of 3.4E-51.

This p-value surpasses the 1% standard so we can conclude that the overall regression model

is statistically significant at the 1% level.

After regressing GDP (y) on household consumption expenditure (x), household consumption

expenditure’s slope coefficient was 2.21298. This is interpreted as GDP (US$) is expected to

increase by 2.21298 for each dollar rise of household consumption expenditure.

We assessed household consumption expenditure’s significance level through the three

statistical tests where it passed all three evaluations: p-value, confidence interval, and t-test.

The slope coefficient is statistically significant with a p-value of 3.4E-51, so the model is

very significant at a 99% level of confidence (1% level of significance). The confidence

intervals of the slope coefficient further justifies significance of the variable, household

consumption expenditure, because zero is not in the range; thus, the regression output shows

that we can be 95% confident that the unknown population slope lies between 2.167342 and

2.258617. Since zero is not in the confidence interval, we can therefore be 95% confident that

the real population slope is not zero and that there is a positive relationship between the

variables GDP and household consumption expenditure. Lastly with a critical value of 1.98

with a 5% assumed level of confidence (two tailed), the regression output creates a t-stat of

97.85771 for the slope coefficient of household consumption expenditure. The t-stat of

97.85771 is greater than the critical value 1.98 (97.85771 > 1.98), so there is a 5% level of

significance along with a 95% confidence level of passing the t test. This adds more support

that the slope coefficient is statistically significant overall.

Equation 9 Standard Format for Netherlands Household Consumption OLS Regression Output

2.3.3.4 Net ExportsThe sample size of the regression is acceptable to make generalizations about the population

because it is greater than 30 (specifically 44 observations). Along with a good sample size,

18

Page 23: Group A Report

the simple linear regression model had a very high r-squared of 0.957 or 95.7% telling us that

net exports explains 95.7% of the variation in GDP. The high r-squared tells us that it is a

good model because it is close to an ideal value of 100%. The adjusted r-square (which

considers sample size) is extremely close to the r-square with a value of 0.955 showing that

this is a good thing for the model. More importantly, the r-squared value is statistically

significant, using the p-value approach, where the p-value of the model is 2.49E-30. This p-

value surpasses the 1% standard so we can conclude that the overall regression model is

statistically significant at the 1% level. Although the r-square is significantly high for this

model, it is important to note that the presence of the outlier may have reduced the r-square

value (-3.00 is outside of the +/- 2.5 outlier rule). There is an option to regress the output of

the model without the outlier (observation 44) to see its impact on the overall regression

model and its r-square value to help make the regression a better one.

After regressing GDP (y) on net exports (x), net export's slope coefficient was 10.27151. This

is interpreted as GDP (US$) is expected to increase by 10.27151 for each dollar rise of net

exports.

We assessed net export's significance level through the three statistical tests where it passed

all three evaluations: p-value, confidence interval, and t-test. The slope coefficient is

statistically significant with a p-value of 2.49E-30, so the model is very significant at a 99%

level of confidence (1% level of significance). The confidence intervals of the slope

coefficient further justifies significance of the variable, net exports, because zero is not in the

range; thus, the regression output shows that we can be 95% confident that the unknown

population slope lies between 9.593672 and 10.94936. Since zero is not in the confidence

interval, we can therefore be 95% confident that the real population slope is not zero and that

there is a positive relationship between the variables GDP and net exports. Lastly with a

critical value of 1.98 with a 5% assumed level of confidence (two tailed), the regression

output creates a t-stat of 30.5805 for the slope coefficient of net exports. The t-stat of 30.5805

is greater than the critical value 1.98 (30.5805 > 1.98), so there is a 5% level of significance

of passing the t-test. This adds more support that the slope coefficient is statistically

significant overall.

19

Page 24: Group A Report

Equation 10 Standard Format for Netherlands Net Exports OLS Regression Output

2.3.4 Netherlands Multiple RegressionWith a sample size of 44 observations in this multiple regression, it is acceptable to make

generalizations about the population because it is greater than 30. Along with a good sample

size, the multiple linear regression model had a high r-squared of 0.9996 or 99.96% telling us

that the independent variables (FDI, Government Consumption Expenditure, House

Consumption Expenditure and Net Exports) explain 99.96% of the variation in GDP. The

adjusted r-square (which considers sample size) is extremely close to the r-square with a

value of 0.99959 displaying how this model is so far useful. More importantly, the r-squared

value is statistically significant, using the p-value approach, since its p-value is 3.15E-66.

This p-value surpasses the 1% standard so we can conclude that the overall regression model

is statistically significant at the 99% confidence interval. It is important to note that the

presence of the two outliers may have reduced the r square value (-2.81251 and 3.848636 are

outside of the +/- 2.5 outlier rule). There is an option to regress the output of the model

without the outlier (observation 1 and 44) to see its impact on the overall regression model

and its r square value to help make the regression a better one. Despite outliers, this multiple

regression model is good and useful thus far.

It is common in a multiple regression equation output to not be too concerned with the

intercept term as shown in the standard format, but interpreting the slope coefficients,

deciding if they are relevant (which all are theoretically relevant) and if they are statistically

significant is very important when determining the goodness of the regression model. An

example in interpreting the regression slope coefficients for household consumption would

be: GDP (US$) is expected to increase by 2.21298 for each dollar rise of household

consumption expenditure ceteris paribus (keeping all other independent variables constant).

This interpretation goes for the other three independent variables as well.

Testing if the slope coefficients are statistically significant through the three tests (p-value, t-

test, and confidence intervals) help determine if this model is useful and good. All of the

20

Page 25: Group A Report

independent variables, FDI, government consumption expenditure, house consumption

expenditure, and net exports pass the hypothesis testing for the regression model. Despite the

presence of the two outliers, this model is still very good or useful thus far due to the an

overall statistically significant regression model, high r-square, good sample size, high

adjusted r-square, and all independent variables that are statistically significant.

However, the correlation matrix revealed very high correlations among some of the

independent variables such as government spending and household consumption ranging

from 0.84 to 0.996, which indicates the presence of multicollinearity. Multicollinearity may

be caused by the presence of outliers, an irrelevant variable, an omitted relevant variable or

using data that basically measures the same thing.

Detail NETH GDP (current

US$)

NETH Foreign direct

investment, net inflows

(BoP, current US$)

NETH General government

final consumption expenditure

(current US$)

NETH Household

final consumption expenditure, etc. (current

US$)

NETH Net Exports (Current

US$)

NETH GDP (current US$)

1

NETH Foreign direct investment, net inflows (BoP, current US$)

0.684865 1

NETH General government final consumption expenditure (current US$)

0.993031 0.695656 1

NETH Household final consumption expenditure, etc. (current US$)

0.997814 0.661969 0.984356 1

NETH Net Exports (Current US$)

0.978273 0.753447 0.983997 0.966786 1

Table 6 Correlation Matrix for the Netherlands Multiple Regression Model

Analysing the condition index and VIF for the US model, there is some discrepancy in the

results. For multicollinearity to be present the condition index must be greater than 20 and the

VIF must be greater than 10. The condition index for consumption and net exports are greater

than 20 and the VIF for the corresponding variables are greater than 10. We can conclude that

house consumption expenditure and net exports for the Netherlands data set has

multicollinearity present. Government expenditure has a VIF of 68.526 with which are

21

Page 26: Group A Report

greater than 10 but a condition index of 4.644 which is less than the 20 threshold. The

presence of multicollinearity cannot be ruled out as can be seen from the table below.

Detail VIF Condition Index

(Constant) 1.000NETHFDI 2.817 2.620

NETHGOV 68.526 4.644

NETHCON 33.230 21.178

NETHEXP 44.560 33.918Table 7 Collinearity Diagnostics for the Netherland Multiple Regression Model

Equation 11 Standard Format for Netherlands Multiple Regression Output

With the presence of multicollinearity and autocorrelation (examined in part D) in the

Netherlands model, we can conclude that the model is not very useful as not all of the noise

in the error term is random and may be affected by an omitted or irrelevant variable in the

model.

2.4 Part D

2.4.1 Autocorrelation Testing for USAThe overall residual plot for the USA appears largely homoscedastic excluding the outliers;

however, the residuals appear to form a pattern of peaks and troughs throughout the time

period. This indicates that GDP will increase for a number of years and decrease for a

number of years, which underestimates true variance making the statistical slope tests

irrelevant. A miss-specified model may cause these conclusions where such models occur

from omitted variables, irrelevant variables, or outliers. The pattern can be seen from the

graph below and the data will be further tested using the Durban Watson test.

22

Page 27: Group A Report

1960 1970 1980 1990 2000 2010 2020

-350000000000

-300000000000

-250000000000

-200000000000

-150000000000

-100000000000

-50000000000

0

50000000000

100000000000

150000000000

Overall Residual Plot for the USA Model

Time (years)

Resid

uals

Figure 5 Overall Residual Plot for the USA Model

Model Summaryb

Mode

l

R R

Square

Adjusted R

Square

Std. Error of

the Estimate

Durbin-

Watson

1 1.000a .999 .999 11862001770

3.45859

.348

a. Predictors: (Constant), netexport, FDI, govexpenditure, householdconsump

b. Dependent Variable: GDPTable 8 Model Summary for the USA Multiple Regression Model

The D-W (= 0.348) is below 2 which indicates positive serial correlation but this needs to be

compared to critical values (which can be obtained from D-W tables) before one can decide

whether we can accept or reject the null hypothesis of no serial correlation in the data.

The hypothesis test for autocorrelation is as follows:

Step 1: H 0 : No autocorrelation in the regression model.

H a : There is autocorrelation in the regression model.

Step 2: Critical Value from Durbin Watson tables.

From the extract of D-W critical values, we generate two values dL and dU which represent

upper and lower limits for the D-W statistic – for the data in the autocorrelation sheet with n

= 45 (closest to n = 44, our sample size) and k = 4 (four X variables), we get dL = 1.156 and

dU = 1.528.

23

Page 28: Group A Report

Step 3: Calculate the Durbin-Watson statistic from SPSS – this is 0.348.

Step 4: Conclusion: We can reject H 0with 1% significance (these are 1% tables). In this case,

the Durbin-Watson statistic, 0.348, is less than the dL, 1.159, and therefore, we can conclude

that there is positive autocorrelation in this data. Thus we reject the null hypothesis and

accept the alternative hypothesis.

2.4.2 Autocorrelation Testing NetherlandsThe overall residual plot for the Netherlands appears homoscedastic; however, the residuals

appear to form a pattern of peaks and troughs throughout the time period. This indicates that

GDP will increase for a number of years and decrease for a number of years, which

underestimates true variance making the statistical slope tests irrelevant. A miss-specified

model may cause these conclusions where such models occur from omitted variables,

irrelevant variables, or outliers. The pattern can be seen from the graph below and the data

will be further tested using the Durban Watson test.

1960 1970 1980 1990 2000 2010 2020

-20000000000

-15000000000

-10000000000

-5000000000

0

5000000000

10000000000

15000000000

20000000000

25000000000

Overall Residual Plot for the Netherlands Model

Time (years)

Resid

uals

Figure 6 Overall Residual Plot for the Netherlands Model

Model Summaryb

Mode

l

R R

Square

Adjusted R

Square

Std. Error of

the Estimate

Durbin-

Watson

1 1.000a 1.000 1.000 5492329398.

62595

.896

a. Predictors: (Constant), householdconsump, netexport, govexpenditure, FDI

b. Dependent Variable: GDPTable 9 Model Summary for the Netherlands Multiple Regression Model

24

Page 29: Group A Report

The D-W (0.896) is below 2 which indicates positive serial correlation but this needs to be

compared to critical values (which can be obtained from D-W tables) before one can decide

whether we can accept or reject the null hypothesis of no serial correlation in the data.

The hypothesis test for autocorrelation is as follows:

Step 1: H 0 : No autocorrelation in the regression model.

H a : There is autocorrelation in the regression model.

Step 2: Critical Value from Durbin Watson tables.

From the extract of D-W critical values, we generate two values dL and dU which represent

upper and lower limits for the D-W statistic – for the data in the autocorrelation sheet with n

= 45 (closest to n = 44, our sample size) and k = 4 (four X variables), we get dL = 1.156 and

dU = 1.528.

Step 3: Calculate the Durbin-Watson statistic from SPSS – this is 0.896.

Step 4: Conclusion: We can reject H 0 with 1% significance (these are 1% tables). In this

case, the Durbin-Watson statistic, 0.896, is less than the dL, 1.159, and therefore, we can

conclude that there is positive autocorrelation in this data. Thus we reject the null hypothesis

and accept the alternative hypothesis.

25

Page 30: Group A Report

2.4.3 Normal Distribution for the USA Multiple Regression Model ResidualsLooking at the normal distribution of the USA residuals we can see that the shape of the

distribution is not symmetrical a fat tail to the right. However, the histogram appears to show

a heavy skewness to the left with all of the bars to the left of the highest bar.

Figure 7 Histogram of USA Multiple Regression Residuals

The negative skewness of -.0781 as seen in the table below further signifies skewness to the

left.

USA Residuals Descriptive Statistics

Mean -0.000965465Standard Error 16569744409Median 24395340639Mode #N/AStandard Deviation 1.09911E+11Sample Variance 1.20805E+22Kurtosis -0.420720907Skewness -0.781885131Range 4.0583E+11Minimum -2.81252E+11Maximum 1.24577E+11Sum -0.042480469Count 44Largest(1) 1.24577E+11Smallest(1) -2.81252E+11Confidence Level(95.0%) 33416074292Table 10 USA Multiple Regression Residual Descriptive Statistics

26

Page 31: Group A Report

The Jarque Bera test is a statistical analysis to identify whether or not the residuals of a given

data set are normally distributed. It is a hypothesis test with the following null and alternative

hypothesis:

H 0 : Skewness is zero and kurtosis is 3.

H a : Skewness is not zero and kurtosis is not 3.

The critical value (chi value) was determined by the "=Chiinv(0.05,2)" function in excel

which equates to 5.991. The formula to derive the Jarque Bera statistic (JB) can be seen

below were "n" equals the sample size, "k" equals kurtosis and "s" equals skewness.

JB=n ( s2

6+

(k−3 )2

24 )JB=44(−0.7812

6+ (−0.421−3 )2

24 ) JB=¿ 25.93

Equation 12 Jarque Bera Equation for the USA Multiple Regression Model

The JB statistic of 25.93 is greater than the critical value of 9.497 which means we must

reject the null hypothesis and accept the alternative hypothesis. Combining the analysis of the

Jarque Bera test as well as the analysis of the histogram and skewness we can determine that

the residuals for the USA multiple regression model are not normally distributed and do not

have a kurtosis of 3.

2.4.4 Normal Distribution for the Netherlands Multiple Regression Model ResidualsAs can be seen on the histogram below, the residuals for the Netherlands multiple regression

model appear to be skewed to the left with 4 boxes on the left of the highest box on the chart

compared to one box on the right which is deemed to be an outlier as it occurs where the bell

curve flattens out.

27

Page 32: Group A Report

Figure 8 Histogram of the Netherlands Multiple Regression Residuals

The skewness figure of 0.594 contradicts this analysis as data, which is skewed to the left,

tends to have a negative skewness figure, though this contradiction may be caused by the

outlier as seen on the histogram.

Netherlands Residual Descriptive Statistics

Mean -3.5546E-06Standard Error 794443405.1Median 1089217220Mode #N/AStandard Deviation 5269741384Sample Variance 2.77702E+19Kurtosis 5.028523526Skewness 0.594789731Range 35102516448Minimum -14821202345Maximum 20281314103Sum -0.000156403Count 44Largest(1) 20281314103Smallest(1) -14821202345Confidence Level (95.0%) 1602147818Table 11 Netherlands Multiple Regression Residual Descriptive Statistics

28

Page 33: Group A Report

The Jarque Bera test is a statistical analysis to identify whether or not the residuals of a given

data set are normally distributed. It is a hypothesis test with the following null and alternative

hypothesis:

H 0 : Skewness is zero and kurtosis is 3

H a : Skewness is not zero and kurtosis is not 3

The critical value (chi value) was determined by the "=Chiinv(0.05,2)" function in excel

which equates to 5.991. The formula to derive the Jarque Bera statistic (JB) can be seen

below were "n" equals the sample size, "k" equals kurtosis and "s" equals skewness.

JB=n ( s2

6+

(k−3 )2

24 )JB=44( 0.5942

6+

(5.028−3 )2

24 ) JB=¿ 10.138

Equation 13 Jarque Bera Equation for the Netherlands Multiple Regression Model

The Jarque Bera statistic of 10.138 is greater than the critical value of 9.497 which means we

must reject the null hypothesis and accept the alternative hypothesis. Combining the analysis

of the Jarque Bera test as well as the analysis histogram and skewness we can determine that

the residuals for the Netherlands multiple regression model are not normally distributed and

do not have a kurtosis of 3.

2.5 Part E

2.5.1 Specification Error of the US model As a part a classical linear regression model, assumption ten verifies if the model is correctly

specified or not. This ensures that all drivers of the dependent variable are relevant and

included to visualize their impact on the dependent variable. The Ramsey RESET test

determines whether or not there is a specification error by measuring if non-linear

combinations of the fitted values explain the dependent variable. If the combinations do not

explain the dependent variable properly, then the model is most likely miss-specified. Below

gives the procedure of the Ramsey RESET test:

29

Page 34: Group A Report

Step 1: H 0 : The correct specification is linear.

H a : The correct specification is non-linear.

Step 2: Get a Critical Value from F tables – this will be based on the degrees of freedom in

an unrestricted regression model. 2.49 (at 5% significance level, v_1=7 and v_2=36).

Step 3: Get an F statistic - Compare the fit of the two equations using an F test.

F=(SSRM−SSRU ) /MSSRU /(n−k−1)U

F=(519460765112148000000000.00−35229096389330200000000.00 )/3

35229096389330200000000.00 /(44−7−1)

F=164.9426361

Equation 14 Ramsey Reset Equation for USA

Step 4: Conclusion – reject H 0 if the F statistic exceeds the critical value.

As the F statistic is greater than the critical value, we will reject the null hypothesis and

accept the alternative hypothesis meaning that that the true specification of the model is non-

linear (with 5% level of significance). Since we are 95% confident that the USA model is

miss-specified, the rejection shows that our model has non-linear combinations of the

independent variables that have power in explaining the dependent variable, GDP. Possible

reasons for miss-specification in our model could be an inclusion of an irrelevant variable,

exclusion of a relevant variable (it’s omitted), or miss-specification of the functional form

such as an inappropriate linear relationship.

2.6 Part F

2.6.1 Adding the Dummy VariableA multiple regression was run pooling both the US and Netherlands datasets together. This

regression was run twice, one with the dummy variable and one without the dummy variable.

The slope of the dummy variable is "2.4E+11" where it can be interpreted in two ways since

the US = 1 and the Netherlands = 0. If the dummy variable is equal to zero then we can

expect GDP to be 3.2E+11 in the Netherlands or if the dummy variable is equalled to one

30

Page 35: Group A Report

then we can expect GDP to be 3.2E+11 - 2.4E+11 = 82116505677 (intercept plus the slope

multiplied by 1) in the United States.

As for the statistical significance of the dummy variable:

H 0 : A country has no effect on the growth rate in GDP (B=0)

H a : A country has an effect on the growth rate in GDP (B=1)

The dummy variable difference surpasses all three statistical significant tests of p-value, t-

test, and confidence interval. The slope coefficient is statistically significant with a p-value of

9.37E-11, so the model is significant at a 99% level of confidence (1% level of significance).

When the dummy variable is included, foreign direct investment becomes statistically

significant passing the p-value test at a 99% confidence interval. However government

spending remains statistically insignificant. There are two outliers present in the dummy

variable model at observation 1 and 2. Both these observations have a standard residual

greater than the nominal value of 2.5.

The confidence intervals of the slope coefficient further justifies significance of the dummy

variable because zero is not in the range; thus, the regression output shows that we can be

95% confident that the unknown population slope lies between -3.0204E+11 and -1.7E+11.

Lastly with a critical value of 1.98 with a 5% assumed level of confidence (two tailed), the

regression output creates a t-stat of -7.4282 for the dummy variable's slope coefficient. The t-

stat of -7.4282 (absolute value) is greater than the critical value 1.98 (7.4282 > 1.98), so there

is a 5% level of significance or a 95% confidence level of passing the t test. Thus we can

reject the null hypothesis accept the alternative hypothesis. This adds more support that the

slope coefficient is statistically significant overall.

2.6.2 Interaction EffectWe are looking to test the hypothesis that household consumption has a different impact on

GDP in the United States as opposed to the Netherlands. We multiplied the dummy variable

by consumption and performed a multiple linear regression with all of the other variables

present in the model. The model has a high-adjusted r-square of 0.999 but also has a high

standard error of 8.19E+10. All of the variables in this model are statistically significant with

the exception of foreign direct investment as its p-value of 0.272 is greater than the

significance level of 0.05. The significance F is also less than 0.05 indicating that the overall

model is statistically significant.

31

Page 36: Group A Report

H 0 :Consumption has the same effect in both countries.

H a :Consumption has a differing effect in both countries.

With all other independent variables being held constant and subbing a 1 into the model to

simulate the USA we find that GDP is 3.53021E+11. However, when we sub in 0 into the

model to simulate the Netherlands we find that GDP is negative at -2.2618E+10. This

indicates that household consumption expenditure has a positive effect on GDP in the USA

with all other variables being held constant and a negative effect in the Netherlands, which

does not make theoretical sense, as consumption cannot be negative.

With these results in mind we can reject the null hypothesis that consumption has the same

effect in both countries and accept the alternative hypothesis and assumption that

consumption has a differing effect in both countries.

2.7 Part G

2.7.1 Lag of Government Spending for the USASome variables may not have an immediate effect on Y and thus take time before their impact

is realised. For example when the government spends money on infrastructure the impact on

GDP may not occur until a number of years after the event. As a result distributed lags are

used on variables whose output has a time delay on Y. The more periods you lag the more

degrees of freedom you give up which can hinder the goodness of the model during the

significance hypothesis tests.

Government expenditure (General government final consumption expenditure (current US$))

was chosen to test the effect of lagging a variable. Government expenditure (GE) was lagged

for two periods 1972-2013 with 42 observations as opposed to 1970-2013 with 44

observations.

The slope of government expenditure is 16.738, which indicate when all other variables are

held constant GDP will increase in the US by 16.738 for every one-unit increase in

government spending.

When government spending is lagged by a period of one, the slope drastically changes to -

20.743 indicating when all other variables are held constant government spending has a

negative effect on GDP by 20.743 for the US when increased by a unit of one.

32

Page 37: Group A Report

When government spending is lagged for a second period the slope reverts to 10.465. Thus

when all other variables are held constant the second lag of government spending will

increase GDP by 10.465 in the US for every one-unit increase in government spending.

The independent variables in this model are statistically significant at a 95% confidence

interval as the p-value for the independent variables are all less than the significance level of

0.05 (5%) as can be seen in the table below under the p-value heading. Thus we can reject the

null hypothesis that the slopes are statistically insignificant and not representative of the

sample size and do not reject the alternative hypothesis that the slopes are statistically

significant and relevant of the sample size.

Coefficients Standard Error t Stat P-valueIntercept 1.57E+09 1.43E+11 0.010972 0.991303GE 16.73863 3.354576 4.98979 1.37E-05Lag of GE X1 -20.743 6.754004 -3.07122 0.003927Lag of Ge X2 10.46563 3.86703 2.706373 0.010131Table 12 Coefficients and Significance Values for the USA Government Spending Lag Model

Multicollinearity is also present between the lag variables, which are another downside of

distributive lags. The VIF for the lagged independent variables are greater than 10 and the

condition index for these variables are greater than 20 as seen in the table below. Thus we can

conclude that multicollinearity is present for the two period distributive lag for the

Netherland’s government spending which can be seen from the table below. The unlagged

government consumption expenditure variable (USAGE) has a VIF greater than 10 at

1181.612 and a conflicting condition index which is less than 20 at 4.033. As a result we

cannot rule out the possibility of multicollinearity with regards to unlagged government

spending variable, which can be seen from the table below.

Detail Condition Index VIF

(Constant) 1.000

USAGE 4.033 1181.612

USAGELAG1 66.689 4464.390

USAGELAG2 294.452 1401.786

Table 13 Collinearity Diagnostics for the USA Government Spending Lag Model

33

Page 38: Group A Report

Equation 15 Standard Format for USA Two Period Distributive Lag of Government Spending Regression on GDP (Current US$)

2.7.2 Lag of Government Spending for the NetherlandsSome variables may not have an immediate effect on Y and thus take time before their impact

is realised. For example when the government spends money on infrastructure the impact on

GDP may not occur until a number of years after the event. As a result distributed lags are

used on variables whose output has a time delay on Y. The more periods you lag the more

degrees of freedom you give up which can hinder the goodness of the model during the

significance hypothesis tests.

Government expenditure (General government final consumption expenditure (current US$))

was chosen to test the effect of lagging a variable. Government expenditure (GE) was lagged

for two periods from 1972-2013 with 42 observations as opposed to 1970-2013 with 44

observations.

The slope of government expenditure is 4.923 which indicate when all other variables are

held constant GDP will increase in the Netherlands by 4.923 for every one-unit increase in

government spending.

When government spending is lagged by a period of one, the slope drastically changes to -

0.729 indicating when all other variables are held constant government spending will

decrease GDP in the Netherlands by 0.729 for every one-unit in government spending.

When government spending is lagged for a second period the slope increases slightly to -

0.394; thus when all other variables are held constant the second lag of government spending

will decrease GDP by 0.394 in the Netherlands for every one unit increase in government

spending.

Government spending is statistically significant as its p-value is less than the significance

level of 0.05 (5%) meaning we can reject the null hypothesis that the slope is statistically

insignificant and not representative of the population size and accept the alternative

hypothesis that the slope is statistically significant and representative of the population size.

34

Page 39: Group A Report

The lag variables failed the significance tests as their p-value was greater than the

significance level of 0.05 (5%) as can be seen on the table below under the p-value section.

Thus we must not reject the null hypothesis as aforementioned before and reject the

alternative hypothesis.

Coefficients Standard Error t Stat P-valueIntercept 3.61E+10 8.22E+09 4.39445 8.62E-05

GE 4.923062 0.560679 8.780533 1.11E-10Lag of GE X1 -0.72973 0.916904 -0.79586 0.431058Lag of GE X2 -0.3941 0.579334 -0.68027 0.500458

Table 14 Coefficients and Significance Values for the Netherlands Government Spending Lag Model

Multicollinearity is also present between the lag variables, which are another downside of

distributive lags. The VIF for the lagged independent variables are greater than 10 and the

condition index for these variables are greater than 20 as seen in the table below. Thus we can

conclude that multicollinearity is present for the two period distributive lag for the

Netherland’s government spending which can be seen from the table below. In summation,

the failing of the significance tests by the lag variables and the presence of multicollinearity

indicate that lagging government expenditure for a period of 1 or 2 is not effective to the

model.

Detail Condition Index VIF

(Constant) 1.000

NethGE 3.552 61.210

NethGELAG1 20.443 153.173

NethGELAG2 48.385 57.515

Table 15 Collinearity Diagnostics for the Netherlands Government Spending Lag Model

Equation 16 Standard Format for Netherlands Two Period Distributive Lag of Government Spending Regression on GDP (Current US$)

35

Page 40: Group A Report

2.8 Part H

2.8.1 Forecasting GDP for the USATime series collects random variables over time in attempt to forecast future values of the

random variables based on its past and present values. However, such models assume the data

is stationary, meaning there is not variance over the time or the mean is constant, as well as

no presence of autocorrelation in the data. By looking at the graphs without the change of

GDP to the left, there is stationary and constant over time. The bottom two graphs with the

change of GDP do not show a constant, stationary line implying. It is clear that there are

collinearity issues by looking at the autocorrelation functions table, which is common with

distributed lag models. The table exemplifies that GDP is highly correlated over time as

shown in the first two graphs with their clear trends, but the difference/change of time does

not have this problem as there is not much pattern in the bottom two time series graph.

Autocorrelation Functions

Lag Length Y Log Y Y log Y

1 0.999328 0.999736 0.57924 0.6846492 0.998009 0.99933 0.25287 0.5599753 0.99659 0.998943 0.066808 0.5313414 0.995435 0.99857 0.036035 0.5006415 0.994656 0.998353 0.26192 0.571386 0.994514 0.998251 0.471831 0.548999

Table 16 Autocorrelation Functions for USA GDP (Current US$)

For example, the correlation between y and y lagged by one period is 0.999328 whereas the

between change of GDP and log of GDP change lagged by one period is only 0.57924. We

can see the difference below in the graphs how using the change of GDP and the change in

the natural log of GDP removes the autocorrelation by transforming stationary data in

nonstationary data.

36

Page 41: Group A Report

1960 1970 1980 1990 2000 2010 20200.00E+00

2.00E+12

4.00E+12

6.00E+12

8.00E+12

1.00E+13

1.20E+13

1.40E+13

1.60E+13

1.80E+13

Line Graph of US GDP 1970-2013

Time (Years)

GD

P (C

urre

nt U

S$)

Figure 9 Line Graph of US GDP (Current US$) from 1970-2013

1960 1970 1980 1990 2000 2010 202026

26.5

27

27.5

28

28.5

29

29.5

30

30.5

31

Line Graph of US Ln(GDP) 1970-2013

Time (Years)

Nat

ural

Log

GD

P (C

urre

nt U

S$)

Figure 10 Line Graph of the Natural Log US GDP (Current US$) from 1970-2013

37

Page 42: Group A Report

1960 1970 1980 1990 2000 2010 2020

-4.00E+11

-2.00E+11

0.00E+00

2.00E+11

4.00E+11

6.00E+11

8.00E+11

1.00E+12

Line Graph of US (GDP) 1970-2013

Time (Years)

Chan

ge in

GD

P (C

urre

nt U

S$)

Figure 11 Line Graph of the Change in US GDP (Current US$) from 1970-2013

1960 1970 1980 1990 2000 2010 2020

-4.00E-02

-2.00E-02

0.00E+00

2.00E-02

4.00E-02

6.00E-02

8.00E-02

1.00E-01

1.20E-01

1.40E-01

Line Graph of US Ln (Y) 1970-2013

Time (Years)

Chan

ge in

Nat

ural

log

of G

DP

(Cur

rent

US$

)

Figure 12 Line Graph of the change in the Natural Log of US GDP (Current US$) from 1970-2013

38

Page 43: Group A Report

2.8.2 Forecasting GDP for the NetherlandsTime series collects random variables over time in attempt to forecast future values of the

random variables based on its past and present values. However, such models assume the data

is stationary, meaning there is not variance over the time or the mean is constant, as well as

no presence of autocorrelation in the data. By looking at the graphs without the change of

GDP to the left, there is stationary and constant over time. The bottom two graphs with the

change of GDP do not show a constant, stationary line implying. But the It is clear that there

are collinearity issues by looking at the autocorrelation functions table, which is common

with distributed lag models. The table exemplifies that GDP is highly correlated over time as

shown in the first two graphs with their clear trends, but the difference/change of time does

not have this problem as there is not much pattern in the bottom two time series graph.

Autocorrelation Functions

Lag Length Y Log Y Y log Y

1 0.98925 0.992835 0.155338 0.3879432 0.973977 0.979641 -0.15339 0.0477573 0.962269 0.965112 0.175418 0.0314994 0.944479 0.950051 0.11437 0.1169095 0.924037 0.93285 -0.15299 -0.177856 0.916906 0.919247 -0.39885 -0.25205

Table 17 Autocorrelation Functions for Netherlands GDP (Current US$)

For example, the correlation between y and y lagged by one period is 0.98925 whereas the

between change of GDP and log of GDP change lagged by one period is only 0.155338. We

can see the difference below in the graphs how using the change of GDP and the change in

the natural log of GDP removes the autocorrelation by transforming stationary data into non

stationary data.

39

Page 44: Group A Report

1960 1970 1980 1990 2000 2010 20200

100000000000

200000000000

300000000000

400000000000

500000000000

600000000000

700000000000

800000000000

900000000000

1000000000000

Line Graph of Netherlands GDP (Y) Over Time

Time (Years)

GD

P (C

urre

nt U

S$)

Figure 13 Line Graph of the Netherlands GDP (Current US$) from 1970-2013

1960 1970 1980 1990 2000 2010 202022

23

24

25

26

27

28

Line Graph of Netherlands Ln(GDP) Over Time

Time (Years)

Nat

ural

Log

of G

DP

(Cur

rent

US$

)

Figure 14 Line Graph of the Natural Log of the Netherlands GDP (Current US$) from 1970-2013

40

Page 45: Group A Report

1960 1970 1980 1990 2000 2010 2020

-100000000000

-50000000000

0

50000000000

100000000000

150000000000

Line Graph of Netherlands (GDP) 1970-2013

Time (Years)

Chan

ge in

GDP

(cur

rent

US$

)

Figure 15 Line Graph of the Change in the Netherlands GDP (Current US$) from 1970-2013

1960 1970 1980 1990 2000 2010 2020

-0.2

-0.1

0

0.1

0.2

0.3

0.4

Line Graph of Netherlands Ln (GDP) 1970-2013

Years (Time)

Chan

ge in

Nat

ural

Log

of G

DP (C

urre

nt U

S$)

Figure 16 Line Graph of the Change in the Natural Log of the Netherlands GDP (Current US$) from 1970-2013

41

Page 46: Group A Report

3.0 Question 2For our group, we were assigned the year 2011 for the Merger Deal Negotiations of Question

Two of this Group Assignment. We conducted an analysis for the following mergers: EP

Global Opportunities Trust, Carrillion PLC, Sinclair Pharma PLC, Pearson PLC, Qatar,

Smiths News PLC, Greene King PLC, Infrastructure India PLC, Premier Oil PLC, and

Quindell Portfolio PLC. The time period of data we utilized in the year 2011, we used 125

days before and 25 days after as estimation. First, we calculated the abnormal return (AR)

and Cumulative abnormal return (CAR) for each of our ten mergers. To find the AR, we

subtracted the stock price from its previous day stock price to get the stock return; to find the

market return, known as the expected return, we subtracted the stock exchange shares from

its previous day share amount where the abnormal return is the difference between the stock

return and the market return. As for the CAR, there are two ways to discover the CAR: one

by the regression (CAR Part (ii)) and the other manually through market return (CAR Part i).

First, we regressed Y (Stock Return) on X (Market Return) to obtain the predicted Y and

residual values of each merger for the previous 125 days to 25 days before the announcement.

Starting 25 days before up to 25 days after the market marks the time period needed to

calculate both sets of CAR outside of the regression results. To predict the results, the predict

y was calculated as multiplying the slope and intercept with that day’s abnormal return rate

and the residuals were calculated as the difference between that same day’s stock price with

the predicted residuals. For CAR Part (i), we added the abnormal return of that day and the

next days. For instance, for Day 1 would equal that day’s sum and for Day 2 it would be the

sum of Day 1 and Day 2 and so forth. The same concept was applied for CAR Part (ii),

except the data derived from the residuals of the regression model. The table below analyses

the results of each merger based on their individual graphs. It is important to note that dips or

spikes refer to overreaction to stock news and gradual changes refer to a delayed response to

the stock news. Results can be seen in the table below.

Merger Strengths GraphEP The model displays a downward trend the

event window. Initially it made a positive abnormal return rate, but will drift off in the future (bad news). There is no immediate reaction on the announcement date suggesting an inefficient market, as there is a delayed reaction to proposed bad news.

42

Page 47: Group A Report

Carrillion The model displays a downward trend event window. There are some spikes and consistency in the middle but still drifts downward eventually (bad news). The delayed reaction to the proposed bad news suggests an inefficient market, as there was no immediate reaction to bad news followed by a normal trend.

Sinclair The model displays a downward trend event window. There is a spike in the end for CAR (i), but drifts back down (bad news). The market would appear to be inefficient in this example due to the lack of reaction to the news announcement followed by a downward trend in cumulative abnormal returns.

Pearson The model displays an upward trend during the event window. Initially it is consistent in its positive abnormal return rate, but will shift up in the future. It is important note that there is a sharp dip in the rate before shooting back upwards (good news). In the Pearson announcement there was an immediate reaction to the announcement but there were a couple of over reactions post announcement which suggests market inefficiency

Qatar The model displays a consistent trend during the event window. Initially it held a consistent abnormal return rate, but dipped rapidly before remaining consistent at its new abnormal return rate in the future (no news – sudden dip means overreaction to bad news). The over-reaction 10 days prior to the announcement day suggests an inefficient market and no reaction on the announcement date. It would also be an indicator of insider trading as a sharp move in cumulative abnormal returns before a major merger announcement is a little suspicious.

43

Page 48: Group A Report

Smiths The model displays a consistent downward trend during the event window. There are some dips and spikes along the way in the abnormal return rate, and it will drift off in the future (no news). The delayed over-reaction indicated by a price spike 9 days after the announcement suggests inefficiency in the market.

Greene The model displays a constant trend during the event window. Eventually it is expected to drift off by CAR Part (ii), but CAP Part (i) expects the continuation of its constant rate (no news except end predicts overreaction to news). The fact that there was no immediate reaction on the announcement date followed by a delayed downward over-reaction from the CAR (ii) suggests inefficiency in the market.

Infrastructure

The model displays a downward trend during the event window on the CAR Part (ii) results. Initially it was consistency through its abnormal return rate. There is some erratic behaviour of peaks and dips in the future in both trends (no news with some overreactions and delayed responses). The cumulative abnormal returns exhibit an almost immediate downward reaction on the announcement date. The fact that the trend starts a little before the announcement date may indicate some insider trading. The CAR (i) becomes a little bit volatile having a couple of over-reactions ins opposite trends and the CAR (ii) line continues a rather steep decent which indicates an inefficient market.

Premier The model displays an upward trend during the event window and is expected to have a positive abnormal return rate (good news). Premier exhibits an immediate negative reaction on the announcement date but then begins to rebound in the upward direction, especially from the CAR (ii) line. The fact that the cumulative abnormal returns trend upward as opposed to remaining constant after the news announcement suggests an inefficient market.

44

Page 49: Group A Report

Quindell The model displays a constant trend during the event window in the abnormal return rate. There is a steady decline in the CAR Part (ii), but levels out eventually (no news). Quindell does not exhibit and immediate reaction to the news announcement and cumulative abnormal returns appears to continue relatively consistent after the announcement suggesting an inefficient market.

Table 18 Cumulative Abnormal Returns for Individual Companies

The table below identifies overall market return and overall market model for each merger.

For the Overall Using Market Return, we gathered each merger’s entire abnormal return rate

into separate columns to calculate the average abnormal rate, the t-score and CAR as one

overall average of all the mergers for the market that particular day. In doing so, we took

each mergers abnormal rate for that day and averaged it to obtain the average abnormal rate.

For the t-score, we divided the current day’s average abnormal rate by the standard deviation

of the entire mergers daily abnormal rate. To calculate the CAR, we summed the current

day’s average abnormal rate with the previous day’s average abnormal rate and graphed the

CAR’s results. For the Overall Using Market Model, we followed the same steps except the

data of each merger derived from the residual regression results. Table 19 will summarize,

analyse and conclude the cumulative average abnormal returns for both the market return and

market model.

Overall the cumulative abnormal returns from the ten mergers analysed appear to provide

evidence for an inefficient market in the FTSE All Share from but CAR (i) and CAR (ii)

testing. The studied merges in general tended to have delayed over reactions to the news

announcements both before and after the announcement date and did not continue a relatively

stable sideward trend after the announcement rather a more volatile period existed during the

post 25 days of announcement which in general suggests an inefficient market.

Event Window Conclusions GraphOverall Using

Market ReturnThis model displays the

cumulative average abnormal rate of the market return. The trend displayed indicates that there is not much reaction to the stock

news. In other words, there was not much news revealed within this market of all the mergers.

45

Page 50: Group A Report

Overall Using Market Model

This model displays the cumulative average abnormal rate of the market model (regression residuals). The trend displayed indicates that there is a lot of reaction to the stock news. In other words, there was a lot of overreaction to the bad news

revealed within this market of all the mergers. Towards the end of the trend, it levels out implying there wasn’t much news in the

market.Table 19 Cumulative Abnormal Returns for Portfolio

Appendix A will give a full size view of each of the graphs for the event studies.

46

Page 51: Group A Report

Appendix-A Event Study Graphs

-30 -20 -10 0 10 20 30

-0.1

-0.08

-0.06

-0.04

-0.02

0

0.02

0.04

CUMULATIVE ABNORMAL RETURNS FOR EP

CAR Part (i)CAR Part (ii)

DAYS

CAR

-30 -20 -10 0 10 20 30

-0.14

-0.12

-0.1

-0.08

-0.06

-0.04

-0.02

0

0.02

0.04

0.06

CUMULATIVE ABNORMAL RETURNS FOR CARRILLION

CAR Part (i)CAR Part (ii)

DAYS

CAR

-30 -20 -10 0 10 20 30

-0.35

-0.3

-0.25

-0.2

-0.15

-0.1

-0.05

0

0.05

CUMULATIVE ABNORMAL RETURNS FOR SINCLAIR

CAR Part (i)CAR Part (ii)

DAYS

CAR

Page 52: Group A Report

-30 -20 -10 0 10 20 30

-0.04

-0.02

0

0.02

0.04

0.06

0.08

CUMULATIVE ABNORMAL RETURNS FOR PEARSON

CAR Part (i)CAR Part (ii)

DAYS

CAR

-30 -20 -10 0 10 20 30

-0.25

-0.2

-0.15

-0.1

-0.05

0

0.05

0.1

CUMULATIVE ABNORMAL RETURNS FOR QATAR

CAR Part (i)CAR Part (ii)

DAYS

CAR

-30 -20 -10 0 10 20 30

-0.14

-0.12

-0.1

-0.08

-0.06

-0.04

-0.02

0

0.02

0.04

0.06

CUMULATIVE ABNORMAL RETURNS FOR SMITHS

CAR Part (i)CAR Part (ii)

DAYS

CAR

Page 53: Group A Report

-30 -20 -10 0 10 20 30

-0.2

-0.15

-0.1

-0.05

0

0.05

0.1

CUMULATIVE ABNORMAL RETURNS FOR GREENE

CAR Part (i)CAR Part (ii)

DAYS

CAR

-30 -20 -10 0 10 20 30

-0.35

-0.3

-0.25

-0.2

-0.15

-0.1

-0.05

0

0.05

CUMULATIVE ABNORMAL RETURNS FOR INFRA

CAR Part (i)CAR Part (ii)

DAYS

CAR

-30 -20 -10 0 10 20 30

-0.05

0

0.05

0.1

0.15

0.2

CUMULATIVE ABNORMAL RETURNS FOR PREMIER

CAR Part (i)CAR Part (ii)

DAYS

CAR

Page 54: Group A Report

-30 -20 -10 0 10 20 30

-2

-1.5

-1

-0.5

0

0.5

1

CUMULATIVE ABNORMAL RETURNS FOR QUINDELL

CAR Part (i)CAR Part (ii)

DAYS

CAR

-20

-18

-16

-14

-12

-10 -8 -6 -4 -2 0 2 4 6 8 10 12 14 16 18 20 22 24

0.4

0.42

0.44

0.46

0.48

0.5

0.52

0.54

OVERALL (i) CUMULATIVE ABNORMAL RETURNS

CAR

DAYS

CA

R

-20

-18

-16

-14

-12

-10 -8 -6 -4 -2 0 2 4 6 8 10 12 14 16 18 20 22 24

-0.25

-0.2

-0.15

-0.1

-0.05

0

OVERALL (ii) CUMALITIVE ABNORMAL RETURNS

CAR

DAYS

CA

R

Page 55: Group A Report

Appendix – B Power Point Presentation of Question 1 Delivered 01/12/2015

Slide 1 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 2 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 3 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Econometrics Group A’s Assessment Project

JazmenBoone (20070746)Henry Doku(20044757)Lee Rock (20038860)Bas Süβ(20069935)

Question 1

Introduction

United States Netherlands

Regression analysis for two countries:

Part A: USA

The distribution of the dependent variable, GDP:

• Histogram• Descriptive Statistics• Boxplot

Descriptive Statistics

Page 56: Group A Report

___________________________________

___________________________________

Slide 4 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 5 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 6 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part A: USA

Histogram Boxplot

Part A: USA

Part A: Netherlands

The distribution of the dependent variable, GDP:• Histogram• Descriptive Statistics• Boxplot

Descriptive Statistics

Page 57: Group A Report

Slide 7 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 8 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 9 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part A: NetherlandsHistogram Boxplot

Part A: Netherlands

Part B: Independent Variables Standard Economic Theory: ൌ�� ሺ ൌ� ሻ

C: Household Consumption Expenditure

I: Foreign Direct Investment

G: Government Consumption Expenditure

NX: Net Exports

Page 58: Group A Report

Slide 10 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 11 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 12 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part B: Independent Variables Our expectations of our independent variables on GDP:

Household Consumption Expenditure

Government Consumption Expenditure

Foreign Direct Investment

Net Exports

POSITIVE

POSITIVE

POSITIVE

POSITIVE

• High observations • High R squared & Adjusted R

Squared• Model is Statistical Significant • Outliers – 2• Slopes follow expectations?• Multicollinearity?

Part C: Regression on GDP

USA

Part C: Regression on GDPUSA

Page 59: Group A Report

Slide 13 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 14 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 15 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part C: Regression on GDPNetherlands

• High observations • High R squared & Adjusted R

Squared• Model is Statistical

Significant • Outliers – 2• Slopes follow expectations?• Multicollinearity?

Part C: Regression on GDPNetherlands

Part D – AutocorrelationUSA

Page 60: Group A Report

Slide 16 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 17 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 18 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part D – AutocorrelationUSA

Part D – AutocorrelationNetherlands

Part D – AutocorrelationNetherlands

Page 61: Group A Report

Slide 19 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 20 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 21 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part D – Normality of ResidualsUSA

Part D – Normality of ResidualsUSA

JarqueBeraTest

Part D – Normality of ResidualsNetherlands

Page 62: Group A Report

Slide 22 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 23 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 24 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part D – Normality of ResidualsNetherlands

JarqueBeraTest

Part E – Specification USA

Ramsey RESET Test

Part F – Addition of Dummy Variable

if the dummy variable is equalled to one then we can expect GDP to be

3.2E+11 -2.4E+11 = 82116505677

If the dummy variable is equal to zero then we can expect GDP to be

3.2E+11

USA = 1 NETHERLANDS = 0

Page 63: Group A Report

Slide 25 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 26 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 27 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part F – Addition of Dummy Variable

Outliers? 2

Statistical Significance? P-Value = 9.37E-11 Confidence Interval = (-

3.0204E+11, -1.7E+11) T-stat = 7.4282 (7.4282 >

1.98)

Reject or Accept Null?

Part F – Interaction Effect

Purpose of Interaction Effect?

Variable Chosen?

R Square? Model Statistically

Significant? Independent Variables

Statistically Significant?Household

Consumption Expenditure

Part F – Interaction Effect

Reject or Accept the Null?

Results?1 = USA we find that GDP is 3.53021E+11

0 = Netherlands we find that GDP is negative at -2.2618E+10

Page 64: Group A Report

Slide 28 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 29 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 30 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part G – Lag VariablesUSA

Government Consumption Expenditure lagged for 2 periods

Part G – Lag VariablesUSA

Results

Part G – Lag VariablesUSA

Presence of Multicollinearity?

Page 65: Group A Report

Slide 31 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 32 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 33 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part G – Lag VariablesNetherlands

Results?

Part G – Lag VariablesNetherlands

Standard Format of Lagged Regression Model

Part G – Lag VariablesUSA

Presence of Multicollinearity?

Page 66: Group A Report

Slide 34 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 35 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 36 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

• Purpose of Time Series?• CollinearityIssues?

Part H – Time SeriesUSA

Part H – Time Series USA

Part H – Time SeriesNetherlands

CollinearityIssues?

Page 67: Group A Report

Slide 37 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Slide 38 ___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

___________________________________

Part H – Time Series Netherlands

Thank You Questions?