
Regression

[Example 3.] As a motivating example, suppose we are modelling sales data over time.

TIME    1990  1991  1992  1993  1994  1995
SALES      3     5     4     5     6     7

We seek the straight line “Y = m X + c” that best approximates the data. By “best” in this case, we mean the line which minimizes the sum of squares of vertical deviations of points from the line:

SS = Σ ( Yi − [ m Xi + c ] )² .

Setting the partial derivatives of SS with respect to m and c to zero leads to the “normal equations”

Σ Y  = m Σ X  + n c ,   where n = # points,
Σ XY = m Σ X² + c Σ X .
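For completeness, here is a sketch of that derivation (not in the original notes), written in LaTeX:

    % Sketch: minimise SS by setting both partial derivatives to zero.
    \begin{align*}
    \frac{\partial SS}{\partial c} = -2\sum_i \bigl(Y_i - [\,m X_i + c\,]\bigr) = 0
        &\;\Longrightarrow\; \sum Y = m \sum X + n c, \\
    \frac{\partial SS}{\partial m} = -2\sum_i X_i \bigl(Y_i - [\,m X_i + c\,]\bigr) = 0
        &\;\Longrightarrow\; \sum XY = m \sum X^2 + c \sum X.
    \end{align*}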

Let 1990 correspond to Year 0.

 X²    X   X·Y    Y    Y²
  0    0    0     3     9
  1    1    5     5    25
  4    2    8     4    16
  9    3   15     5    25
 16    4   24     6    36
 25    5   35     7    49
 --   --   --    --    --
 55   15   87    30   160    (column totals)
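As a quick check of the tabulated sums, here is a minimal Python sketch (the variable names are ours, not part of the notes):

    X = [0, 1, 2, 3, 4, 5]   # years, with 1990 as Year 0
    Y = [3, 5, 4, 5, 6, 7]   # sales

    print(sum(x * x for x in X))              # ΣX²  -> 55
    print(sum(X))                             # ΣX   -> 15
    print(sum(x * y for x, y in zip(X, Y)))   # ΣX·Y -> 87
    print(sum(Y))                             # ΣY   -> 30
    print(sum(y * y for y in Y))              # ΣY²  -> 160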

[Figure: scatter plot of Sales against Time (Year 0 = 1990), with the fitted line Y = m X + c; the vertical deviation of a data point (Xi, Yi) from its fitted value m Xi + c is marked.]

Example 3 - Workings.

The normal equations are:

30 = 15 m + 6 c     =>   150 = 75 m + 30 c
87 = 55 m + 15 c    =>   174 = 110 m + 30 c

Subtracting: 24 = 35 m, so m = 24/35.
Substituting back: 30 = 15 (24/35) + 6 c  =>  c = 23/7.

Thus the regression line of Y on X is

Y = (24/35) X + 23/7,

and to plot the line we need two points, so X = 0 => Y = 23/7, and X = 5 => Y = (24/35)·5 + 23/7 = 47/7.
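The elimination above can be reproduced exactly with Python's fractions module; a minimal sketch, using the sums computed earlier:

    from fractions import Fraction

    n, Sx, Sy, Sxx, Sxy = 6, 15, 30, 55, 87
    # Eliminating c from  Sy = m·Sx + n·c  and  Sxy = m·Sxx + c·Sx:
    m = Fraction(n * Sxy - Sx * Sy, n * Sxx - Sx ** 2)   # 24/35
    c = (Fraction(Sy) - m * Sx) / n                      # 23/7
    print(m, c)           # 24/35 23/7
    print(c, m * 5 + c)   # the two plotting points: 23/7 and 47/7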

It is easy to see that ( X̄ , Ȳ ) satisfies the fitted equation (divide the first normal equation through by n), so that the regression line of Y on X passes through the “Center of Gravity” of the data. By expanding terms, we also get

Σ ( Yi − Ȳ )² = Σ ( Yi − [ m Xi + c ] )² + Σ ( [ m Xi + c ] − Ȳ )²

Total Sum of Squares = Error Sum of Squares + Regression Sum of Squares, i.e.

SST = SSE + SSR.
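A small numerical check of this decomposition for Example 3 (a sketch, reusing the fitted m and c from above):

    X = [0, 1, 2, 3, 4, 5]
    Y = [3, 5, 4, 5, 6, 7]
    m, c = 24 / 35, 23 / 7
    ybar = sum(Y) / len(Y)
    fit = [m * x + c for x in X]

    SST = sum((y - ybar) ** 2 for y in Y)             # 10.0
    SSE = sum((y - f) ** 2 for y, f in zip(Y, fit))   # ≈ 1.771
    SSR = sum((f - ybar) ** 2 for f in fit)           # ≈ 8.229
    print(SST, SSE + SSR)   # equal, up to floating-point rounding
    print(SSR / SST)        # ≈ 0.823, the r² met in the next section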

In regression, we refer to the X variable as the independent variable and Y as the dependent variable.

[Figure: the deviation Yi − Ȳ of a data point from the mean, split into an error part Yi − (m Xi + c) and a regression part (m Xi + c) − Ȳ.]

Correlation

The coefficient of determination r² (which takes values in the range 0 to 1) is a measure of the proportion of the total variation that is associated with the regression process:

r² = SSR / SST = 1 − SSE / SST.

The coefficient of correlation r (which takes values in the range −1 to +1) is more commonly used as a measure of the degree to which a mathematical relationship exists between X and Y. It can be calculated from the formula:

r = Σ ( X − X̄ )( Y − Ȳ ) / √[ Σ ( X − X̄ )² · Σ ( Y − Ȳ )² ]

  = { n ΣXY − (ΣX)(ΣY) } / √[ { n ΣX² − (ΣX)² } { n ΣY² − (ΣY)² } ]

Example. In our case r = {6(87) − (15)(30)} / √[ {6(55) − 15²} {6(160) − 30²} ] = 0.907.
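The same arithmetic in Python, as a sketch:

    from math import sqrt

    n, Sx, Sy, Sxx, Syy, Sxy = 6, 15, 30, 55, 160, 87
    r = (n * Sxy - Sx * Sy) / sqrt((n * Sxx - Sx ** 2) * (n * Syy - Sy ** 2))
    print(round(r, 3))   # 0.907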

[Figure: scatter plots illustrating r = −1, r = 0 and r = +1.]

Collinearity

If the value of the correlation coefficient is greater than 0.9 or less than −0.9, we would take this to mean that there is a mathematical relationship between the variables. This does not imply that a cause-and-effect relationship exists.

Consider a country with a slowly changing population size, where a certain political party retains a relatively stable percentage of the poll in elections. Let

X = Number of people that vote for the party in an election
Y = Number of people that die due to a given disease in a year
Z = Population size.

Then the correlation coefficient between X and Y is likely to be close to 1, indicating a mathematical relationship between them: X is a function of Z, and Y is also a function of Z. It would clearly be silly to suggest that the incidence of the disease is caused by the number of people that vote for the given political party. This is known as the problem of collinearity.
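The effect is easy to reproduce in a simulation. The sketch below (illustrative only; all numbers are invented) drives both X and Y from a common population size Z and then measures their correlation:

    import random
    random.seed(0)

    Z = [1000 + 10 * t for t in range(50)]            # slowly growing population
    X = [0.40 * z + random.gauss(0, 5) for z in Z]    # votes: a stable ~40% of the poll
    Y = [0.02 * z + random.gauss(0, 1) for z in Z]    # deaths: ~2% of the population

    def corr(a, b):
        n = len(a)
        ma, mb = sum(a) / n, sum(b) / n
        cov = sum((u - ma) * (v - mb) for u, v in zip(a, b))
        va = sum((u - ma) ** 2 for u in a)
        vb = sum((v - mb) ** 2 for v in b)
        return cov / (va * vb) ** 0.5

    print(round(corr(X, Y), 3))   # close to 1, although neither variable causes the other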

Spotting hidden dependencies between distributions can be difficult. Statistical experimentation can only be used to disprove hypotheses, or to lend evidence to support the view that reputed relationships between variables may be valid. Thus, the fact that we observe a high correlation coefficient between deaths due to heart failure in a given year and the number of cigarettes consumed twenty years earlier does not establish a cause-and-effect relationship. However, this result may be of value in directing biological research in a particular direction.
