exploring relationships between variables ch. 10 scatterplots, associations, and correlations ch. 10...

20
Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations

Upload: tracy-daniel

Post on 24-Dec-2015

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Exploring relationships between variables

Exploring relationships between variables

Ch. 10Scatterplots, Associations,

and Correlations

Ch. 10Scatterplots, Associations,

and Correlations

Page 2: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

ScatterplotsScatterplots

• Shows change over time• Shows patterns• Shows Trends• Relationships• Outlier values

• Shows change over time• Shows patterns• Shows Trends• Relationships• Outlier values

Page 3: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Scatterplots Scatterplots

• Can be positive or negative• Show relationship amongst 2

variables• Can be shown more in depth

through the Z-scores of both variables (ZX, ZY)

• Can be positive or negative• Show relationship amongst 2

variables• Can be shown more in depth

through the Z-scores of both variables (ZX, ZY)

Page 4: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Z-scoresZ-scores

• X-MeanX / Standard Deviation (SX)

• Y-MeanY / Standard Deviation (SY)

• Calculating standard deviation in the same way as before.

• X-MeanX / Standard Deviation (SX)

• Y-MeanY / Standard Deviation (SY)

• Calculating standard deviation in the same way as before.

Page 5: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

RatioRatio

• Correlation coefficient• Sum of SX * SY / n-1• Correlation measures the

strength of the linear association between 2 variables

• Correlation coefficient• Sum of SX * SY / n-1• Correlation measures the

strength of the linear association between 2 variables

Page 6: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

variablesvariables

• Explanatory Variable – X• Response Variable - Y

• Explanatory Variable – X• Response Variable - Y

Page 7: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Least-Squares LineLeast-Squares Line• Y= a + bx• a = y intercept• b = slope• a = y – bx• b = SSxy/SSx • SSx = Sum of squares of x

• Y= a + bx• a = y intercept• b = slope• a = y – bx• b = SSxy/SSx • SSx = Sum of squares of x

Page 8: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

SSxSSx

• This is calculated by obtaining the sum of each squared x

• You then subtract the sum of x squared divided by n

• You can get SSx on the calculator by squaring the standard deviation then multiplying it by (n-1)

• This is calculated by obtaining the sum of each squared x

• You then subtract the sum of x squared divided by n

• You can get SSx on the calculator by squaring the standard deviation then multiplying it by (n-1)

Page 9: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

SSxySSxy

• Sum of squares of x and y• Take the sum of each x value

times each y value.• You then subtract from that

total the (Sum of x) * (Sum of y) n

• Sum of squares of x and y• Take the sum of each x value

times each y value.• You then subtract from that

total the (Sum of x) * (Sum of y) n

Page 10: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

SSxySSxy

• SSxy is a more efficient way of computing

• Sum of each (x-xbar) * (y-ybar)

• SSxy is a more efficient way of computing

• Sum of each (x-xbar) * (y-ybar)

Page 11: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Complete Guided Ex. #3 page 566

Complete Guided Ex. #3 page 566

Page 12: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Standard Error of Estimate

Standard Error of Estimate

• Se = square root of E(y-yp)squared/n – 2

• How to calculate square root of SDY – b(SDx * SDy) / n-2

• Se = square root of E(y-yp)squared/n – 2

• How to calculate square root of SDY – b(SDx * SDy) / n-2

Page 13: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

ResidualsResiduals

• You can graph the residual of the equation to see if the regression is accurate

• Residuals are the difference between the observed value and the predicted value

• R = observed - predicted

• You can graph the residual of the equation to see if the regression is accurate

• Residuals are the difference between the observed value and the predicted value

• R = observed - predicted

Page 14: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Confidence IntervalsConfidence Intervals

• Yp – E < y < yp + E• Yp = predicted value of y

• Yp – E < y < yp + E• Yp = predicted value of y

Page 15: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

What does this mean (better understanding)What does this mean

(better understanding)

Page 16: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Types of dataTypes of data

• Outlier• Leverage• Influential Point• Lurking Variable

• Outlier• Leverage• Influential Point• Lurking Variable

Page 17: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

OutlierOutlier

• Any data point that stands away from the others

• Any data point that stands away from the others

Page 18: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

LeverageLeverage

• Data points with X-values that are far from the mean

• Can alter the line of least regression

• Data points with X-values that are far from the mean

• Can alter the line of least regression

Page 19: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Influential PointInfluential Point

• Omitting this point can drastically alter the regression model

• Omitting this point can drastically alter the regression model

Page 20: Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations

Lurking VariableLurking Variable

• A variable that is hidden in the equation

• It is not explicitly part of the model but affects the way the variables in the model appear

• A variable that is hidden in the equation

• It is not explicitly part of the model but affects the way the variables in the model appear