math 3680 lecture #18 correlation. the correlation coefficient: intuition

Download Math 3680 Lecture #18 Correlation. The Correlation Coefficient: Intuition

If you can't read please download the document

Upload: philip-lane

Post on 17-Jan-2018

243 views

Category:

Documents


0 download

DESCRIPTION

Lingo: Age = independent variable Height = dependent variable

TRANSCRIPT

Math 3680 Lecture #18 Correlation The Correlation Coefficient: Intuition Lingo: Age = independent variable Height = dependent variable Correlations can be either strong or weak. Also, they can be either positive or negative, as illustrated in the following figures. The correlation coefficient, denoted by r, is a measure of linear association of clustering around a line. It turns out that r always lies between -1 and 1. Summary: Five numbers quantify the relationship between the two variables: The average of the x-values The SD of the x-values The average of the y-values The SD of the y-values The correlation coefficient, r. In a while we will discuss how to compute r. For now, we will focus on its significance. Note: 1. r = 0.80 does not mean 80% of the points are on the line. 2. r = 0.80 does not indicate twice as much linearity as a scatter plot with r = 0.40 Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. If women always married men who were five years older, the correlation between the ages of husbands and wives would be ________________. Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. The correlation between a students age and his/her mothers age is ___________. Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. The correlation between a cars age and fuel efficiency is _____________. Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. On a 10-question multiple choice test, the correlation between number of correct answers and number of wrong answers is ____________. Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. Darts are thrown at a dartboard. The correlation between the x-coordinate of a darts landing and its y-coordinate is ____________. Example. A study finds the following data for the IQs in a population: Husbands: Average = 100,SD = 15 Wives:Average = 100,SD = 15 r = 0.6 Draw a rough, rough sketch of the scatter diagram for this data. The Correlation Coefficient: Computation How to compute the correlation coefficient Convert the x-values and y-values into standard units. The average of the products (divide by n - 1, not n, just like with sample standard deviations) is the correlation coefficient. Excel: =CORREL(A1:A6,B1:B6) To make a scatter plot in Excel 2007: select the data, and on the Insert tab, choose Scatter and click on the picture with only markers. For Excel 2003 or before, see book p. 416 Exercise: Questions 1.How would we arrange the data so that r was as positive as possible? As negative as possible? 2.Suppose we interchange the x and y columns? Does the correlation coefficient change? Questions 3. Suppose we add 5 to all of the x-values. Does the correlation coefficient change? 4. Suppose we triple all of the y-values. Does the correlation coefficient change? The blue dots were obtained by tripling the x- and y- values of the red dots; the y-values were then increased by 8. Which set has the higher correlation coefficient? These changes of scale do not affect the strength of the association. The association is measured in relative terms, not absolute terms. Find the correlation coefficient of the following five data sets: yx yx yx yx yx yx yx r = 6 / 7 r = 11 / 14