measures of variability variability: describes the spread or dispersion of scores for a set of data

Measures of Variability

Variability: describes the spread or dispersion of scores for a set of data

Measures of Variation

Range: difference between the highest and lowest scores in a distribution Crude estimate of variation because a

couple extreme scores create a deceptively large range


Standard Deviation: computed measure of how much scores vary around the mean score Tells us how different the scores are from

each other Example: If your high school serves a community

where most families have similar incomes, family income data will have a relatively small standard deviation compared with a school in a more diverse community population

Table 2.4 – page 39


Standard Deviation: Lower standard deviations = more similar

the scores Higher standard deviation = less similar

the scores If a set of test scores has a standard deviation of 5,

then everyone who took the test scored similarly If the scores have standard deviation of 50, then the

test-takers’ scores were not very similar at all!

More on Standard Deviation Standard deviation is kind of the “mean of the mean” and can often help you

get the real story behind the data. It is how far, on average, scores deviate from the mean.

The standard deviation is a statistic that tells you how tightly all the various examples are clustered around the mean in a set of data.

When the examples are pretty tightly bunched together and the bell-shaped curve is steep, the standard deviation is small.

When the examples are spread apart and the bell curve is relatively flat, that tells you that you have a relatively large standard deviation.

See how standard deviation changes as you manipulate a graph! http://www.stat.tamu.edu/~west/applets/normaldemo1.html

Descriptive Statistics

http://bcs.worthpublishers.com/psychsim5/Descriptive%20Statistics/PsychSim_Shell.html

Correlations

A correlation is a statistical technique that allows us to understand the degree to which two variables are related.

The strength and direction of correlations can be illustrated graphically in a scatterplot, in which paired X and Y scores for each subject are plotted as single points on a graph. The slope of a line that best fits the pattern of points

suggests the degree and direction of the relationship between the two variables.

Scatterplots and Correlations

Positive Correlation As one number increases, the other increases. Ex: Study time to GPANegative Correlation As one number increases, the other decreases. Ex: Absences to GPANo Correlation Variables do not affect one another in a significant way Ex: Height to GPA

Illusory Correlation

Illusory Correlation: We believe there is a relationship between two things when it actually doesn’t exist.

“Feline High-Rise Syndrome”

The New York Times reported evidence for cats’ exceptional ability to survive falls from high-rise buildings.

They cited evidence that from June – November, 1984, 132 such victims were admitted to the Animal Medical Center, only 8 of which died from their injuries. What is the problem with this correlation?

Correlation

http://bcs.worthpublishers.com/gray/content/psychsim5/Correlation/correlation.htm

Inferential Statistics Whereas descriptive statistics simply describe a

data set, inferential statistics attempt to make inferences about a larger population based on a data set. For example, if you're interested in studying CHS

student behavior, you would have a hard time collecting data from each and every student.

Instead, you collect data from a sample of students, representative of the entire CHS student body, and then make inferences about the student body based on the data collected from your sample.

Inferential Statistics (cont.) If this chance variability always exists in data

collection, how can a researcher be confident that the inferences he or she makes about the larger population (the entire CHS student body) is accurate? We use inferential statistics!

Instead of making absolute conclusions about the population, researchers make statements about the population using the laws of probability and statistical significance.

Statistical Significance

Statistical Significance: statistical statement of how likely it is that an obtained result occurred by chance

To Summarize

Descriptive Statistics:

Organize and summarize data

Central Tendency: mean, median, mode

Standard deviation: variation in data

Range: distance from smallest to largest

Inferential Statistics:

Interpret data and draw conclusions

Used to test validity of hypothesis

Biased or insufficient samples

“Four out of five dentists surveyed recommended Brand X gum.” The # of dentists surveyed is not

clear How were the dentists chosen?

Was it a random sample, or were 5 dentists chosen because they hold stock in Brand X gum?

Many mail-in surveys suffer from a selection bias – the people who send them in may differ in important ways from those who do not.

The Misleading Average

Example: The principal of a small private school met the criticism that his faculty has no teaching experience by issuing the statement that the average experience of each member of the faculty was 5 years.

This statement was technically true: there were five teachers in the school including the principal, but the principal neglected to mention that he had twenty-five years of experience while the remaining four members had none.

Exaggerated graphs

Depending on the height and width of each of the axes, different pictures of the data will emerge. Also, the data will look different, depending on how the axes (particularly the y-axis) are labeled.

measures of variability variability: describes the spread or dispersion of scores for a set of data

Documents

large standard deviation

small standard deviation

standard deviation changes

set of test scores

lowest scores

dispersion of scores

scores deviate

y scores