4 measures of central tendency
TRANSCRIPT
-
8/6/2019 4 Measures of Central Tendency
1/30
Measures of CentralMeasures of Central
TendencyTendency
to be or not to be
Normal
-
8/6/2019 4 Measures of Central Tendency
2/30
TOPICS
Normal Distributions
Skewness & Kurtosis
Normal Curves and Probability Z- scores
Confidence Intervals
Hypothesis Testing
The t-distribution
-
8/6/2019 4 Measures of Central Tendency
3/30
Is this normal ?
VAR00001
500.0400.0300.0200.0100.0
3.5
3.0
2.5
2.0
1.5
1.0
.5
0.0
St
.
= 160.68
M
an = 178.3
= 6.00
Statistics
VAR00001
6
0
178.3333
2.242
.845
5.2191.741
Vali
Missin
M
an
Sk
n ss
St
. Error o Sk
n ss
rtosisSt
. Error o
rtosis
VAR00001
1 16.7 16.7 16.7
2 33.3 33.3 50.0
2 33.3 33.3 83.3
1 16.7 16.7 100.06 100.0 100.0
70.00
100.00
150.00
500.00
Total
Vali
Fr
q
ncy Percent Vali
Percent
Cumulati
e
Percent
-
8/6/2019 4 Measures of Central Tendency
4/30
Normal DistributionsNormal Distributions
Are your curves normal?
Why do we care about normal curves?
What do normal curves tell us?
Answer:
The curves tell us something about the distribution
of the population
The curves allow us to make statistical inferencesregarding the probability of some outcomes
within some margin of error
-
8/6/2019 4 Measures of Central Tendency
5/30
The normal distributionThe normal distribution
A distribution is easily
depicted in a graph
where the height of the
line determined by thefrequency of cases for
the values beneath it
Most cases cluster
near the middle of adistribution if close to
normal
-
8/6/2019 4 Measures of Central Tendency
6/30
The Normal CurveThe Normal Curve
Bell-shaped distribution or curve
Perfectly symmetrical about the mean
Mean median mode
Tails are asymptotic: closer and closer to
horizontal axis but never reach it
-
8/6/2019 4 Measures of Central Tendency
7/30
Skewness and Sample DistributionsNot all curves are normal, even if still bell-shaped
-
8/6/2019 4 Measures of Central Tendency
8/30
Skewness
Formula for skewness
Sy
medianmeanSkewness
)(3 !
-
8/6/2019 4 Measures of Central Tendency
9/30
Kurtosis (Its not a disease)Kurtosis (Its not a disease)
Beyond skewness, kurtosis tells us whenour distribution may have high or lowvariance, even if normal
The kurtosis value for a normal distributionwill equal Anything above this is apeaked value (low variance) and anything
below is platykurtic (high variance)
-
8/6/2019 4 Measures of Central Tendency
10/30
Back to normal distributions
The power of normal distributions, or those
close to it, is that we can predict where
cases will fall within a distribution
probabilistically
For example, what are the odds, given the
population parameter of human height, that
someone will grow to more than eight feet?
Answer, likely less than a probability
-
8/6/2019 4 Measures of Central Tendency
11/30
Sample DistributionSample Distribution W
hat doesA
ndre theGiant do to the sampledistribution?
What is the probabilityof finding someonelike Andre in thepopulation?
Are you ready formore inferential
statistics?
Answer: Oh boy, yes!!
-
8/6/2019 4 Measures of Central Tendency
12/30
Normal Curves and probabilityNormal Curves and probability
We have answered the question of whatAndre and the Sumo wrestler would do tothe distribution
But what about the probability of findingsomeone the same height as Andre in thepopulation?
What is the probability of finding someonethe same height as Dr Pea or DrBoehmer?
-
8/6/2019 4 Measures of Central Tendency
13/30
More on normal curves and
probability
Andre would be hereDr Boehmer would be here
-
8/6/2019 4 Measures of Central Tendency
14/30
ZZ--Scores (no sleeping!!)Scores (no sleeping!!)
We can standardize the central tendency
away from the mean across different
samples with z-scores
The basic unit of the z-score is the standard
deviation
s
XXz i
)( !
-
8/6/2019 4 Measures of Central Tendency
15/30
We can use the z-score to score each
observation as a distance from the
mean.
How far is a given observation from the
mean when its z-score = 2?
Answer: standard deviations
Approximately what percentage of cases
is a given case higher than if its z-score
= 2?
Answer: %
-
8/6/2019 4 Measures of Central Tendency
16/30
Random Sampling Error
Ever hear a poll report a margin of error? What
is that?
andom Sampling Error standard deviation/ square
root of the sample sizeOr
NW As the variance of the
population increases, sodoes the chance that a
sample could not reflect the
population parameters
-
8/6/2019 4 Measures of Central Tendency
17/30
Standard Error
We often refer to both the random samplingerror with both the chance to err when
sampling but also the error of a specific
sample statistic, the mean We typicallyuse the term Standard Error
Asample statistic standard error is thedifference between the mean of a sample
and the mean of the population from which
it is drawn
-
8/6/2019 4 Measures of Central Tendency
18/30
Standard Error
Example: What if most humans werepounds and only million globally were
pounds?
The random sampling error would be low
since the chance of collecting a sample
consisting heavily of those heavier humans
would be unlikely There would not be
much error in general from sampling
because of the low variance
-
8/6/2019 4 Measures of Central Tendency
19/30
Standard Error
Example continued Now, when we take asample, each sample has a mean If apopulation has low variance, so should thesamples We should see this reflected in
low standard error in the mean of thesample, the sample statistic
Of course, higher variance in thepopulation also causes higher error in
samples taken from it
-
8/6/2019 4 Measures of Central Tendency
20/30
Some more notation
Distributions Mean Standard Dev.
Sample of
observed data sPopulation
epeated
Sampling
X
NW
Error in a Samples mean is the Standard Error
Random Sampling Error
ns
-
8/6/2019 4 Measures of Central Tendency
21/30
Central Limit TheoremCentral Limit Theorem
Remember that if we took an infinite number
of samples from a population, the means
of these samples would be normally
distributed
Hence, the larger the sample relative to the
population, the more likely the sample
mean will capture the population mean
-
8/6/2019 4 Measures of Central Tendency
22/30
Confidence IntervalsConfidence Intervals
We can actually use the information wehave about a standard deviation from themean and calculate the range of values forwhich a sample would have if they were tofall close to the mean of the population
This range is based on the probability that
the sample mean falls close to thepopulation mean with a probability of ,or % error
-
8/6/2019 4 Measures of Central Tendency
23/30
How Confident Are You?How Confident Are You?
Are you % sure? Social scientists use a % as a threshold
to test whether or not the results are
product of chance That is, we take out of chances to be
wrong
What do you MEAN?
We build a % confidence interval to makesure that the mean will be within thatrange
-
8/6/2019 4 Measures of Central Tendency
24/30
Confidence Interval (CI)Confidence Interval (CI)
s yZY WE 2/
Y mean
Z Z score related with a % CI
standard error
rorstandarder*)2(96.1 or sa ple eans
-
8/6/2019 4 Measures of Central Tendency
25/30
Building a CIBuilding a CI
N
y
Y
W
W!
Assume the following
40015
100
!
!
!
N
y
y
W
Q
750.400
15!!
yW
-
8/6/2019 4 Measures of Central Tendency
26/30
CI
53.98
47.101
)750.0)(96.1(100
!
!
s
Lower
Upper
Why do we use ?
-
8/6/2019 4 Measures of Central Tendency
27/30
Calculating a % CICalculating a % CI
Lets look at the class population
distribution of height
Is it a normal or s
kew distribution?
Lets build a % CI around the mean
height of the class
-
8/6/2019 4 Measures of Central Tendency
28/30
Why do we care about CI?Why do we care about CI?
We use CI interval for hypothesis testing
For instance, we want to know if there is
an income difference betweenE
lP
asoand Boston
We want to know whether or not taking
class at Kaplan makes a difference in our
GRE scores
-
8/6/2019 4 Measures of Central Tendency
29/30
Mean Difference testing
Income levels
El Paso BostonLas Cruces
Mean U
SA
-
8/6/2019 4 Measures of Central Tendency
30/30