Population models The t-test Two sample tests Checking assumptions
Inference based on population models
Applied Statistics and Experimental Design, Chapter 3
Peter Hoff
Statistics, Biostatistics and the CSSS, University of Washington
Motivation
In some situations the experiment may
• be complicated,
• be non- or partially randomized, or
• include nuisance factors.
A null distribution based on randomization may be difficult to obtain.
An alternative approach to testing is to formulate a population model.
Population models

Consider the following model for our wheat yield experiment:
• There is a large (conceptually infinite) population of plots, similar in size/shape/composition to the plots in our experiment.
• When A is applied to these plots, the distribution of plot yields ≈ a probability distribution $p_A$:
  expectation: $E[Y_A] = \int y \, p_A(y)\, dy = \mu_A$,
  variance: $\mathrm{Var}[Y_A] = E[(Y_A - \mu_A)^2] = \sigma_A^2$.
• When B is applied to these plots, the distribution of plot yields ≈ a probability distribution $p_B$:
  expectation: $E[Y_B] = \int y \, p_B(y)\, dy = \mu_B$,
  variance: $\mathrm{Var}[Y_B] = E[(Y_B - \mu_B)^2] = \sigma_B^2$.
• The A- and B-plots in the experiment are independent samples from $p_A$ and $p_B$:
  $Y_{1,A}, \ldots, Y_{n_A,A} \sim$ i.i.d. $p_A$
  $Y_{1,B}, \ldots, Y_{n_B,B} \sim$ i.i.d. $p_B$
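This sampling model can be sketched in a short simulation. The normal population shapes and all parameter values below ($\mu_A = 20$, $\sigma_A = 4$, etc., and the group sizes) are hypothetical choices for illustration, not quantities taken from the experiment.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical populations p_A and p_B (normal shapes and all parameter
# values are illustrative assumptions, not estimates from the experiment).
mu_A, sigma_A = 20.0, 4.0
mu_B, sigma_B = 24.0, 5.0
n_A, n_B = 10, 10

# Independent i.i.d. samples from p_A and p_B, as in the population model.
y_A = rng.normal(mu_A, sigma_A, size=n_A)
y_B = rng.normal(mu_B, sigma_B, size=n_B)

ybar_A, ybar_B = y_A.mean(), y_B.mean()
print(round(ybar_A, 2), round(ybar_B, 2))
```

Each run of the "experiment" yields new samples, so $\bar y_A$ and $\bar y_B$ vary from run to run around $\mu_A$ and $\mu_B$.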
Population model

Figure: The population model. Top panels show the densities of the 'all possible' A and B wheat yields (with means $\mu_A$ and $\mu_B$); bottom panels show histograms of the experimental samples obtained by random sampling, with $\bar y_A = 18.37$, $s_A = 4.23$ and $\bar y_B = 24.30$, $s_B = 5.15$.
Population expectations and variances

Recall from intro stats:
$$E[\bar Y_A] = E\left[\frac{1}{n_A}\sum_{i=1}^{n_A} Y_{i,A}\right] = \frac{1}{n_A}\sum_{i=1}^{n_A} E[Y_{i,A}] = \frac{1}{n_A}\sum_{i=1}^{n_A} \mu_A = \mu_A.$$
We say that $\bar Y_A$ is an unbiased estimator of $\mu_A$.
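Unbiasedness can be checked empirically: averaging $\bar y_A$ over many simulated experiments recovers $\mu_A$. The population used below (normal with $\mu_A = 20$, $\sigma_A = 4$) is a hypothetical stand-in for $p_A$.

```python
import numpy as np

rng = np.random.default_rng(1)
mu_A, sigma_A, n_A = 20.0, 4.0, 10   # hypothetical population and sample size

# Repeat the experiment many times, recording the sample mean each time.
m = 20000
ybars = rng.normal(mu_A, sigma_A, size=(m, n_A)).mean(axis=1)

# E[Ybar_A] = mu_A: the average of the sample means is close to mu_A.
print(round(ybars.mean(), 2))
```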
Population expectations and variances

Furthermore, if $Y_{1,A}, \ldots, Y_{n_A,A}$ are independent samples from population A,
$$\mathrm{Var}[\bar Y_A] = E[(\bar Y_A - \mu_A)^2] = \mathrm{Var}\left[\frac{1}{n_A}\sum Y_{i,A}\right] = \frac{1}{n_A^2}\sum \mathrm{Var}[Y_{i,A}] = \frac{1}{n_A^2}\sum \sigma_A^2 = \sigma_A^2/n_A.$$
This means that as $n_A \to \infty$,
$$\mathrm{Var}[\bar Y_A] = E[(\bar Y_A - \mu_A)^2] \to 0,$$
which, together with unbiasedness, implies $\bar Y_A \to \mu_A$, and we say that $\bar Y_A$ is a consistent estimator of $\mu_A$.
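The $\sigma_A^2/n_A$ result can also be seen by simulation: as the sample size grows, the variance of the simulated sample means shrinks at the predicted rate. The population parameters below are again hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)
mu, sigma = 20.0, 4.0   # hypothetical population mean and sd
m = 20000               # number of simulated experiments per sample size

var_hat = {}
for n in (10, 40, 160):
    ybars = rng.normal(mu, sigma, size=(m, n)).mean(axis=1)
    var_hat[n] = ybars.var()
    # Empirical Var[Ybar] should be close to sigma^2 / n, shrinking with n.
    print(n, round(var_hat[n], 3), round(sigma**2 / n, 3))
```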
Asymptotic consistency

Generally, sample characteristics are consistent for population characteristics: as $n_A \to \infty$,
$$\bar Y_A \to \mu_A$$
$$s_A^2 \to \sigma_A^2$$
$$\hat F_A(x) = \frac{\#\{Y_{i,A} \le x\}}{n_A} \to F_A(x) = \int_{-\infty}^{x} p_A(y)\, dy$$
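The convergence of the empirical CDF can be illustrated directly; the normal population below is a hypothetical choice for $p_A$, and the evaluation point $x = 22$ is arbitrary.

```python
import numpy as np
from math import erf, sqrt

rng = np.random.default_rng(3)
mu, sigma = 20.0, 4.0   # hypothetical normal population

def F(x):
    # Population CDF F_A(x) for the assumed normal population.
    return 0.5 * (1 + erf((x - mu) / (sigma * sqrt(2))))

def Fhat(y, x):
    # Empirical CDF: #{Y_i <= x} / n.
    return np.mean(y <= x)

x = 22.0
for n in (10, 1000, 100000):
    y = rng.normal(mu, sigma, size=n)
    print(n, round(Fhat(y, x), 4), round(F(x), 4))
```

As $n$ grows, $\hat F_A(x)$ settles onto $F_A(x)$.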
Testing for population differences

Population hypotheses: if $\mu_B > \mu_A$ we would recommend B over A, and vice versa, so we test
• $H_0: \mu_A = \mu_B$
• $H_1: \mu_A \neq \mu_B$

Testing procedure: the experiment is performed and it is observed that
$$18.37 = \bar y_A < \bar y_B = 24.30.$$
This is some evidence that $\mu_B > \mu_A$. How much evidence is it? Should we reject the null hypothesis?
Testing with the t-statistic

Consider evaluating evidence against $H_0$ with our t-statistic:
$$g_t(\mathbf{y}_A, \mathbf{y}_B) = \frac{|\bar y_B - \bar y_A|}{s_p \sqrt{1/n_A + 1/n_B}}$$
To decide whether to reject $H_0$ or not, we need to know what $g_t(\mathbf{y}_A, \mathbf{y}_B)$ looks like under $H_0$, i.e. the distribution of $g_t(\mathbf{Y}_A, \mathbf{Y}_B)$ under $H_0$.

Consider the following setup:
Assume: $Y_{1,A}, \ldots, Y_{n_A,A} \sim$ i.i.d. $p_A$ and $Y_{1,B}, \ldots, Y_{n_B,B} \sim$ i.i.d. $p_B$.
Evaluate: $H_0: \mu_A = \mu_B$ versus $H_1: \mu_A \neq \mu_B$, i.e., whether or not
$$\int y\, p_A(y)\, dy = \int y\, p_B(y)\, dy.$$
To make this evaluation and obtain a p-value, we need the distribution of $g_t(\mathbf{Y}_A, \mathbf{Y}_B)$ under $\mu_A = \mu_B$. This will involve assumptions about $p_A$ and $p_B$.
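The statistic itself is straightforward to compute from the two samples. A minimal sketch follows; the raw yields below are fabricated for illustration (the slides report only the group summaries, not individual plot yields).

```python
import numpy as np

def t_stat(y_A, y_B):
    """Two-sample t-statistic |ybar_B - ybar_A| / (s_p * sqrt(1/n_A + 1/n_B))."""
    n_A, n_B = len(y_A), len(y_B)
    # Pooled variance s_p^2: a weighted average of the two sample variances.
    s2_p = ((n_A - 1) * np.var(y_A, ddof=1) +
            (n_B - 1) * np.var(y_B, ddof=1)) / (n_A + n_B - 2)
    return abs(np.mean(y_B) - np.mean(y_A)) / np.sqrt(s2_p * (1 / n_A + 1 / n_B))

# Illustrative (made-up) plot yields for treatments A and B.
y_A = np.array([13.0, 16.9, 19.2, 20.4, 22.1, 18.6])
y_B = np.array([19.5, 22.3, 24.1, 25.8, 28.2, 25.9])
print(round(t_stat(y_A, y_B), 3))
```

The remaining work, which the following slides take up, is finding the distribution of this quantity under $H_0$.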
The normal distribution

The normal distribution is useful because in many cases,
• our data are approximately normally distributed, and/or
• our sample means are approximately normally distributed.
These are both due to the central limit theorem.

Letting $P(\mu, \sigma^2)$ denote a population with mean $\mu$ and variance $\sigma^2$, then
$$\left.\begin{array}{c} X_1 \sim P_1(\mu_1, \sigma_1^2) \\ X_2 \sim P_2(\mu_2, \sigma_2^2) \\ \vdots \\ X_m \sim P_m(\mu_m, \sigma_m^2) \end{array}\right\} \Rightarrow \sum_{j=1}^{m} X_j \,\dot\sim\, \mathrm{normal}\left(\sum \mu_j, \sum \sigma_j^2\right)$$
(if the $X_j$'s are independent).

Sums of varying quantities are approximately normally distributed.
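This can be demonstrated with clearly non-normal summands. Below, each $X_j$ is exponential with its own rate; the number of summands and the rates are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(4)

# m independent, non-normal populations P_j: exponentials with varying rates.
m = 30
rates = 1.0 + rng.random(m)     # hypothetical rate parameters in (1, 2)
mus = 1.0 / rates               # mu_j for an exponential(rate) variable
sig2s = 1.0 / rates**2          # sigma_j^2

# Draw the sum S = X_1 + ... + X_m many times.
reps = 50000
X = rng.exponential(scale=1.0 / rates, size=(reps, m))
S = X.sum(axis=1)

# CLT: S is approximately normal(sum mu_j, sum sigma_j^2).
print(round(S.mean(), 2), round(mus.sum(), 2))
print(round(S.var(), 2), round(sig2s.sum(), 2))
```

Despite each summand being skewed, the sum's mean and variance match the CLT prediction and its histogram is close to a normal curve.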
Normally distributed data

Consider crop yields from plots of land:
$$Y_1 = a_1 \times \mathrm{seed}_1 + a_2 \times \mathrm{soil}_1 + a_3 \times \mathrm{water}_1 + a_4 \times \mathrm{sun}_1 + \cdots$$
$$Y_2 = a_1 \times \mathrm{seed}_2 + a_2 \times \mathrm{soil}_2 + a_3 \times \mathrm{water}_2 + a_4 \times \mathrm{sun}_2 + \cdots$$
$$\vdots$$
$$Y_n = a_1 \times \mathrm{seed}_n + a_2 \times \mathrm{soil}_n + a_3 \times \mathrm{water}_n + a_4 \times \mathrm{sun}_n + \cdots$$
The empirical distribution of yields from a population of fields with varying quantities of seed, soil, water, sun, etc. will be approximately normal$(\mu, \sigma^2)$, with $\mu$ and $\sigma^2$ depending on the effects $a_1, a_2, a_3, a_4, \ldots$ and the variability of seed, soil, water, sun, etc.

Additive effects ⇒ normally distributed data
Normally distributed means

Consider a population $p$ with $E[Y] = \mu$, $\mathrm{Var}[Y] = \sigma^2$.

Experiment 1: sample $y^{(1)}_1, \ldots, y^{(1)}_n \sim$ i.i.d. $p$ and compute $\bar y^{(1)}$;
Experiment 2: sample $y^{(2)}_1, \ldots, y^{(2)}_n \sim$ i.i.d. $p$ and compute $\bar y^{(2)}$;
$\vdots$
Experiment m: sample $y^{(m)}_1, \ldots, y^{(m)}_n \sim$ i.i.d. $p$ and compute $\bar y^{(m)}$.

The distribution of $\{\bar y^{(1)}, \ldots, \bar y^{(m)}\}$ is approximately normal$(\mu, \sigma^2/n)$:
sample mean of $\{\bar y^{(1)}, \ldots, \bar y^{(m)}\} \approx \mu$
sample variance of $\{\bar y^{(1)}, \ldots, \bar y^{(m)}\} \approx \sigma^2/n$
histogram of $\{\bar y^{(1)}, \ldots, \bar y^{(m)}\} \approx$ normal$(\mu, \sigma^2/n)$

i.e. the sampling distribution of the mean is approximately normal$(\mu, \sigma^2/n)$, even if the sampling distribution of the data is not normal.
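The repeated-experiment scheme above can be sketched with a deliberately non-normal population; the exponential population, $n$, and $m$ below are hypothetical choices.

```python
import numpy as np

rng = np.random.default_rng(5)

# A clearly non-normal population: exponential with mean mu = 2, var sigma^2 = 4.
mu, sigma2, n, m = 2.0, 4.0, 25, 40000

# m repeated experiments, each producing a sample mean ybar^(j).
ybars = rng.exponential(scale=mu, size=(m, n)).mean(axis=1)

# The ybar's behave like a normal(mu, sigma^2/n) sample.
print(round(ybars.mean(), 3), mu)
print(round(ybars.var(), 3), sigma2 / n)
```

Even though the raw data are skewed, the collection of sample means is nearly symmetric with variance close to $\sigma^2/n$.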
Properties of the normal distribution

• $Y \sim \mathrm{normal}(\mu, \sigma^2) \Rightarrow aY + b \sim \mathrm{normal}(a\mu + b, a^2\sigma^2)$.
• $Y_1 \sim \mathrm{normal}(\mu_1, \sigma_1^2)$, $Y_2 \sim \mathrm{normal}(\mu_2, \sigma_2^2)$, with $Y_1, Y_2$ independent $\Rightarrow Y_1 + Y_2 \sim \mathrm{normal}(\mu_1 + \mu_2, \sigma_1^2 + \sigma_2^2)$.
• If $Y_1, \ldots, Y_n \sim$ i.i.d. $\mathrm{normal}(\mu, \sigma^2)$, then $\bar Y$ is statistically independent of $s^2$.

How does this help with hypothesis testing? Consider testing $H_0: \mu_A = \mu_B$ (treatment doesn't affect the mean). Regardless of the distribution of the data, under $H_0$:
$$\bar Y_A \,\dot\sim\, \mathrm{normal}(\mu, \sigma_A^2/n_A), \qquad \bar Y_B \,\dot\sim\, \mathrm{normal}(\mu, \sigma_B^2/n_B), \qquad \bar Y_B - \bar Y_A \,\dot\sim\, \mathrm{normal}(0, \sigma_{AB}^2),$$
where $\sigma_{AB}^2 = \sigma_A^2/n_A + \sigma_B^2/n_B$.
If we knew the variances, we'd have a null distribution.
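To see how a known-variance null distribution would be used, here is a sketch of the resulting z-test. The observed means are the slide's summaries; treating $s_A^2, s_B^2$ as the true variances and taking $n_A = n_B = 6$ are assumptions made purely for illustration.

```python
from math import erf, sqrt

# Observed summaries from the slides; variances and group sizes are
# assumptions for illustration (we pretend s^2 equals the true sigma^2).
sigma2_A, sigma2_B = 4.23**2, 5.15**2
n_A = n_B = 6
ybar_A, ybar_B = 18.37, 24.30

# Null variance of Ybar_B - Ybar_A and the standardized statistic.
sigma2_AB = sigma2_A / n_A + sigma2_B / n_B
z = (ybar_B - ybar_A) / sqrt(sigma2_AB)

# Two-sided p-value from the normal(0, 1) null distribution.
Phi = lambda x: 0.5 * (1 + erf(x / sqrt(2)))
p_value = 2 * (1 - Phi(abs(z)))
print(round(z, 2), round(p_value, 4))
```

In practice the variances are unknown and must be estimated, which is what leads to the t distribution in place of the normal.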
A one sample test for the mean

Consider a simple one-sample hypothesis test:
$Y_1, \ldots, Y_n \sim$ i.i.d. $P$, with mean $\mu$ and variance $\sigma^2$.
$H_0: \mu = \mu_0$
$H_1: \mu \neq \mu_0$

Examples:
Physical therapy
• $Y_i$ = muscle strength after treatment − muscle strength before.
• $H_0: E[Y_i] = 0$
Physics
• $Y_i$ = boiling point of a sample of an unknown liquid.
• $H_0: E[Y_i] = 100$°C.

To test $H_0$, we need a test statistic and its distribution under $H_0$.
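A natural statistic for this problem, developed in what follows, is $(\bar y - \mu_0)/(s/\sqrt{n})$. A minimal sketch of computing it; the paired-difference data below are fabricated for illustration of the physical therapy example.

```python
import numpy as np

def one_sample_t(y, mu0):
    """One-sample t-statistic (ybar - mu0) / (s / sqrt(n)) for H0: mu = mu0."""
    n = len(y)
    return (np.mean(y) - mu0) / (np.std(y, ddof=1) / np.sqrt(n))

# Illustrative (made-up) after-minus-before muscle strength differences.
y = np.array([1.2, -0.4, 0.8, 2.1, 0.3, 1.5, -0.1, 0.9])
t = one_sample_t(y, mu0=0.0)
print(round(t, 3))
```

Large values of $|t|$ are evidence against $H_0: \mu = 0$; the reference null distribution is the subject of the next slides.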
Choosing a test statistic

g(y) = |ȳ − µ0| might make a good test statistic:
• it is sensitive to deviations from H0;
• its sampling distribution is approximately known:

  E[Ȳ] = µ
  Var[Ȳ] = σ²/n
  Ȳ is approximately normal.

Can we obtain a null distribution? Under H0,

  (Ȳ − µ0) ∼ normal(0, σ²/n),

but we can't use this as a null distribution because σ² is unknown.
The null distribution must not depend on unknown parameters.
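The sampling-distribution facts above can be checked by simulation. A minimal sketch (not from the slides; the exponential population and all constants are arbitrary illustrative choices) draws repeated samples and compares the empirical mean and variance of Ȳ to µ and σ²/n:

```python
# Simulation check of E[Ybar] = mu and Var[Ybar] = sigma^2 / n.
# Population: exponential with mean 1 (so mu = 1, sigma^2 = 1); illustrative choice.
import numpy as np

rng = np.random.default_rng(0)
mu, sigma2, n, reps = 1.0, 1.0, 30, 20000

# reps samples of size n; one sample mean per row
ybars = rng.exponential(scale=mu, size=(reps, n)).mean(axis=1)

print(ybars.mean())  # close to mu
print(ybars.var())   # close to sigma^2 / n
```

A histogram of `ybars` would also look roughly normal, as the central limit theorem suggests.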
The t-statistic

Can we rescale (Ȳ − µ0)?

  f(Y) = (Ȳ − µ0) / (σ/√n)

Under H0,

  f(Y) is approximately normal(0, 1).

This distribution has no unknown parameters ⇒ we could use it as a null distribution.

However, having observed the data y, is f(y) a statistic?
• ȳ is computable from the data and n is known;
• µ0 is our hypothesized value, a fixed number that we have chosen;
• σ is not determined by us and is unknown.

Can we approximate the population variance σ² with the sample variance s²?
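If σ were somehow known, f(y) would be an ordinary z-statistic. A hedged sketch (the data, µ0, and the "known" σ below are all hypothetical):

```python
# z-statistic f(y) = (ybar - mu0)/(sigma/sqrt(n)) with sigma assumed known,
# and a two-sided p-value from the standard normal reference distribution.
import numpy as np
from scipy import stats

y = np.array([0.3, -0.2, 0.5, 0.1, -0.4, 0.6])  # hypothetical data
mu0, sigma = 0.0, 0.5                           # sigma assumed known here
n = len(y)

z = (y.mean() - mu0) / (sigma / np.sqrt(n))
p = 2 * stats.norm.sf(abs(z))  # two-sided p-value
print(z, p)
```

In practice σ is unknown, which is exactly why the slides turn to s² next.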
One sample t-statistic

  t(Y) = (Ȳ − µ0) / (s/√n)

For a given value of µ0 this is a statistic. What is the null distribution of t(Y)?

If s ≈ σ, then

  (Ȳ − µ0) / (s/√n) ≈ (Ȳ − µ0) / (σ/√n).

If Y1, . . . , Yn ∼ i.i.d. normal(µ0, σ²), then (Ȳ − µ0)/(σ/√n) is normal(0, 1).
If s ≈ σ, then t(Y) = (Ȳ − µ0)/(s/√n) is approximately normal(0, 1).

So it seems that normal(0, 1) is an approximate null distribution for t(Y).

However, if the approximation s ≈ σ is poor, as when n is small, we need to take account of our uncertainty in the estimate of σ.
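How poor can the normal(0,1) approximation be? A simulation sketch (illustrative; the constants are arbitrary) draws data under H0 with a small n and checks how often |t| exceeds the normal critical value 1.96:

```python
# Null simulation: for small n the statistic (Ybar - mu0)/(s/sqrt(n)) has
# heavier tails than normal(0,1), so P(|t| > 1.96) exceeds the nominal 0.05.
import numpy as np

rng = np.random.default_rng(1)
mu0, sigma, n, reps = 0.0, 1.0, 5, 100000

y = rng.normal(mu0, sigma, size=(reps, n))
t = (y.mean(axis=1) - mu0) / (y.std(axis=1, ddof=1) / np.sqrt(n))

print((np.abs(t) > 1.96).mean())  # well above 0.05 for n = 5
```

Using the normal cutoff here would reject a true null far more than 5% of the time, which motivates the χ² and t machinery that follows.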
The χ² distribution

Goal: describe the variability of s² around σ².

Z1, . . . , Zn ∼ i.i.d. normal(0, 1) ⇒
  Σ Zi² ∼ χ²_n, the chi-squared distribution with n degrees of freedom;
  Σ (Zi − Z̄)² ∼ χ²_{n−1}.

Y1, . . . , Yn ∼ i.i.d. normal(µ, σ²) ⇒ (Y1 − µ)/σ, . . . , (Yn − µ)/σ ∼ i.i.d. normal(0, 1)
  ⇒ (1/σ²) Σ (Yi − µ)² ∼ χ²_n and (1/σ²) Σ (Yi − Ȳ)² ∼ χ²_{n−1}.

Some intuition: which is "bigger": (Z1, . . . , Zn) or (Z1 − Z̄, . . . , Zn − Z̄)?
The latter is of length n but lies in an (n − 1)-dimensional space.
It has a singular multivariate normal distribution, with a covariance matrix of rank n − 1.

Getting back to the problem at hand,

  [(n − 1)/σ²] s² = [(n − 1)/σ²] · [1/(n − 1)] Σ (Yi − Ȳ)² ∼ χ²_{n−1},

which is a known distribution we can look up in a table or with a computer.
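The distributional claim can be verified numerically: a χ²_{n−1} variable has mean n − 1 and variance 2(n − 1). A simulation sketch (constants arbitrary):

```python
# Simulation check that (n-1) s^2 / sigma^2 behaves like chi^2_{n-1},
# which has mean n-1 and variance 2(n-1).
import numpy as np

rng = np.random.default_rng(2)
mu, sigma, n, reps = 5.0, 2.0, 10, 200000

y = rng.normal(mu, sigma, size=(reps, n))
x = (n - 1) * y.var(axis=1, ddof=1) / sigma**2

print(x.mean())  # close to n - 1 = 9
print(x.var())   # close to 2(n - 1) = 18
```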
The χ² distribution

[Figure: χ² density curves for n = 9, 10, and 11 degrees of freedom.]
The t-distribution

If
• Z ∼ normal(0, 1),
• X ∼ χ²_m,
• Z, X are statistically independent,
then

  Z / √(X/m) ∼ t_m, the t-distribution with m degrees of freedom.

How does this help us? Recall that if Y1, . . . , Yn ∼ i.i.d. normal(µ, σ²), then
• √n (Ȳ − µ)/σ ∼ normal(0, 1);
• [(n − 1)/σ²] s² ∼ χ²_{n−1};
• Ȳ and s² are independent.
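The defining construction Z / √(X/m) can be checked directly by simulation. For m > 2, a t_m variable has mean 0 and variance m/(m − 2); the sketch below (m and the sample size are arbitrary choices) verifies this empirically:

```python
# Building a t_m variate as Z / sqrt(X/m) with Z ~ normal(0,1) and
# X ~ chi^2_m independent; t_m has mean 0 and variance m/(m-2) for m > 2.
import numpy as np

rng = np.random.default_rng(3)
m, reps = 8, 500000

z = rng.normal(size=reps)
x = rng.chisquare(df=m, size=reps)
t = z / np.sqrt(x / m)

print(t.mean())  # close to 0
print(t.var())   # close to m/(m - 2)
```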
The t distribution

Let Z = √n (Ȳ − µ)/σ and X = [(n − 1)/σ²] s². Then

  Z ∼ normal(0, 1),
  X ∼ χ²_{n−1},
  Z, X are independent,

so Z / √(X/(n − 1)) ∼ t_{n−1}.

In terms of Ȳ and s², we have

  Z / √(X/(n − 1)) = [√n (Ȳ − µ)/σ] / √{[(n − 1)/σ²] s² / (n − 1)} = (Ȳ − µ) / (s/√n) ∼ t_{n−1}.

This is still not a statistic because µ is unknown.
But under a specific hypothesis like H0 : µ = µ0, it is a statistic:

  t(Y) = (Ȳ − µ0) / (s/√n) ∼ t_{n−1} if E[Ȳ] = µ0.

It is called the t-statistic.
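Putting this to work: compute t(y) and refer it to the t_{n−1} distribution. A hedged sketch using the boiling-point example (the six measurements below are made up for illustration); `scipy.stats.ttest_1samp` performs the same computation:

```python
# One-sample t-statistic and its two-sided p-value under the t_{n-1} null.
# The "boiling point" measurements below are made up for illustration.
import numpy as np
from scipy import stats

y = np.array([99.2, 100.1, 99.8, 100.4, 99.5, 99.9])
mu0 = 100.0  # H0: mu = 100 degrees C
n = len(y)

t_stat = (y.mean() - mu0) / (y.std(ddof=1) / np.sqrt(n))
p_value = 2 * stats.t.sf(abs(t_stat), df=n - 1)

# scipy computes the same quantities:
res = stats.ttest_1samp(y, popmean=mu0)
print(t_stat, p_value)
print(res.statistic, res.pvalue)
```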
Some questions for discussion

• What does X/m converge to as m → ∞?
• What happens to the distribution of t(Y) as n → ∞? Why?
• Consider the situation where the data are not normally distributed.
  • What is the distribution of √n (Ȳ − µ)/σ for large and small n?
  • What is the distribution of [(n − 1)/σ²] s² for large and small n?
  • Are √n (Ȳ − µ) and s² independent for small n? What about for large n?
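One way to explore the first two questions numerically (an illustrative sketch, not a substitute for the reasoning):

```python
# Numerical exploration: X/m with X ~ chi^2_m concentrates around 1 as m
# grows (its sd is sqrt(2/m)), and t_m quantiles approach normal(0,1) quantiles.
import numpy as np
from scipy import stats

rng = np.random.default_rng(4)
for m in (5, 50, 500):
    x = rng.chisquare(df=m, size=100000)
    print(m, (x / m).std())  # shrinks toward 0 like sqrt(2/m)

for m in (5, 50, 500):
    print(m, stats.t.ppf(0.975, df=m))  # approaches the normal value 1.96
```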
The t-distribution

Figure: t-distribution densities p(t) on −3 ≤ t ≤ 3, for n = 3, 6, 12 and n = ∞ (the normal limit).
One sample t-test

1. Sampling model: Y1, . . . , Yn ∼ i.i.d. normal(µ, σ2)
2. Null hypothesis: H0 : µ = µ0
3. Alternative hypothesis: H1 : µ ≠ µ0
4. Test statistic: t(Y) = √n (Y − µ0)/s
• Pre-experiment we think of this as a random variable, an unknown quantity that is to be randomly sampled from a population.
• Post-experiment it is a fixed number.
One sample t-test

5. Null distribution: Under the normal sampling model and H0, the sampling distribution of t(Y) is the t-distribution with n − 1 degrees of freedom:

t(Y) ∼ tn−1

If H0 is not true, then t(Y) does not have a t-distribution.
• If the data are normal but the mean is not µ0, then t(Y) has a non-central t-distribution, which we will use to calculate the type II error rate (power).
• If the data are not normal, then t(Y) does not have a t-distribution.

6. p-value: Let y be the observed data. Then

p-value = Pr(|t(Y)| ≥ |t(y)| | H0)
        = Pr(|Tn−1| ≥ |t(y)|)
        = 2 × Pr(Tn−1 ≥ |t(y)|).

In R this is 2 * (1 - pt(abs(tobs), n - 1)), and it is computed automatically by t.test(y, mu = mu0).
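The p-value calculation can also be approximated by Monte Carlo: simulate t(Y) under H0 many times and count how often it exceeds |t(y)|. A sketch in Python (rather than the slides' R); the observed value t_obs and the sample size n below are illustrative assumptions.

```python
import math
import random

random.seed(2)

def t_stat(y, mu0):
    """One-sample t statistic sqrt(n)(ybar - mu0)/s."""
    n = len(y)
    ybar = sum(y) / n
    s2 = sum((yi - ybar) ** 2 for yi in y) / (n - 1)
    return math.sqrt(n) * (ybar - mu0) / math.sqrt(s2)

# Hypothetical observed statistic and sample size (n - 1 = 10 d.f.):
n, mu0, sigma = 11, 0.0, 1.0
t_obs = 2.18

# Simulate the null distribution of t(Y) under H0 with normal data.
sims = []
for _ in range(20000):
    y = [random.gauss(mu0, sigma) for _ in range(n)]
    sims.append(t_stat(y, mu0))

# Two-sided Monte Carlo p-value.
p_mc = sum(abs(t) >= abs(t_obs) for t in sims) / len(sims)
print(round(p_mc, 3))  # near Pr(|T_10| >= 2.18), about 0.054
```

The Monte Carlo estimate converges to the exact tail probability Pr(|Tn−1| ≥ |t(y)|) as the number of simulations grows.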
One sample t-test

7. Level-α decision procedure: Reject H0 if
• p-value ≤ α, or equivalently
• |t(y)| ≥ t(n−1),1−α/2 (for α = .05, t(n−1),1−α/2 ≈ 2).

The value t(n−1),1−α/2 is called the critical value for this test. In general, the critical value is the value of the test statistic above which we would reject H0.

Question: Suppose our procedure is to reject H0 only when t(y) ≥ t(n−1),1−α. Is this a level-α test?
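The critical value can itself be recovered by simulating the null t statistic. A Python sketch (rather than the slides' R) for n − 1 = 10 degrees of freedom, where t10,.975 ≈ 2.23:

```python
import math
import random

random.seed(3)

def t_stat(y, mu0=0.0):
    """One-sample t statistic sqrt(n)(ybar - mu0)/s."""
    n = len(y)
    ybar = sum(y) / n
    s2 = sum((yi - ybar) ** 2 for yi in y) / (n - 1)
    return math.sqrt(n) * (ybar - mu0) / math.sqrt(s2)

n = 11  # so t(Y) has n - 1 = 10 degrees of freedom under H0
sims = sorted(t_stat([random.gauss(0, 1) for _ in range(n)])
              for _ in range(40000))

# Empirical 97.5% quantile of the simulated null distribution:
# should be near the exact critical value t_{10,.975} = 2.228.
crit = sims[int(0.975 * len(sims))]
print(round(crit, 2))
```

Note how the ≈ 2 rule of thumb emerges: as n grows the quantile decreases toward the normal value 1.96.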
Two-sample tests

Recall the wheat example:

B     A     B     A     B     B
26.9  11.4  26.6  23.7  25.3  28.5
B     A     A     A     B     A
14.2  17.9  16.5  21.1  24.3  19.6

Sampling model:

Y1A, . . . , YnAA ∼ i.i.d. normal(µA, σ2)
Y1B , . . . , YnB B ∼ i.i.d. normal(µB , σ2).

In addition to normality, we assume for now that both variances are equal.

Hypotheses: H0 : µA = µB versus H1 : µA ≠ µB

Recall that

YB − YA ∼ normal( µB − µA , σ2 [ 1/nA + 1/nB ] ).

Hence if H0 is true, then

YB − YA ∼ normal( 0 , σ2 [ 1/nA + 1/nB ] ).
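The variance formula for YB − YA can be checked by simulation. A Python sketch (rather than the slides' R); the parameter values below are arbitrary assumptions with H0 true.

```python
import random
import statistics

random.seed(4)

mu_A, mu_B, sigma = 20.0, 20.0, 4.0  # equal means, so H0 holds
n_A, n_B = 6, 6

# Simulate the difference of sample means many times.
diffs = []
for _ in range(20000):
    ybar_A = sum(random.gauss(mu_A, sigma) for _ in range(n_A)) / n_A
    ybar_B = sum(random.gauss(mu_B, sigma) for _ in range(n_B)) / n_B
    diffs.append(ybar_B - ybar_A)

# Theory: mean mu_B - mu_A = 0, variance sigma^2 (1/n_A + 1/n_B) = 16/3.
print(round(statistics.mean(diffs), 2))
print(round(statistics.variance(diffs), 2))  # near 5.33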
Pooled sample variance

How should we estimate σ2?

s2p = [ Σi=1..nA (yi,A − yA)2 + Σi=1..nB (yi,B − yB )2 ] / [ (nA − 1) + (nB − 1) ]

    = s2A (nA − 1)/[(nA − 1) + (nB − 1)] + s2B (nB − 1)/[(nA − 1) + (nB − 1)]

This gives us the following two-sample t-statistic:

t(YA,YB ) = (YB − YA) / ( sp √(1/nA + 1/nB) ) ∼ tnA+nB−2

Self-check exercises:

1. Show that (nA + nB − 2)s2p/σ2 ∼ χ2 with nA + nB − 2 degrees of freedom (recall how the χ2 distribution was defined).
2. Show that the two-sample t-statistic has a t-distribution with nA + nB − 2 d.f.
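The equivalence of the two forms of s2p (pooled sums of squares, versus a degrees-of-freedom-weighted average of s2A and s2B) can be spot-checked numerically. A Python sketch using the wheat sample variances; with nA = nB = 6 the weights are both 1/2.

```python
# Wheat sample variances (from the numerical example) and group sizes.
n_A, s2_A = 6, 17.93
n_B, s2_B = 6, 26.54

# Form 1: pooled sums of squares over pooled degrees of freedom.
ss_A = (n_A - 1) * s2_A  # equals sum_i (y_iA - ybar_A)^2
ss_B = (n_B - 1) * s2_B
s2p_ss = (ss_A + ss_B) / ((n_A - 1) + (n_B - 1))

# Form 2: weighted average of the two sample variances,
# weights proportional to degrees of freedom.
w_A = (n_A - 1) / ((n_A - 1) + (n_B - 1))
w_B = (n_B - 1) / ((n_A - 1) + (n_B - 1))
s2p_w = w_A * s2_A + w_B * s2_B

print(s2p_ss, s2p_w)  # both equal 22.235 (up to float rounding)
```

The weighted-average form makes clear that s2p is a compromise between s2A and s2B that favors the larger group.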
Numerical Example (wheat again)

Suppose we want to have a type-I error rate of α = 0.05.

Decision procedure: level-α test of H0 : µA = µB , with α = 0.05
• Reject H0 if p-value < 0.05, or equivalently
• Reject H0 if |t(yA, yB )| > t10,.975 = 2.23

Data:
• yA = 18.36, s2A = 17.93, nA = 6
• yB = 24.30, s2B = 26.54, nB = 6

t-statistic:
• s2p = 22.24, sp = 4.72
• t(yA, yB ) = 5.93/(4.72 √(1/6 + 1/6)) = 2.18

Inference:
• p-value = Pr(|T10| ≥ 2.18) = 0.054
• Hence H0 : µA = µB is not rejected at level α = 0.05
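The arithmetic above can be reproduced directly from the rounded summary statistics. A quick Python check (rather than the slides' R):

```python
import math

# Rounded summary statistics from the wheat example.
ybar_A, s2_A, n_A = 18.36, 17.93, 6
ybar_B, s2_B, n_B = 24.30, 26.54, 6

df = n_A + n_B - 2
s2p = ((n_A - 1) * s2_A + (n_B - 1) * s2_B) / df  # pooled variance
sp = math.sqrt(s2p)

# Two-sample t statistic with the pooled standard deviation.
t = (ybar_B - ybar_A) / (sp * math.sqrt(1 / n_A + 1 / n_B))

print(s2p, sp, t)  # about 22.24, 4.72, 2.18
```

Using rounded means gives a mean difference of 5.94 rather than the slides' 5.93, but the t statistic still rounds to 2.18, below the critical value 2.23.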
R-code

> t.test(y[x=="A"], y[x=="B"], var.equal=TRUE)

        Two Sample t-test

data:  y[x == "A"] and y[x == "B"]
t = -2.1793, df = 10, p-value = 0.05431
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -11.999621   0.132954
sample estimates:
mean of x mean of y
 18.36667  24.30000

Always keep in mind where the p-value comes from:

Figure: the t null density p(T) on −4 ≤ T ≤ 4; the p-value is the probability in the tails beyond ±tobs.
Comparison to the randomization test:

We have already compared the t-statistic to its randomization distribution. A sample from the randomization distribution is obtained as follows:

1. Sample a treatment assignment according to the randomization scheme.
2. Compute t(YA,YB ) under this treatment assignment and H0.

The randomization distribution of the t-statistic is then approximated by the empirical distribution of t(1), . . . , t(S).

To obtain the p-value, compare tobs = t(yA, yB ) to the empirical distribution of t(1), . . . , t(S):

p-value = #( |t(s)| ≥ |tobs| ) / S
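The two steps above can be sketched directly in Python (the slides use R); the wheat responses are taken from the earlier slide.

```python
import math
import random

random.seed(5)

# Wheat yields from the earlier slide, split by treatment label.
y_A = [11.4, 23.7, 17.9, 16.5, 21.1, 19.6]
y_B = [26.9, 26.6, 25.3, 28.5, 14.2, 24.3]

def two_sample_t(a, b):
    """Equal-variance two-sample t statistic (mean(b)-mean(a))/(sp*sqrt(1/nA+1/nB))."""
    na, nb = len(a), len(b)
    ma, mb = sum(a) / na, sum(b) / nb
    ssa = sum((x - ma) ** 2 for x in a)
    ssb = sum((x - mb) ** 2 for x in b)
    sp = math.sqrt((ssa + ssb) / (na + nb - 2))
    return (mb - ma) / (sp * math.sqrt(1 / na + 1 / nb))

t_obs = two_sample_t(y_A, y_B)  # about 2.18

# Randomization distribution: reshuffle the 12 responses into groups of 6,
# recompute the t statistic, and count how often it beats |t_obs|.
y_all = y_A + y_B
S = 10000
count = 0
for _ in range(S):
    random.shuffle(y_all)
    if abs(two_sample_t(y_all[:6], y_all[6:])) >= abs(t_obs):
        count += 1

print(round(count / S, 3))  # near the slides' 0.058
```

Note that the randomization scheme here is complete randomization of 6 units to each treatment, matching the wheat design.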
Comparison to the randomization test:

t.stat.obs <- t.test(y[x=="A"], y[x=="B"], var.equal=TRUE)$stat
t.stat.sim <- numeric(10000)
for (s in 1:10000) {
  xsim <- sample(x)
  tmp <- t.test(y[xsim=="B"], y[xsim=="A"], var.equal=TRUE)
  t.stat.sim[s] <- tmp$stat
}
mean(abs(t.stat.sim) >= abs(t.stat.obs))

When I ran this, I got

#( |t(s)| ≥ 2.18 ) / S = 0.058 ≈ 0.054 = Pr(|TnA+nB−2| ≥ 2.18)

Is this surprising? These two p-values were obtained via two completely different ways of looking at the problem!
Randomization and t null distributions

Figure: density histogram of the randomization distribution of t(YA,YB ) on −6 ≤ t ≤ 6, plotted alongside the t null density.
Comparison to the randomization test:

Assumptions: Under H0,
• Randomization Test:
1. Treatments are randomly assigned
• t-test:
1. Data are independent samples
2. Each population is normally distributed
3. The two populations have the same variance

Imagined universes:
• Randomization Test: Numerical responses remain fixed; we imagine only alternative treatment assignments.
• t-test: Treatment assignments remain fixed; we imagine an alternative sample of experimental units and/or conditions, giving different numerical responses.

Inferential context / type of generalization:
• Randomization Test: inference is specific to our particular experimental units and conditions.
• t-test: under our assumptions, inference claims to be generalizable to other units/conditions, i.e. to a larger population.

Yet the numerical results are often nearly identical.
Population models The t-test Two sample tests Checking assumptions
Comparison to the randomization test:
Assumptions: Under H0,• Randomization Test:
1. Treatments are randomly assigned
• t-test:1. Data are independent samples2. Each population is normally distributed3. The two populations have the same variance
Imagined Universes:
• Randomization Test: Numerical responses remain fixed, we imagineonly alternative treatment assignments.
• t-test: Treatment assignments remain fixed, we imagine analternative sample of experimental units and/or conditions, givingdifferent numerical responses.
Inferential Context / Type of generalization
• Randomization Test: inference is specific to our particularexperimental units and conditions.
• t-test: under our assumptions, inference claims to be generalizableto other units / conditions, i.e. to a larger population.
Yet the numerical results are often nearly identical.
Population models The t-test Two sample tests Checking assumptions
Comparison to the randomization test:
Assumptions: Under H0,• Randomization Test:
1. Treatments are randomly assigned
• t-test:1. Data are independent samples2. Each population is normally distributed3. The two populations have the same variance
Imagined Universes:
• Randomization Test: Numerical responses remain fixed, we imagineonly alternative treatment assignments.
• t-test: Treatment assignments remain fixed, we imagine analternative sample of experimental units and/or conditions, givingdifferent numerical responses.
Inferential Context / Type of generalization
• Randomization Test: inference is specific to our particularexperimental units and conditions.
• t-test: under our assumptions, inference claims to be generalizableto other units / conditions, i.e. to a larger population.
Yet the numerical results are often nearly identical.
Population models The t-test Two sample tests Checking assumptions
Comparison to the randomization test:
Assumptions: Under H0,• Randomization Test:
1. Treatments are randomly assigned
• t-test:1. Data are independent samples2. Each population is normally distributed3. The two populations have the same variance
Imagined Universes:
• Randomization Test: Numerical responses remain fixed, we imagineonly alternative treatment assignments.
• t-test: Treatment assignments remain fixed, we imagine analternative sample of experimental units and/or conditions, givingdifferent numerical responses.
Inferential Context / Type of generalization
• Randomization Test: inference is specific to our particularexperimental units and conditions.
• t-test: under our assumptions, inference claims to be generalizableto other units / conditions, i.e. to a larger population.
Yet the numerical results are often nearly identical.
Keep the following concepts clear:

t-statistic: a scaled difference in sample means, computed from the data.

t-distribution: the probability distribution of a standard normal random variable divided by the square root of an independent χ² random variable over its degrees of freedom.

t-test: a comparison of a t-statistic to a t-distribution.

randomization distribution: the probability distribution of a test statistic under random treatment reassignments and H0.

randomization test: a comparison of a test statistic to its randomization distribution.

randomization test with the t-statistic: a comparison of the t-statistic to its randomization distribution.
Some history

de Moivre (1733): approximating binomial distributions, T = Σ_{i=1}^n Y_i with Y_i ∈ {0, 1}.

Laplace (1800s): used the normal distribution to model measurement error in experiments.

Gauss (1800s): justified least-squares estimation by assuming normally distributed errors.

Gosset/Student (1908): derived the t-distribution.

Gosset/Student (1925): "testing the significance"

Fisher (1925): "level of significance"

Fisher (1920s?): Fisher's exact test.

Fisher (1935): "It seems to have escaped recognition that the physical act of randomization, which, as has been shown, is necessary for the validity of any test of significance, affords the means, in respect of any particular body of data, of examining the wider hypothesis in which no normality of distribution is implied."

Box and Andersen (1955): approximation of randomization null distributions (and a long discussion).

Rodgers (1999): "If Fisher's ANOVA had been invented 30 years later or computers had been available 30 years sooner, our statistical procedures would probably be less tied to theoretical distributions than they are today."
Checking assumptions

t(YA, YB) = (ȲB − ȲA) / (sp √(1/nA + 1/nB))

If Y_{1,A}, ..., Y_{nA,A} and Y_{1,B}, ..., Y_{nB,B} are independent samples from pA and pB, and

(a) μA = μB,
(b) σ²A = σ²B,
(c) pA and pB are normal distributions,

then t(YA, YB) has a t-distribution with nA + nB − 2 degrees of freedom.

So our null distribution really assumes conditions (a), (b), and (c). If we perform a level-α test and reject H0, we are really just rejecting that (a), (b), and (c) are all true:

reject H0 ⇒ one of (a), (b), or (c) is not true.
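The distributional claim above can be verified by simulation: when (a), (b), and (c) all hold, repeated draws of the t-statistic should match the t-distribution with nA + nB − 2 degrees of freedom. The sketch below (with arbitrary illustrative choices of mean, variance, and sample sizes) checks one tail probability.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
nA, nB = 8, 10
mu, sigma = 5.0, 2.0   # common mean and variance, so (a), (b), (c) all hold
n_sims = 20_000

tvals = np.empty(n_sims)
for i in range(n_sims):
    yA = rng.normal(mu, sigma, nA)
    yB = rng.normal(mu, sigma, nB)
    # pooled-variance t-statistic, exactly as in the formula above
    sp2 = ((nA - 1) * yA.var(ddof=1) + (nB - 1) * yB.var(ddof=1)) / (nA + nB - 2)
    tvals[i] = (yB.mean() - yA.mean()) / np.sqrt(sp2 * (1 / nA + 1 / nB))

# Under (a), (b), (c), about 5% of |t| values should exceed the 0.975 quantile
# of the t-distribution with nA + nB - 2 degrees of freedom.
q = stats.t.ppf(0.975, nA + nB - 2)
print(f"P(|t| > {q:.3f}) ≈ {np.mean(np.abs(tvals) > q):.3f}  (theory: 0.050)")
```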
Population models The t-test Two sample tests Checking assumptions
Checking assumptions
t(YA,YB ) =YB − YA
sp
p1/nA + 1/nB
If Y1,A, . . . ,YnA,A and Y1,B , . . . ,YnB ,B are indep. samples from pA and pB , and
(a) µA = µB
(b) σ2A = σ2
B
(c) pA and pB are normal distributions
thent(YA,YB ) ∼ tnA+nB−2
So our null distribution really assumes conditions (a), (b) and (c).
If we perform a level-α test and reject H0,we are really just rejecting that (a), (b), (c) are all true.
reject H0 ⇒ one of (a), (b) or (c) is not true
Population models The t-test Two sample tests Checking assumptions
Checking assumptions
t(YA,YB ) =YB − YA
sp
p1/nA + 1/nB
If Y1,A, . . . ,YnA,A and Y1,B , . . . ,YnB ,B are indep. samples from pA and pB , and
(a) µA = µB
(b) σ2A = σ2
B
(c) pA and pB are normal distributions
thent(YA,YB ) ∼ tnA+nB−2
So our null distribution really assumes conditions (a), (b) and (c).
If we perform a level-α test and reject H0,we are really just rejecting that (a), (b), (c) are all true.
reject H0 ⇒ one of (a), (b) or (c) is not true
Checking assumptions

Recall the three conditions:
(a) µ_A = µ_B
(b) σ²_A = σ²_B
(c) p_A and p_B are normal distributions

For this reason, we will check whether conditions (b) and (c) are plausibly met. If (b) is met, (c) is met, and H0 is rejected, then this is evidence that H0 is rejected because µ_A ≠ µ_B.
Checking normality

Normality can be checked with a normal probability plot.

Idea: Order the observations within each group,

y_(1),A ≤ y_(2),A ≤ ··· ≤ y_(n_A),A,

and compare these sample quantiles to the standard normal quantiles

z_{(1−1/2)/n_A} ≤ z_{(2−1/2)/n_A} ≤ ··· ≤ z_{(n_A−1/2)/n_A}.

(Here Pr(Z ≤ z_{(k−1/2)/n_A}) = (k − 1/2)/n_A; the −1/2 is a continuity correction.)

If the data are normal, the relationship should be approximately linear. Thus we plot the pairs

(z_{(1−1/2)/n_A}, y_(1),A), ..., (z_{(n_A−1/2)/n_A}, y_(n_A),A).

Normality may be roughly checked by fitting straight lines to the probability plots and examining their slopes.
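The plotting positions above can be computed with the standard library alone; a sketch, using a made-up sample of size six:

```python
from statistics import NormalDist

def normal_scores(y):
    """Pairs (z_{(k-1/2)/n}, y_(k)) for a normal probability plot."""
    n = len(y)
    # standard normal quantiles at the continuity-corrected positions (k - 1/2)/n
    z = [NormalDist().inv_cdf((k - 0.5) / n) for k in range(1, n + 1)]
    return list(zip(z, sorted(y)))

# hypothetical small sample
pairs = normal_scores([14.2, 12.9, 16.0, 13.5, 15.1, 14.8])
z, y_sorted = zip(*pairs)
# a crude slope check: for normal data the fitted line has slope near sigma
slope = (y_sorted[-1] - y_sorted[0]) / (z[-1] - z[0])
```

The z-values come out symmetric about 0, and plotting the pairs (e.g. with matplotlib) reproduces what R's `qqnorm` shows.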
Normal scores plots

[Figure: a 4 × 4 grid of normal probability plots of small samples (Sample Quantiles vs. Theoretical Quantiles).]
Unequal variances

For now, we will use the following rule of thumb: if 1/4 < s²_A/s²_B < 4, we won't worry too much about unequal variances.

This may not sound very convincing. In later sections we will show how to perform formal hypothesis tests for equal variances; however, this won't completely solve the problem. If the variances do seem unequal, we have a variety of options available:
• use the randomization null distribution;
• transform the data to stabilize the variances (to be covered later);
• use a modified t-test that allows unequal variances.
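The rule of thumb is a one-liner to check; a sketch with invented samples:

```python
import statistics

def variance_ratio_ok(y_a, y_b):
    """Rule of thumb: worry about unequal variances only if s2_A/s2_B falls outside (1/4, 4)."""
    ratio = statistics.variance(y_a) / statistics.variance(y_b)
    return 0.25 < ratio < 4.0

# hypothetical samples: same shape, shifted, so spreads are similar and the ratio is near 1
ok = variance_ratio_ok([1.0, 2.0, 3.0, 4.0], [1.5, 2.5, 3.5, 4.5])
```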
Modified t-statistic

The modified t-statistic is

t_w(y_A, y_B) = (ȳ_B − ȳ_A) / √( s²_A/n_A + s²_B/n_B ).

This statistic looks reasonable: for large n_A and n_B its null distribution will indeed be a normal(0, 1) distribution. However, the exact null distribution is only approximately a t-distribution, even if the data are actually normal.

The t-distribution we compare t_w to is a t_{ν_w}-distribution, where the degrees of freedom ν_w are given by

ν_w = ( s²_A/n_A + s²_B/n_B )² / [ (1/(n_A−1)) (s²_A/n_A)² + (1/(n_B−1)) (s²_B/n_B)² ].

This is known as Welch's approximation; it may not give an integer as the degrees of freedom.

This t-distribution is not, in fact, the exact sampling distribution of t_w(y_A, y_B) under the null hypothesis that µ_A = µ_B and σ²_A ≠ σ²_B. This is because the null distribution depends on the ratio of the unknown variances σ²_A and σ²_B. This difficulty is known as the Behrens–Fisher problem.
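Welch's statistic and ν_w translate directly into code; a sketch with made-up samples of unequal size and spread:

```python
import math
import statistics

def welch_t_and_df(y_a, y_b):
    """Modified (Welch) t-statistic and its approximate degrees of freedom nu_w."""
    n_a, n_b = len(y_a), len(y_b)
    va = statistics.variance(y_a) / n_a   # s2_A / n_A
    vb = statistics.variance(y_b) / n_b   # s2_B / n_B
    t_w = (statistics.mean(y_b) - statistics.mean(y_a)) / math.sqrt(va + vb)
    nu_w = (va + vb) ** 2 / (va ** 2 / (n_a - 1) + vb ** 2 / (n_b - 1))
    return t_w, nu_w

# hypothetical data: small spread (n_A = 5) vs. large spread (n_B = 6)
y_a = [10.0, 10.2, 9.9, 10.1, 10.3]
y_b = [12.0, 15.5, 9.8, 14.1, 8.9, 13.2]
t_w, nu_w = welch_t_and_df(y_a, y_b)
# nu_w is generally non-integer, and always satisfies
# min(n_A, n_B) - 1 <= nu_w <= n_A + n_B - 2
```

This matches what R's `t.test(..., var.equal = FALSE)` computes for its degrees of freedom.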
Which two-sample t-test to use?

• If the sample sizes are the same (n_A = n_B), then the test statistics t_w(y_A, y_B) and t(y_A, y_B) are the same; however, the degrees of freedom used in the null distribution will differ unless the sample standard deviations are the same.
• If n_A > n_B but σ²_A < σ²_B, and µ_A = µ_B, then the two-sample test comparing t(y_A, y_B) to a t-distribution on n_A + n_B − 2 d.f. will reject more than 5% of the time.
• If the null hypothesis that both the means and variances are equal, i.e. H0: µ_A = µ_B and σ²_A = σ²_B, is scientifically relevant, then we are computing a valid p-value, and this higher rejection rate is a good thing: when the variances are unequal, that null hypothesis is false.
• If, however, the most scientifically relevant hypothesis is H0: µ_A = µ_B without any restriction on the variances, then the higher rejection rate of the test that assumes equal variances can be very misleading, since p-values may be smaller than they are under the correct null distribution (in which σ²_A ≠ σ²_B). Likewise, we will underestimate the probability of a type I error.
• If n_A > n_B and σ²_A > σ²_B, then the p-values obtained from the test using t(y_A, y_B) will tend to be conservative (larger) compared with those obtained with t_w(y_A, y_B).
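The first bullet, that the two statistics coincide when n_A = n_B, is easy to verify numerically; a sketch with made-up data of equal size but unequal spread:

```python
import math
import statistics

def pooled_t(y_a, y_b):
    """Equal-variance two-sample t-statistic."""
    n_a, n_b = len(y_a), len(y_b)
    sp2 = ((n_a - 1) * statistics.variance(y_a)
           + (n_b - 1) * statistics.variance(y_b)) / (n_a + n_b - 2)
    return (statistics.mean(y_b) - statistics.mean(y_a)) / math.sqrt(sp2 * (1 / n_a + 1 / n_b))

def welch_t(y_a, y_b):
    """Unequal-variance (Welch) two-sample t-statistic."""
    va = statistics.variance(y_a) / len(y_a)
    vb = statistics.variance(y_b) / len(y_b)
    return (statistics.mean(y_b) - statistics.mean(y_a)) / math.sqrt(va + vb)

# equal sample sizes (n_A = n_B = 5) but visibly unequal spreads
y_a = [3.1, 2.8, 3.5, 2.9, 3.2]
y_b = [5.0, 7.9, 4.2, 6.8, 5.6]
# the two statistics agree; only the reference t-distributions differ
same = abs(pooled_t(y_a, y_b) - welch_t(y_a, y_b)) < 1e-9
```

Algebraically, with n_A = n_B = n both denominators reduce to √((s²_A + s²_B)/n), which is why only the degrees of freedom distinguish the two tests in this case.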
In short: one should be careful about applying the test based on t(y_A, y_B) if the sample standard deviations appear very different and it is not reasonable to assume equal means and variances under the null hypothesis.