repeated measure design of anova

Post on 23-Feb-2016

170 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

DESCRIPTION

Repeated Measure Design of ANOVA. AMS 572 Group 5. Outline. Jia Chen: Introduction of repeated measures ANOVA Chewei Lu: One-way repeated measures Wei Xi: Two-factor repeated measures Tomoaki Sakamoto : Three-factor repeated measures How-Chung Liu: Mixed models - PowerPoint PPT Presentation

TRANSCRIPT

Repeated Measure Design of ANOVA

AMS 572 Group 5

Outline

• Jia Chen: Introduction of repeated measures ANOVA

• Chewei Lu: One-way repeated measures • Wei Xi: Two-factor repeated measures• Tomoaki Sakamoto : Three-factor repeated

measures• How-Chung Liu: Mixed models• Margaret Brown: Comparison• Xiao Liu: Conclusion

Introduction of Repeated Measures ANOVA

Jia Chen

What is it ?

Definition: - It is a technique used to test the equality of

means.

When To Use It?

• It is used when all members of a random sample are measured under a number of different conditions.

• As the sample is exposed to each condition in turn, the measurement of dependent variable is repeated.

Introduction of One-Way Repeated Measures ANOVA

Che-Wei, LuProfessor:Wei Zhu

One-Way Repeated Measures ANOVA

• DefinitionA one-way repeated measures ANOVA instead of having one score per subject, experiments are frequently conducted in which multiple score are gathered for each case.

• Concept of Repeated Measures ANOVA One factor with at least two levels, levels are dependent. Dependent means that they share variability in some way. The Repeated Measures ANOVA is extended from standard

ANOVA.

One-Way Repeated Measures ANOVA

• When to Use Measuring performance on the same variable over time– for example looking at changes in performance during training or

before and after a specific treatment The same subject is measured multiple times under

different conditions – for example performance when taking Drug A and performance

when taking Drug B The same subjects provide measures/ratings on

different characteristics– for example the desirability of red cars, green cars and blue cars

Note how we could do some RM as regular between subjects designs– For example, Randomly assign to drug A or B

One-Way Repeated Measures ANOVA• Source of Variance in Repeated Measures ANOVA SStotal

– Deviation of each individual score from the grand mean SSb/t subjects

– Deviation of subjects' individual means (across treatments) from the grand mean.

– In the RM setting, this is largely uninteresting, as we can pretty much assume that ‘subjects differ’

SSw/in subjects: How Ss vary about their own mean, breaks down into:– SStreatment • As in between subjects ANOVA, is the comparison of treatment means

to each other (by examining their deviations from the grand mean)• However this is now a partition of the within subjects variation

– SSerror• Variability of individuals’ scores about their treatment mean

One-Way Repeated Measures ANOVA

• Partition of Sum of Square

𝑆𝑆 𝑡𝑜𝑡𝑎𝑙

𝑆𝑆𝑏 /𝑡𝑠𝑢𝑏𝑗𝑒𝑐𝑡𝑠 𝑆𝑆𝑤/ 𝑖𝑛𝑠𝑢𝑏𝑗𝑒𝑐𝑡𝑠

𝑆𝑆 𝑡𝑟𝑒𝑎𝑚𝑒𝑚𝑡 𝑆𝑆𝑒𝑟𝑟𝑜𝑟

Repeated Measures ANOVA

𝑆𝑆 𝑡𝑜𝑡𝑎𝑙

𝑆𝑆𝑏 /𝑡𝑡𝑟𝑒𝑎𝑚𝑒𝑛𝑡 𝑆𝑆𝑤 / 𝑖𝑛𝑒𝑟𝑟𝑜𝑟

Standard ANOVA

One-Way Repeated Measures ANOVA

Variation SS Df MS FBetween a-1 MSA= F=

Within N-a MSE=

Total N-1

• Standard ANOVA Table

• Repeated Measures ANOVA TableSS Df MS F

Between a-1 MSA= F=

Within N-a

-Subjects s-1

-Error MSE=

Total N-A

One-Way Repeated Measures ANOVA• Example:Researchers want to test a new anti-anxiety medication. They measure the anxiety of 7 participants three times: once before taking the medication, once one week after taking the medication, and once two weeks after taking medication. Anxiety is rated on a scale of 1-10,with 10 being ”high anxiety” and 1 being “low anxiety”. Are there any difference between the three condition using significant level

Participants Before Week1 Week21 9 7 42 8 6 33 7 6 24 8 7 35 8 8 46 9 7 37 8 6 2

One-Way Repeated Measures ANOVA

• Define Null and Alternative Hypotheses

Participants Before Week1 Week21 9 7 42 8 6 33 7 6 24 8 7 35 8 8 46 9 7 37 8 6 2

One-Way Repeated Measures ANOVA

• Define Degrees of Freedom N=21 s=7

-=18-6=12

Participants Before Week1 Week21 9 7 42 8 6 33 7 6 24 8 7 35 8 8 46 9 7 37 8 6 2

One-Way Repeated Measures ANOVA

• Analysis of Variance

Participants Before Week1 Week2

1 9 7 4

2 8 6 3

3 7 6 2

4 8 7 3

5 8 8 4

6 9 7 3

7 8 6 2

One-Way Repeated Measures ANOVA• Analysis of Variance(ANOVA Table)

SS Df MS F

Between 98.67 2 49.34 224.27

Within 10.29 18

-Subjects 7.62 6

-Error 2.67 12 0.22

Total 108.96 20

Error=within-Subjects=10.29-7.62=2.67Total=Beteen+Within=98.67+10.29=108.96

• Test Statistic:244.27

One-Way Repeated Measures ANOVA

• Critical Region

Now, the

SS Df MS F

Between 98.67 2 49.34 224.27

Within 10.29 18

-Subjects 7.62 6

-Error 2.67 12 0.22

Total 108.96 20

One-Way Repeated Measures ANOVA• SAS Code DATA REPEAT;

INPUT SUBJ BEFORE WEEK1 WEEK2;DATALINES;1 9 7 42 8 6 33 7 6 24 8 7 35 8 8 46 9 7 37 8 6 2;PROC ANOVA DATA=REPEAT;TITLE "One-Way ANOVA using the repeated Statment";MODEL BEFORE WEEK1 WEEK2= / NOUNI;REPEATED TIME 3 (1 2 3);RUN;

The Before, Week1, and Week2 are the time level at each participants.

In here, we don’t have CLASS statement

because our data set does not have an

independent variable

The NOUNI(no univariate) is a request not to conduct a separate analysis for each of the three times variables.

This indicates the labels we want to printed for each level of times

One-Way Repeated Measures ANOVA• SAS Result

Two-Factor ANOVA with Repeated Measures

Wei Xi

Stating of the Hypothesis

Within-Subjects Main Effect

Between-Subjects Main Effect

Between-Subjects Interaction Effect

Within-Subjects By Between-Subjects Interaction Effects

Two-Factor ANOVA with Repeated Measures on One Factor

Hypothesis

ANOVA TABLESource DF SS MS F

Factor A a-1 SSA SSA/(a-1) MSA/MSWA ~ F(a-1),n(a-1)

Factor B b-1 SSB SSB/(b-1)MSB/MSE ~ F(b-1),n(a-1)(b-1)

AB Interaction (a-1)(b-1) SSAB SSAB/(a-1)(b-1)MSAB/MSE ~ F(a-1)(b-1,n(a-1)(b-1)

Subjects within A (n-1)a SSWA

SSWA/(n-1)a

Error (n-1)a(b-1) SSE SSE/((n-1)a(b-1)

Total nab-1 SST

Example• The shape variable is the repeated variable. This produces an ANOVA with one

between-subjects factor. If you were to examine the expected mean squares for this setup, you would find that the appropriate error term for the test of calib is subject|calib. The appropriate error term for shape and shape#calib is shape#subject|calib (which is the residual error since we do not include the term in the model).

SAS Code

Data Q1;

set pre.Q1;

run;

proc anova data=Q1;

title' Two-way Anova with a Repeated Measure on One Factor';

class calib;

model shape_1 shape_2 shape_3 shape_4 = calib/nouni;

repeated shape 4;

means calib;

run;

MANOVA Test Criteria and Exact F Statistics for the Hypothesis of no shape EffectH = Anova SSCP Matrix for shape

E = Error SSCP Matrix

S=1    M=0.5    N=0Statistic Value F Value Num DF Den DF Pr > FWilks' Lambda 0.02529573 25.69 3 2 0.0377Pillai's Trace 0.97470427 25.69 3 2 0.0377Hotelling-Lawley Trace 38.53236607 25.69 3 2 0.0377Roy's Greatest Root 38.53236607 25.69 3 2 0.0377

MANOVA Test Criteria and Exact F Statistics for the Hypothesis of no shape*calib EffectH = Anova SSCP Matrix for shape*calib

E = Error SSCP Matrix

S=1    M=0.5    N=0Statistic Value F Value Num DF Den DF Pr > FWilks' Lambda 0.16750795 3.31 3 2 0.2404Pillai's Trace 0.83249205 3.31 3 2 0.2404Hotelling-Lawley Trace 4.96986607 3.31 3 2 0.2404Roy's Greatest Root 4.96986607 3.31 3 2 0.2404

At α=0.05,we reject the hypothesis and conclude that there is shape Effect

At α=0.05,we cannot reject the hypothesis and conclude that there is no shape*calib Effect

Analysis of SAS Output

Source DF Anova SS Mean Square F Value Pr > Fcalib 1 51.04166667 51.04166667 11.89 0.0261

Error 4 17.16666667 4.29166667

Source DF Anova SS Mean Square F Value Pr > F

Adj Pr > F

G - G H - Fshape 3 47.45833333 15.81944444 12.80 0.0005 0.0099 0.0011

shape*calib 3 7.45833333 2.48611111 2.01 0.1662 0.2152 0.1791

Error(shape) 12 14.83333333 1.23611111

Univariate Tests of Hypotheses for Within Subject Effects

Tests of Hypotheses for Between Subjects Effects

Two-Factor ANOVA with Repeated Measures on both Factors

Source DF SS MS F Subjects n-1 SSS SSS/I-1 MSS/MSEFactor A a-1 SSA SSA/(a-1) MSA/MSA*S ~ F(a-1),(n-1)(a-1)

Factor B b-1 SSB SSB/(b-1) MSB/MSB*S ~ F(b-1),(n-1)(b-1)

AB Interaction (a-1)(b-1) SSAB SSAB/((a-1)(b-1))

MSAB/MSE ~ F(a-1)(b-1),(n-1)(a-1)(b-1)

A*Subjects (n-1)(a-1) SSA*S SSWA/((n-1)a) SSA*S/MSEF(a-1)(n-1),(n-1)(a-1)(b-1)

B*Subjects (n-1)(b-1) SSB*S SSWB/((n-1)b) SSA*S/MSEF(n-1)(b-1),(n-1)(a-1)(b-1)

Error (n-a)(a-1)(b-1)

SSE SSE/((n-1)(a-1)(b-1))

Total nab-1 SST

ANOVA TABLE

Example Three subjects, each with nine accuracy scores on all combinations of the three

different dials and three different periods. With subject a random factor and both dial and period fixed factors, the appropriate error term for the test of dial is the dial#subject interaction. Likewise, period#subject is the correct error term for period, and period#dial#subject (which we will drop so that it becomes residual error) is the appropriate error term for period#dial.

SAS Code• Data Q2; • Input Mins1-Mins9; • Datalines; • 45 53 60 40 52 57 28 37 46 • 35 41 50 30 37 47 28 32 41 • 60 65 75 58 54 70 40 47 50 • ; • ODS RTF STYLE=BarrettsBlue; • Proc anova data=Q2; • Model Mins1-Mins9=/nouni; • Repeated period 3, dail 3/nom; • Run; • ods rtf close; •

SAS Output

Source DF Anova SS Mean Square F Value Pr > FAdj Pr > F

G - G H - Fperiod 2 1072.666667 536.333333 14.45 0.0148 0.0563 0.0394Error(period) 4 148.444444 37.111111

Greenhouse-Geisser Epsilon 0.5364

Huynh-Feldt Epsilon 0.6569

Source DF Anova SS Mean Square F Value Pr > FAdj Pr > F

G - G H - Fdail 2 978.6666667 489.3333333 50.91 0.0014 0.0169 0.0115Error(dail) 4 38.4444444 9.6111111

Greenhouse-Geisser Epsilon 0.5227

Huynh-Feldt Epsilon 0.5952

Source DF Anova SS Mean Square F Value Pr > FAdj Pr > F

G - G H - Fperiod*dail 4 8.66666667 2.16666667 0.30 0.8715 0.6603 0.7194

Error(period*dail) 8 58.22222222 7.27777778

Greenhouse-Geisser Epsilon 0.2827

Huynh-Feldt Epsilon 0.4006

Univariate Tests of Hypotheses for Within Subject Effects

At α=0.05,we cannot reject the hypothesis and conclude the there is no period*dail Effect

At α=0.05,we reject the hypothesis and conclude that there is dail Effect

At α=0.05,we reject the hypothesis and conclude that there is period Effect

Three-factor Experimentswith a repeated measure

T. Sakamoto

35/87

Example of a marketing experiment

Experiment The subjects who belong to a region X or Y see the

Liquid Crystal Display A, B, or C. Each type of LCD is seen twice; once in the light and

the other in the dark. The preferences of the LCD are measured by the

subjects, on a scale from 1 to 5 (1= lowest, 5=highest).

Case of this example• A company which produces some Liquid Crystal

Display wants to examine the characteristics of its prototype products.

36/87

Experimental Design and Data Three factors

Type of LCD Regions to which the specimens belong In the light / In the dark

Repeted measure factor : In the light / In the dark

Type of LCD

A B C

subj light dark subj light dark subj light dark

REGION

X

1 5 4 11 4 4 21 5 5

2 4 2 12 5 6 22 5 3

3 5 4 13 3 4 23 3 3

4 3 5 14 5 4 24 4 4

5 5 3 15 4 6 25 4 3

Y

6 4 4 16 5 5 26 3 5

7 3 5 17 4 3 27 4 3

8 4 3 18 5 3 28 5 2

9 2 5 19 5 5 29 3 4

10 5 4 20 4 4 30 4 3

37/87

data lcd;input subj type $ region $ light dark @@;datalines;1 a 5 4 2 a 4 2 3 a 5 4 4 a 3 5 5 a 5 36 a 4 4 7 a 3 5 8 a 4 3 9 a 2 5 10 a 5 4 11 b 4 4 12 b 5 6 13 b 3 4 14 b 5 4 15 b 4 616 b 5 5 17 b 4 3 18 b 5 3 19 b 5 5 20 b 4 421 c 5 5 22 c 5 3 23 c 3 3 24 c 4 4 25 c 4 326 c 3 5 27 c 4 3 28 c 5 2 29 c 3 4 30 c 4 3;run;

proc anova data=lcd;title ’Three-way ANOVA with a Repeated Measure';class type region;model light dark = type | region /nouni;repeated light_dark;means type | region;run;

SAS PROGRAM

OUTPUT(Part 1/4):

OUTPUT(Part 2/4):

40/81

OUTPUT(Part 3/4):

OUTPUT(Part 4/4):

Mixed Effect ModelsHow-Chang Liu

Mixed Models

• When we have a model that contains random effect as well as fixed effect, then we are dealing with a mixed model.

• From the above definition, we see that mixed models must contain at least two factors. One having fixed effect and one having random effect.

Why use mixed models?

• When repeated measurements are made on the same statistical units, it would not be realistic to assume that these measurements are independent.

• We can take this dependence into account by specifying covariance structures using a mixed model

Definition

• A mixed model can be represented in matrix notation by:

• is the vector of observations• is the vector of fixed effects• is the vector of random effects• is the vector of I.I.D. error terms• and are matrices relating and to

Assumptions

• R and G are constants• We also assume that and are independent• We get V = ZGZ' + R, where V is the variance of

y

How to estimate and ?

If R and G are given: Using Henderson’s Mixed Model equation, we have: =

So = And = )

What if G and R are unknown?

• We know that both and are normally distributed, so the best approach is to use likelihood based methods

• There are two methods used by SAS:• 1)Maximum likelihood (ML)• 2)Restricted/residual maximum likelihood

(REML)

ExampleBelow is a table of growth measurements for 11 girls and 16 boys at ages 8, 10, 12, 14:

Person gender age8 age10 age12 age141 F 21.0 20.0 21.5 23.0 2 F 21.0 21.5 24.0 25.5 3 F 20.5 24.0 24.5 26.0 4 F 23.5 24.5 25.0 26.5 5 F 21.5 23.0 22.5 23.5 6 F 20.0 21.0 21.0 22.5 7 F 21.5 22.5 23.0 25.0 8 F 23.0 23.0 23.5 24.0 9 F 20.0 21.0 22.0 21.5 10 F 16.5 19.0 19.0 19.5 11 F 24.5 25.0 28.0 28.0 12 M 26.0 25.0 29.0 31.0 13 M 21.5 22.5 23.0 26.5 14 M 23.0 22.5 24.0 27.5

Person gender age8 age10 age12 age1415 M 25.5 27.5 26.5 27.016 M 20.0 23.5 22.5 26.0 17 M 24.5 25.5 27.0 28.5 18 M 22.0 22.0 24.5 26.5 19 M 24.0 21.5 24.5 25.5 20 M 23.0 20.5 31.0 26.0 21 M 27.5 28.0 31.0 31.5 22 M 23.0 23.0 23.5 25.0 23 M 21.5 23.5 24.0 28.0 24 M 17.0 24.5 26.0 29.5 25 M 22.5 25.5 25.5 26.0 26 M 23.0 24.5 26.0 30.0 27 M 22.0 21.5 23.5 25.0

Using SASdata pr; input Person Gender $ y1 y2 y3 y4; y=y1; Age=8; output; y=y2; Age=10; output; y=y3; Age=12; output; y=y4; Age=14; output; drop y1-y4; datalines; 1 F 21.0 20.0 21.5 23.0 2 F 21.0 21.5 24.0 25.5 …;Run;

Using SAS

proc mixed data=pr method=ml covtest; class Person Gender; model y = Gender Age Gender*Age / s; repeated / type=un subject=Person r; run;

ResultsModel Information

Data Set WORK.PR

Dependent Variable y

Covariance Structure Unstructured

Subject Effect Person

Estimation Method ML

Residual Variance Method None

Fixed Effects SE Method Model-Based

Degrees of Freedom Method Between-Within

Class Level Information

Class Levels Values

Person 27 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27

Gender 2 F M

As one can see, the covariance matrix is unstructured, as we are going to estimate it using the maximum likelihood method

Dimensions

Covariance Parameters 10

Columns in X 6

Columns in Z 0

Subjects 27

Max Obs Per Subject 4

Number of Observations

Number of Observations Read 108

Number of Observations Used 108

Number of Observations Not Used 0

Iteration History

Iteration Evaluations -2 Log Like Criterion

0 1 478.24175986

1 2 419.47721707 0.00000152

2 1 419.47704812 0.00000000

Convergence criteria met.

As one can see, we do not have a Z matrix for this model

the convergence of the Newton-Raphson algorithm means that we have found the maximum likelihood estimates

Covariance Parameter Estimates

Cov Parm Subject Estimate Standard Error Z Value Pr Z

UN(1,1) Person 5.1192 1.4169 3.61 0.0002

UN(2,1) Person 2.4409 0.9835 2.48 0.0131

UN(2,2) Person 3.9279 1.0824 3.63 0.0001

UN(3,1) Person 3.6105 1.2767 2.83 0.0047

UN(3,2) Person 2.7175 1.0740 2.53 0.0114

UN(3,3) Person 5.9798 1.6279 3.67 0.0001

UN(4,1) Person 2.5222 1.0649 2.37 0.0179

UN(4,2) Person 3.0624 1.0135 3.02 0.0025

UN(4,3) Person 3.8235 1.2508 3.06 0.0022

UN(4,4) Person 4.6180 1.2573 3.67 0.0001

The table lists the 10 estimated covariance parameters in order. In other words, these are the estimates for R, the variance of

Solution for Fixed Effects

Effect Gender Estimate Standard Error DF t Value Pr > |t|

Intercept 15.8423 0.9356 25 16.93 <.0001

Gender F 1.5831 1.4658 25 1.08 0.2904

Gender M 0 . . . .

Age 0.8268 0.07911 25 10.45 <.0001

Age*Gender F -0.3504 0.1239 25 -2.83 0.0091

Age*Gender M 0 . . . .

From this table, we see that the boys intercept is at 15.8423, whole the girls intercept is at 15.8423+1.5831=17.42.The estimate of the boys’ slope is at 0.827, while the girls’ slpe is at 0.827-0.350=0.477So the girls’ starting point is higher than the girls but their growth rate is only about half of that of the boys

Type 3 Tests of Fixed Effects

Effect Num DF Den DF F Value Pr > F

Gender 1 25 1.17 0.2904

Age 1 25 110.54 <.0001

Age*Gender 1 25 7.99 0.0091

This is probably the most important table from our results:

The gender row tests the null hypothesis that girls and boys have a common intercept.As we can see we cannot reject that hypothesis

The Age tests the null hypothesis that age does not affect the growth rate.As we can see, we reject the null hypothesis as the F-value is large.

The Age*gender tests reveals that there is a difference in slope at the 1% significance level.

Repeated Measures ANOVA vs. Independent Measures ANOVA

Magarate Brown

• Can we just use standard ANOVA with repeated measures data?

• No, Independent Measures (standard) ANOVA assumes the data are independent.

• Data from a repeated measures experiment not independent

How are standard ANOVA and repeated measures ANOVA the same?

• Independent measures ANOVA: an extension of the pooled variance t-test

• Repeated Measures ANOVA: an extension of the paired sample t-test

How are standard ANOVA and repeated measures ANOVA the same?

• Independent measures ANOVA: assumes the population variances are equal (homogeneity of variance)

• Repeated Measures ANOVA: sphericity assumption that the population variances of all the differences are equal

How are standard ANOVA and repeated measures ANOVA the same?

• Both assume Normality of the population

Advantages to using Repeated Measures instead of Independent Measures

• Limited number of subjects available• Prefer to limit the number of subjects• Less variability (finger tapping with caffeine

example)• Can examine effects over time

Drawbacks

• Practice effect• Example: subjects get better at performing a task

each time with “practice”

• Differential transfer: “This occurs when the effects of one condition persist and affect participants’ experiences during subsequent conditions.” (format: http://www.psychmet.com/id16.html)

• Example: medical treatments

top related