t-static t-test is used to test hypothesis about an unknown population mean, µ, when the value of...
TRANSCRIPT
t-Statict-Static t-testt-test is used to test is used to test hypothesis about anhypothesis about an unknown population unknown population mean, µ, mean, µ, when the when the value of value of σσ or or σσ²² is is unknownunknown. . 22
Degrees of FreedomDegrees of Freedomdf=n-1df=n-1
33
Assumption of the Assumption of the t-test t-test (Parametric Tests)(Parametric Tests)
1.The Values in the sample must 1.The Values in the sample must consist ofconsist of independent independent observations observations
2. The population sample must be 2. The population sample must be normal normal
3. use a 3. use a large sample large sample n ≥ 30n ≥ 3044
Inferential StatisticsInferential Statistics t-Statistics:t-Statistics: There are different types of t- StatisticThere are different types of t- Statistic 1. 1. SingleSingle (one) Sample t-statistic (test) (one) Sample t-statistic (test) 2. 2. TwoTwo independent sample t-test, independent sample t-test,
Matched-Subject ExperimentMatched-Subject Experiment, or , or Between Subject DesignBetween Subject Design
3.3.RepeatedRepeated MeasureMeasure Experiment, or Experiment, or Related/Paired Related/Paired SampleSample t-test t-test
55
Hypothesis Testing Hypothesis Testing Step 3: Computations/ Calculations or Collect Step 3: Computations/ Calculations or Collect
Data and Compute Sample StatisticsData and Compute Sample StatisticsFYI FYI Z Score for ResearchZ Score for Research
66
Hypothesis TestingHypothesis TestingStep 3: Computations/ Calculations or Collect Step 3: Computations/ Calculations or Collect
Data and Compute Sample StatisticsData and Compute Sample Statistics
77
Calculations for Calculations for t-testt-test
M-M-μμ t= st= s
SSmm S Sm= orm= or
SSmm== M-M-μμ √n √n
t t
M=t.SM=t.Sm+m+μμ
μμ=M- S=M- Sm.tm.t
88
2 /S n
FYI FYI Variability VariabilitySS,SS, Standard Deviations and VariancesStandard Deviations and Variances
X X σ² = ss/N σ² = ss/N PopPop 1 1 σ = √σ = √ss/Nss/N 22 4 s = √ss/df 4 s = √ss/df 5 5 s² = ss/n-1 or ss/df s² = ss/n-1 or ss/df Sample Sample
SS=SS=ΣxΣx²-(Σx)²/N²-(Σx)²/N
SS=SS=ΣΣ(( x-x-μμ))²²
Sum Sum of of SquaredSquared DeviationDeviation from from MeanMean99
d=d=Effect SizeEffect SizeUse Use SS instead of instead of σσ for for t-testt-test
1010
Cohn’s d=Cohn’s d=Effect SizeEffect SizeUse Use SS instead of instead of σσ for for t-testt-test
1111
d = (M - µ)/sd = (M - µ)/s
s= (M - µ)/ds= (M - µ)/d
M= (d . s) + µM= (d . s) + µ
µ= (M – d) sµ= (M – d) s
Percentage of Variance Accounted Percentage of Variance Accounted for by the Treatment (similar to for by the Treatment (similar to Cohen’s d) Also known as Cohen’s d) Also known as ωω² ²
Omega SquaredOmega Squared
1212
22
2
tr
t df
percentage of Variance accounted percentage of Variance accounted for by the Treatment for by the Treatment
Percentage of Variance Explained Percentage of Variance Explained r²=0.01--------r²=0.01-------- Small Effect Small Effect
r²=0.09--------r²=0.09-------- Medium Effect Medium Effect
r²=0.25--------r²=0.25-------- Large Effect Large Effect
1313
2r2r
ProblemsProblems Infants, even Infants, even newbornsnewborns prefer to look at prefer to look at attractive faces (Slater, et al., 1998). In the attractive faces (Slater, et al., 1998). In the study, infants from study, infants from 1 to 6 days old 1 to 6 days old were shown were shown two photographs two photographs of women’s face. Previously, a of women’s face. Previously, a group of adults had rated one of the faces as group of adults had rated one of the faces as significantly more attractive than the other. The significantly more attractive than the other. The babies were positioned in front of a screen on babies were positioned in front of a screen on which the photographs were presented. The pair which the photographs were presented. The pair of faces remained on the screen until the baby of faces remained on the screen until the baby accumulated a total of accumulated a total of 20 seconds 20 seconds of looking at of looking at one or the other. The number of seconds looking one or the other. The number of seconds looking at the attractive face was recorded for each at the attractive face was recorded for each infant. infant. 1414
ProblemsProblems Suppose that the study used a sample of Suppose that the study used a sample of n=9n=9
infants and the data produced an average of infants and the data produced an average of M=13M=13 seconds for attractive face with seconds for attractive face with SS=72.SS=72.
Set the level of significance at Set the level of significance at αα=.05=.05
for for two tails two tails Note that all the available information comes Note that all the available information comes
from the sample. Specifically, from the sample. Specifically, we do not know we do not know the population mean the population mean μμ or the population or the population standard deviation standard deviation σσ. .
On the basis of this sample, can we On the basis of this sample, can we conclude that infants prefer to look at conclude that infants prefer to look at attractive faces?attractive faces? 1515
Null HypothesisNull Hypothesis t-Statistic:t-Statistic: If the Population mean or If the Population mean or µ and the µ and the
sigma are unknown sigma are unknown the statistic of the statistic of choice will be choice will be t-Statict-Static
1.1. Single Single (one) Sample t-statistic (test)(one) Sample t-statistic (test) Step 1Step 1 HH00 : µ : µ attractive attractive = 10 seconds = 10 seconds HH11 : µ : µ attractiveattractive ≠ 10 seconds≠ 10 seconds
1616
ProblemsProblems A psychologist has prepared an “Optimism A psychologist has prepared an “Optimism
Test” that is administered yearly to Test” that is administered yearly to graduating college seniors. The test graduating college seniors. The test measures how each graduating class measures how each graduating class feels about its future. The higher the score feels about its future. The higher the score , the more optimistic the class. Last year’s , the more optimistic the class. Last year’s class had a mean score of class had a mean score of μμ=15=15. A . A sample of sample of n=9 n=9 seniors from this year’s seniors from this year’s class was selected and tested.. class was selected and tested..
1717
ProblemsProblems The scores for these seniors are The scores for these seniors are 7, 12, 11, 7, 12, 11,
15, 7, 8, 15, 9, and 615, 7, 8, 15, 9, and 6, which produced a , which produced a sample mean of sample mean of M=10M=10 with with SS=94. SS=94.
On the basis of this sample, can the On the basis of this sample, can the psychologist conclude that this year’s psychologist conclude that this year’s class has a class has a differentdifferent level of optimism?level of optimism?
Note that this hypothesis test will use a t-Note that this hypothesis test will use a t-statistic because the statistic because the population variance population variance σσ² is not known. ² is not known. USE SPSSUSE SPSS
Set the level of significance at Set the level of significance at αα=.01 for two tails =.01 for two tails
1818
Null HypothesisNull Hypothesis t-Statistic:t-Statistic: If the Population mean or If the Population mean or µ and the µ and the
sigma are unknown sigma are unknown the statistic of the statistic of choice will be choice will be t-Statict-Static
1.1. Single Single (one) Sample t-statistic (test)(one) Sample t-statistic (test) Step 1Step 1 HH00 : µ : µ optimism optimism = 15 = 15 HH11 : µ : µ optimismoptimism ≠ 15≠ 15
1919
Chapter 10Chapter 10TwoTwo Independent Sample t-testIndependent Sample t-test Matched-Subject ExperimentMatched-Subject Experiment, or , or
Between Subject DesignBetween Subject DesignAn independent-measures studyAn independent-measures study
uses a separate sample to uses a separate sample to represent each of the represent each of the populations or treatment populations or treatment conditions being compared.conditions being compared.
2020
TwoTwo Independent Sample t-testIndependent Sample t-testNull HypothesisNull Hypothesis:: If the Population mean or µ is unknown If the Population mean or µ is unknown
the statistic of choice will be the statistic of choice will be t-Statict-Static Two Two independent sample t-test, independent sample t-test, Matched-Matched-
Subject ExperimentSubject Experiment, or Between Subject , or Between Subject DesignDesign
HH00 : µ : µ1 1 -µ-µ2 2 = 0= 0 HH11 : µ : µ11 -µ -µ22 ≠ 0 ≠ 0
2121
ProblemsProblems Research results suggest aResearch results suggest a relationship relationship
Between the TV viewing habits of 5-year-old children and their future performance in high school. For example, Anderson, Huston, Wright & Collins (1998) report that high school students who regularly watched Sesame Street as children had better grades in high school than their peers who did not watch Sesame Street.
2222
ProblemsProblems The researcher intends to examine The researcher intends to examine
this phenomenon using a sample of this phenomenon using a sample of 20 high school students. She first 20 high school students. She first surveys the students’ s parents to surveys the students’ s parents to obtain information on the family’s TV obtain information on the family’s TV viewing habits during the time that the viewing habits during the time that the students were 5 years old. Based on students were 5 years old. Based on the survey results, the researcher the survey results, the researcher selects a sample of selects a sample of n=10n=10
2323
ProblemsProblems students with a history of watching students with a history of watching
“Sesame Street“ and a sample of “Sesame Street“ and a sample of n=10 n=10 students who did not watch the students who did not watch the program. The program. The average high school average high school grade grade is recorded for each student is recorded for each student and the data are as follows: and the data are as follows: Set the Set the level of significance at level of significance at αα=.05=.05
for two tails for two tails
2424
ProblemsProblemsAverage High School GradeAverage High School Grade
Watched Sesame St (1). Watched Sesame St (1). Did not Watch Sesame St.(2)Did not Watch Sesame St.(2)
86 86 90 90
87 87 89 89
9191 82 82
9797 83 83
98 98 85 85
99 99 7979
9797 83 83
94 94 86 86
8989 81 81
9292 92 92
nn11=10 =10 n n22=10=10
MM11=93 =93 M M22= 85= 85
SSSS11=200 =200 SS SS22=160=1602525
TwoTwo Independent Sample t-testIndependent Sample t-testNull HypothesisNull Hypothesis:: Two Two independent sample t-test, independent sample t-test, Matched-Matched-
Subject ExperimentSubject Experiment, or Between Subject , or Between Subject Design-Design- non-directional or two-tailed non-directional or two-tailed testtest
Step 1.Step 1. HH00 : µ : µ1 1 -µ-µ2 2 = 0= 0 HH11 : µ : µ11 -µ -µ22 ≠ 0 ≠ 0
2626
TwoTwo Independent Sample t-testIndependent Sample t-testNull HypothesisNull Hypothesis:: Two Two independent sample t-test, independent sample t-test, Matched-Matched-
Subject ExperimentSubject Experiment, or Between Subject , or Between Subject Design -Design - directional or one-tailed testdirectional or one-tailed test
Step 1.Step 1. HH00 : µ : µ Sesame St . Sesame St . ≤ µ ≤ µ No Sesame St. No Sesame St.
HH11 : µ : µ Sesame St. Sesame St. >> µ µ No Sesame St.No Sesame St.
2727
Measuring d=Measuring d=Effect Size for the Effect Size for the independent measuresindependent measures
2828
2
1 2M Md
S p
Percentage of Variance Accounted Percentage of Variance Accounted for by the Treatment for by the Treatment (similar to (similar to
Cohen’s d) Also known as Cohen’s d) Also known as ωω² Omega ² Omega Squared Squared and Coefficient of and Coefficient of
DeterminationDetermination
2929
22
2
tr
t df
We use the We use the Point-Biserial Correlation Point-Biserial Correlation when one of our variable is dichotomous, in when one of our variable is dichotomous, in
this case (1) watched Sesame St. this case (1) watched Sesame St. (2) and didn’t watch Sesame St. (2) and didn’t watch Sesame St.
3030
22
2
tr
t df
ProblemsProblems In recent years, psychologists have In recent years, psychologists have
demonstrated repeatedly that using mental demonstrated repeatedly that using mental images can greatly improve memory. Here we images can greatly improve memory. Here we present a hypothetical experiment designed to present a hypothetical experiment designed to examine this phenomenon.examine this phenomenon.
The psychologist first prepares a list of The psychologist first prepares a list of 40 pairs 40 pairs of of nounsnouns (for example, dog/bicycle, grass/door, (for example, dog/bicycle, grass/door, lamp/piano). Nextlamp/piano). Next, two groups of participants , two groups of participants are obtained (two separate samples). are obtained (two separate samples). Participants in one group are given the list for 5 Participants in one group are given the list for 5 minutes and instructed to memorize the 40 minutes and instructed to memorize the 40 noun pairs. noun pairs. 3131
ProblemsProblems Participants in another group receive the same Participants in another group receive the same
list of words, but in addition to the regular list of words, but in addition to the regular instruction, they are told to form a instruction, they are told to form a mental image mental image for each pair of nouns for each pair of nouns (imagine a dog riding a (imagine a dog riding a bicycle, for example). Later each group is given bicycle, for example). Later each group is given a memory test in which they are given the first a memory test in which they are given the first word from each pair and asked to recall the word from each pair and asked to recall the second word. The psychologist records the second word. The psychologist records the number of words correctly recalled for each number of words correctly recalled for each individual. The data from this experiment are as individual. The data from this experiment are as follows: follows: Set the level of significance at Set the level of significance at αα=.01=.01
for two tails for two tails 3232
ProblemsProblemsData (Number of words recalled)Data (Number of words recalled)
Group 1 (Images) Group 1 (Images) Group 2 (No Images) Group 2 (No Images)
19 19 23 23
20 20 22 22
2424 15 15
3030 16 16
31 31 18 18
32 32 1212
3030 16 16
27 27 19 19
2222 14 14
2525 25 25
nn11=10 =10 n n22=10=10
MM11=26 =26 M M22= 18= 18
SSSS11=200 =200 SS SS22=160=1603333
3.3.RepeatedRepeated Measure Experiment, Measure Experiment, or or Related/PairedRelated/Paired Sample t-test Sample t-test Within Subject Within Subject Experiment DesignExperiment Design
A A single sample single sample of individuals of individuals is measured is measured more than once more than once on the same dependent on the same dependent variable. The variable. The same subjects same subjects are used in all of the treatment are used in all of the treatment conditions.conditions.
3434
Null HypothesisNull Hypothesis t-Statistics:t-Statistics: If the Population mean or µ is unknown If the Population mean or µ is unknown
the statistic of choice will be the statistic of choice will be t-Statistict-Statistic 3.Repeated Measure Experiment, or 3.Repeated Measure Experiment, or
Related/Paired Sample t-testRelated/Paired Sample t-test For For Non-directional or two tailed testNon-directional or two tailed test Step. 1Step. 1 HH00 : µ : µDD = 0= 0 HH11 : µ : µD D ≠ 0≠ 0
3535
Null HypothesisNull Hypothesis t-Statistics:t-Statistics: If the Population mean or µ is unknown If the Population mean or µ is unknown
the statistic of choice will be the statistic of choice will be t-Statistict-Statistic 3.Repeated Measure Experiment, or 3.Repeated Measure Experiment, or
Related/Paired Sample t-test Related/Paired Sample t-test For Directional or one tailed testsFor Directional or one tailed tests Step. 1Step. 1 HH00 : µ : µDD ≤ 0 ≤ 0 HH11 : µ : µD D > 0> 0
3636
ProblemsProblemsResearch indicates that the color red
increases men’s attraction to women (Elliot & Niesta, 2008). In the original study, men were shown women’s photographs presented on either white or red background. Photographs presented onpresented on redred were were rated significantly rated significantly more attractive more attractive than the same photographs mounted than the same photographs mounted on whit. on whit. 3737
ProblemsProblems
In a similar study, a researcher In a similar study, a researcher prepares prepares a set of 30 women’s a set of 30 women’s photographs, photographs, withwith 15 mounted on 15 mounted on a white a white background and background and 15 15 mounted on red. mounted on red. One picture is One picture is identified as the test photograph, identified as the test photograph, and appears twice in the set, once and appears twice in the set, once on white and once on red.on white and once on red.
3838
ProblemsProblems Each male participant looks through the entire Each male participant looks through the entire
set of photographs and rated the attractiveness set of photographs and rated the attractiveness of each woman on a of each woman on a 12-point scale12-point scale. The data . The data in the next slide summarizes the responses for in the next slide summarizes the responses for a sample of a sample of n=9n=9 men.men.
Set the level of significance at Set the level of significance at αα=.01=.01
for two tailsfor two tails
Do the data indicate that the color Do the data indicate that the color red red increases men’s attraction to increases men’s attraction to womenwomen ?? ??
3939
ProblemsProblemsParticipants White background Participants White background X1X1 Red Background Red Background X2 X2 DD=X2-X1 =X2-X1 DD² ²
A 6 9 +3 9A 6 9 +3 9
B 8 9 +1 1B 8 9 +1 1
C 7 10 +3 9C 7 10 +3 9
D 7 11 +4 16D 7 11 +4 16
E 8 11 +3 9E 8 11 +3 9
F 6 9 +3 9F 6 9 +3 9
G 5 11 +6 36G 5 11 +6 36
H 10 11 +1 1H 10 11 +1 1
I 8 11 +3 9 I 8 11 +3 9
ΣΣD =27 D =27 ΣΣDD²=99²=99
MMD =D =
4040
D
n
Null HypothesisNull Hypothesis For Non-Directional or two tailed testsFor Non-Directional or two tailed tests
Step. 1Step. 1HH00 : µ : µDD = 0 = 0HH11 : µ : µDD ≠ 0 ≠ 0
4141
ProblemsProblemsOne technique to help people deal with phobia is One technique to help people deal with phobia is
to have them counteract the feared objects by to have them counteract the feared objects by using using imaginationimagination to move themselves to a to move themselves to a place of safety. In an experiment test of this place of safety. In an experiment test of this technique, patients sit in front of a screen and technique, patients sit in front of a screen and are instructed to relax. Then they are shown a are instructed to relax. Then they are shown a slide of the feared object for example, a picture slide of the feared object for example, a picture of a spider, (arachnophobia). The patient of a spider, (arachnophobia). The patient signals the researcher as soon as feelings of signals the researcher as soon as feelings of anxiety begin to arise, and the researcher anxiety begin to arise, and the researcher records the amount of time that the patient was records the amount of time that the patient was able to endure looking at the slide. able to endure looking at the slide. 4242
ProblemsProblems The patient then spends two minutes The patient then spends two minutes
imagining a “safe scene” imagining a “safe scene” such as a such as a tropical tropical beachbeach (next slide) (next slide) beforebefore the slide is presented the slide is presented again. If patients can tolerate the feared object again. If patients can tolerate the feared object longer longer afterafter the imagination exercise, it is the imagination exercise, it is viewed as a reduction in the phobia. The data viewed as a reduction in the phobia. The data in next slide summarize the items recorded in next slide summarize the items recorded from a sample of from a sample of n=7n=7 patients. patients. Do the data Do the data indicate that the imagination technique indicate that the imagination technique effectively alters phobia? effectively alters phobia? .. Set the level of Set the level of significance at significance at αα=.05 for one tailed test.=.05 for one tailed test.
4343
ProblemsProblemsParticipant Before imagination Participant Before imagination X1 X1 After Imagination After Imagination X2 X2 DD=X2-X1 =X2-X1 D D² ²
A 15 24 +9 81A 15 24 +9 81
B 10 23 +13 169B 10 23 +13 169
C 7 11 +4 16C 7 11 +4 16
D 18 25 +7 49D 18 25 +7 49
E 5 14 +9 81E 5 14 +9 81
F 9 14 +5 25F 9 14 +5 25
G 12 21 +9 81G 12 21 +9 81
ΣΣD =56 D =56 ΣΣDD²=502²=502
MMD =D =
4545
D
n
Null HypothesisNull Hypothesis
For Directional or one tailed testsFor Directional or one tailed tests
Step. 1Step. 1 HH00 : µ : µD D ≤ 0 ≤ 0 (The amount of time is not increased.)(The amount of time is not increased.)
HH11 : µ : µDD > 0 > 0 (The amount of time is increased.)(The amount of time is increased.)
4646
Relationship of Statistical Relationship of Statistical TestsTests
Does this Diagram Make Sense to You?Does this Diagram Make Sense to You?
Structural Equation Modeling
Structural Model
Multiple Regression
ANOVA
t ratio
Correlation
Measurement Model
Confirmatory Factor Analysis
Exploratory Factor Analysis
EstimationEstimation The inferential process of using sample The inferential process of using sample
statistics to statistics to estimate estimate population parameter population parameter is called is called estimation.estimation.
We use estimation 1. After hypotheses test We use estimation 1. After hypotheses test when when HH0 0 is rejected. 2. when we know there is rejected. 2. when we know there is an effect and simply want to find out is an effect and simply want to find out how how much. much. 3. When we want some basic 3. When we want some basic information about an unknown population.information about an unknown population.
See the logic behind hypothesis tests See the logic behind hypothesis tests ans estimation.ans estimation.
4848
EstimationEstimation Point Estimate:Point Estimate: Interval Estimate:Interval Estimate: µ= M ± tsµ= M ± tsMM
µµ11- µ- µ 2 2 = M= M11-M-M22 ± ts(M ± ts(M11-M-M22))
µµD D = M= MDD ± ts ± tsMMDD
4949
5050
ANANALYSIS ALYSIS OOF F VAVARIANCERIANCE ANOVAANOVA
TESTS FOR DIFFERENCES AMONG TWO TESTS FOR DIFFERENCES AMONG TWO OR MORE POPULATION MEANSOR MORE POPULATION MEANS
σσ²²=S=S²²=MS=MS
MS=Mean Squared DeviationMS=Mean Squared Deviation Ex of ANOVA Research: The effect of Ex of ANOVA Research: The effect of
temperaturetemperature on on recallrecall. .
5151
StatisticsStatisticsStandard Deviations and VariancesStandard Deviations and Variances
X X σ² = ss/N Popσ² = ss/N Pop 1 1 σ = √σ = √ss/Nss/N 22 4 s = √ss/df 4 s = √ss/df SampleSample 5 5 s² = ss/n-1 or s² = ss/n-1 or ss/dfss/df
MSMS = = SS/dfSS/df
5252
Effects of Temperature (IV) on Recall (DV)Effects of Temperature (IV) on Recall (DV)
5353
5454
5555
MSMS betbet = SS = SS betbet / df / df betbet
MS MS withwith = SS = SS withwith /df /df withwith
5656
5757
ANOVAANOVA
SS SS betbet = =ΣΣ(T²/n-G²/n)(T²/n-G²/n) SS SS withwith = =ΣΣssss SS SS totaltotal =SS =SS betbet + SS + SS withwith df df betbet = K-1 = K-1 df df withwith =N-K =N-K df df totaltotal= df = df betbet + df + df withwith
5858
Post Hoc Tests (Post Tests)Post Hoc Tests (Post Tests)
Post Hoc Tests are Post Hoc Tests are additional additional hypothesis hypothesis tests that are tests that are done done after after an ANOVA to an ANOVA to determine exactly determine exactly which which mean mean difference are difference are significant and which are not.significant and which are not.
5959
Post Hoc Tests (Post Tests)Post Hoc Tests (Post Tests)
Tukey’s Honestly Significant Tukey’s Honestly Significant Difference(HSD) Test Difference(HSD) Test
HSD= qHSD= q
6060
with / nMS
ProblemsProblems The data in next slide were obtained from an The data in next slide were obtained from an
independent-measures experiment designed to independent-measures experiment designed to examine people’s performances for viewing examine people’s performances for viewing distance distance of a 60-inch high definition television. of a 60-inch high definition television. Four viewing Four viewing distances were evaluated, distances were evaluated, 99 feet, feet, 1212 feet, feet, 15 15 feet, and feet, and 1818 feet, with a separate feet, with a separate group of participants tested at each distance. group of participants tested at each distance. Each individual watched a 30-minute television Each individual watched a 30-minute television program from a specific distance and then program from a specific distance and then completed a brief questionnaire measuring their completed a brief questionnaire measuring their satisfaction with the experience. satisfaction with the experience.
6161
ProblemsProblems One question asked them to One question asked them to rate the viewing rate the viewing
distance on a scale from distance on a scale from 1 (Very Bad 1 (Very Bad definitely definitely need to move closer or farther away) to need to move closer or farther away) to 7 7 (Excellent(Excellent-perfect-perfect viewing distance). The viewing distance). The purpose of the ANOVA purpose of the ANOVA is to determine is to determine whether there are any significant differences whether there are any significant differences among the four viewing distances among the four viewing distances that were that were tested. Before we begin the hypothesis test, tested. Before we begin the hypothesis test, note that we have already computed several note that we have already computed several summary statistics for the data in next slide. summary statistics for the data in next slide. Specifically, the tretment Specifically, the tretment totals (T) totals (T) and and SS SS values are shown for the entire set of data. values are shown for the entire set of data. 6262
ProblemsProblems9 feet 9 feet 12 feet 12 feet 15 fet 15 fet 18 feet18 feet
3 3 44 7 7 6 6 NN=20=20
0 0 33 6 6 33 GG=60=60
2 2 11 5 5 4 4 ΣΣX² X² =262=262
0 0 11 4 4 33 K=K=44
0 0 11 3 3 44
TT11=5 =5 TT22=10=10 T T33=25 =25 TT44=20=20
SSSS11=8 =8 SSSS22= 8 = 8 SSSS33=10 =10 SSSS44=6=6
MM11=1 =1 MM22=2=2 M M33=5 =5 MM44=4=4
nn11=5 =5 nn22=5=5 n n33=5 =5 nn44=5=5 6363
ProblemsProblems Having these summary values simplifies the Having these summary values simplifies the
computations in the hypothesis test, and we computations in the hypothesis test, and we suggest that you always compute these suggest that you always compute these summary statistics before you begin an summary statistics before you begin an ANOVA. ANOVA.
Step 1) Step 1)
HH00 : µ : µ11== µµ22=µ=µ33=µ=µ44 (There is no treatment effect.)(There is no treatment effect.)
HH11 : : (At least one of the treatment means is different.)(At least one of the treatment means is different.)
We will set alpha at We will set alpha at α α =.05=.056464
Step 2Step 2
6565
ProblemsProblems A human factor psychologist A human factor psychologist
studied studied three computer keyboard three computer keyboard designs. designs. Three samples of Three samples of individuals were given material to individuals were given material to type on a particular keyboard, and type on a particular keyboard, and the number of errors committed the number of errors committed by each participant was recorded. by each participant was recorded. The data are on next slide.The data are on next slide.
6666
ProblemsProblemsKeyboard A Keyboard A Keyboard B Keyboard B Keyboard CKeyboard C
0 0 66 6 6 NN=15=15
4 4 8 8 5 5 GG=60=60
0 0 55 9 9 ΣΣX² X² =356=356
1 1 44 4 4
0 0 22 6 6
TT11=5 =5 TT22=25=25 T T33=30=30
SSSS11=12 =12 SSSS22=20 =20 SS SS33=14=14
MM11=1 =1 MM22=5=5 M M33=6=6 Is there a significant Is there a significant differences among the three computer keyboard designs ?differences among the three computer keyboard designs ?
ProblemsProblems Step 1) Step 1)
HH00 : µ : µ11== µµ22=µ=µ33 (No differences between the (No differences between the
computer keyboard designs )computer keyboard designs )
HH11 : : (At least one of the computer keyboard designs is (At least one of the computer keyboard designs is
different.)different.)
We will set alpha at We will set alpha at α α =.01=.01
6868
Step 2Step 2
6969
2-WAY ANOVA2-WAY ANOVA
7070
Correlation Correlation
& &
RegressionRegression
7171
X
Y
CorrelationCorrelation
Correlation measures the Correlation measures the strengthstrength and and the the directiondirection of the relationship of the relationship between two or more variables.between two or more variables.
A correlation has three components:A correlation has three components:– The The strengthstrength of the coefficient of the coefficient– The The directiondirection of the relationship of the relationship– The The formform of the relationship of the relationship
The The strengthstrength of the coefficient of the coefficient is is indicated by the absolute value of the indicated by the absolute value of the coefficient.coefficient.– The closer the value is to The closer the value is to 1.01.0, either , either
positive or negative, positive or negative, the stronger or more the stronger or more linear the relationship.linear the relationship.
– The closer the value is toThe closer the value is to 0 0, , the weaker or the weaker or nonlinearnonlinear the relationship. the relationship.
7272
X
Y
CorrelationCorrelation
The The directiondirection of coefficient of coefficient is indicated by is indicated by the sign of the correlation coefficient.the sign of the correlation coefficient.– A A positivepositive coefficient indicates that as one coefficient indicates that as one
variable (X) increases, so does the other (Y).variable (X) increases, so does the other (Y).– AA negativenegative coefficient indicates that as one coefficient indicates that as one
variable (X) increases, the other variable (Y) variable (X) increases, the other variable (Y) decreases.decreases.
– The The form form of the relationshipof the relationship The form of the relationship is The form of the relationship is linear.linear.
In correlation variables are In correlation variables are not identified as not identified as independent or dependent independent or dependent because the because the researcher is measuring the one researcher is measuring the one relationship that is mutually shared between relationship that is mutually shared between the two variablesthe two variables– As a result, causality should not be implied with As a result, causality should not be implied with
correlationcorrelation..
7373
CorrelationCorrelation Remember, the correlation coefficient can only Remember, the correlation coefficient can only
measure a measure a linear relationshiplinear relationship.. A zero correlation indicates A zero correlation indicates no linear relationshipno linear relationship..
However, does not indicate no relationshipHowever, does not indicate no relationship..
a coefficient of zero rules out a coefficient of zero rules out linear linear relationship, relationship,
but a but a curvilinearcurvilinear could still exist.could still exist.– The scatterplots below illustrate this point:The scatterplots below illustrate this point:
05
1015
200
5
10
15
20
No Relationshipr = .0
05
1015
200
5
10
15
20
No Linear Relationshipr = 0
7474
7575
The Correlation is based on a Statistic The Correlation is based on a Statistic Called Called CovarianceCovariance
Variance and Covariance Variance and Covariance are used to are used to measure the quality of measure the quality of an item in a test.an item in a test.
Reliability and validity measure the quality of Reliability and validity measure the quality of the entire test.the entire test.
σ²=SS/Nσ²=SS/N used for one set of data used for one set of data
VarianceVariance is the degree of variability is the degree of variability
of scores from meanof scores from mean..
The Correlational MethodThe Correlational MethodSS,SS, Standard Deviations and VariancesStandard Deviations and Variances
X X σ² = ss/N Pop σ² = ss/N Pop 1 1 σ = √σ = √ss/Nss/N 22 4 4 s = √ss/df s = √ss/df 5 5 s² = ss/n-1 or ss/df Sample s² = ss/n-1 or ss/df Sample
SS=SS=ΣxΣx²-(Σx)²/N²-(Σx)²/N
SS=SS=ΣΣ(( x-x-μμ))²²
Sum Sum of of SquaredSquared DeviationDeviation from from MeanMean7676
7777
VarianceVariance
X X σ² = ss/N Popσ² = ss/N Pop 1 1 s² = ss/n-1 or ss/df Samples² = ss/n-1 or ss/df Sample 2 2 4 4 55
SS=SS=ΣxΣx²-(Σx)²/N²-(Σx)²/N
SS=SS=ΣΣ(( x-x-μμ))²²
Sum Sum of of SquaredSquared DeviationDeviation from from MeanMean
7878
CovarianceCovariance CorrelationCorrelation is based on a statistic called is based on a statistic called
CovarianceCovariance (Cov xy or S xy) ….. (Cov xy or S xy) ….. r=sp/√ssx.ssyr=sp/√ssx.ssy
Covariance is a number that reflects the Covariance is a number that reflects the degree to which 2 variables vary together. degree to which 2 variables vary together.
Original DataOriginal Data X YX Y 1 31 3 2 62 6 4 44 4 5 75 7
7979
CovarianceCovariance
COVxy=SP/N-1COVxy=SP/N-12 ways to calculate the SP2 ways to calculate the SPSP= Σxy-(Σx.Σy)/N SP= Σxy-(Σx.Σy)/N ComputationComputation
SP= (x-μx)(y-μy) SP= (x-μx)(y-μy) DefinitionDefinition
SP requires 2 sets of dataSP requires 2 sets of dataSS requires only one set of SS requires only one set of datadata
The Correlational MethodThe Correlational Method
Correlational data can be graphed and a Correlational data can be graphed and a “line of best fit” “line of best fit” can be drawncan be drawn
1- Pearson 1- Pearson CorrelationsCorrelations
2-Spearman2-Spearman
3-Point-Biserial Correlation3-Point-Biserial Correlation
4- Partial Correlation4- Partial Correlation
8080
Types of CorrelationTypes of Correlation In correlational research we use continues In correlational research we use continues
variables (variables (interval or ratio scale) interval or ratio scale) for for Pearson Pearson Correlation (for linear relationship).Correlation (for linear relationship).
If it is difficult to measure a variable on an If it is difficult to measure a variable on an interval or ratio scale then we use interval or ratio scale then we use SpearmanSpearman Correlation Correlation
SpearmanSpearman Correlation Correlation uses ordinal or rank uses ordinal or rank ordered dataordered data
Spearman Correlation measures the Spearman Correlation measures the consistency of a relationship (Monotonic consistency of a relationship (Monotonic Relationship)Relationship). . Ex. nextEx. next
8181
MonotonicMonotonic Transformation Transformation
They are They are rank ordered rank ordered numbers numbers (DATA), and use (DATA), and use ordinal scale(data) ordinal scale(data) examples; 1, 2, 3, 4, or 2, 4, 6, 8, 10 examples; 1, 2, 3, 4, or 2, 4, 6, 8, 10 Spearman Correlation Spearman Correlation can be used to can be used to measure the measure the degree of Monotonic degree of Monotonic relationship relationship between two variables.between two variables.
8282
Ex. of Monotonic data Ex. of Monotonic data
X X YY
2222 8787
2525 102102
1919 1010
6 6 5 5
8383
Types of CorrelationTypes of Correlation Ex. A teacher may feel confident about rank Ex. A teacher may feel confident about rank
ordering students’ leadership abilities but would ordering students’ leadership abilities but would find it difficult to measure leadership on some find it difficult to measure leadership on some other scale.other scale.
The Point-Biserial Correlation.The Point-Biserial Correlation. However, we can use both continues and However, we can use both continues and
discrete variables(data) in discrete variables(data) in The Point-Biserial The Point-Biserial Correlation. (can be a substitute for two Correlation. (can be a substitute for two independent t-test)independent t-test)
In special situations we can use In special situations we can use Partial Partial Correlations.Correlations.
8484
The Point-Biserial CorrelationThe Point-Biserial Correlation The point-biserial correlation is used to The point-biserial correlation is used to
measure the relationship between two measure the relationship between two variables in situations in which one variable variables in situations in which one variable consist of regular, consist of regular, numerical scores numerical scores (non-(non-dichotomies), but the second variable has only dichotomies), but the second variable has only two values two values (dichotomies).(dichotomies).
We can calculate the correlation from t-testWe can calculate the correlation from t-test r² = r² = Coefficient of Determination Coefficient of Determination which which
measures the measures the effect size=deffect size=d r² = t²/t²+dfr² = t²/t²+df r = √r²r = √r² 8585
The Correlational MethodThe Correlational Method
CorrelationCorrelation is the degree to which events or is the degree to which events or characteristics vary from each othercharacteristics vary from each other– Measures the Measures the strengthstrength of a relationship of a relationship– Does not Does not imply cause and effectimply cause and effect
The people chosen for a study are its The people chosen for a study are its subjects or participantssubjects or participants, collectively called a , collectively called a samplesample– The sample must be representativeThe sample must be representative
8686
A Partial CorrelationA Partial Correlation
Measures the relationship Measures the relationship between two variables while between two variables while controlling the influence of a controlling the influence of a third variable by holding it third variable by holding it constant.constant.
Ex. The correlation between Ex. The correlation between churches and crime. churches and crime.
8787
The Correlational MethodThe Correlational Method
Correlational data Correlational data can be can be graphedgraphed and a and a “line of best “line of best fit” fit” can be drawncan be drawn
8888
Positive Correlation Positive Correlation
Positive correlation: Positive correlation: variables change in variables change in the same directionthe same direction
8989
Positive Correlation Positive Correlation
9090
Negative Correlation Negative Correlation
Negative correlation:Negative correlation: variables change in variables change in the opposite the opposite direction direction
9191
Negative CorrelationNegative Correlation
9292
No CorrelationNo Correlation
Unrelated:Unrelated: no no consistent consistent relationshiprelationship
9393
No CorrelationNo Correlation
9494
The Correlational MethodThe Correlational Method
The magnitude (The magnitude (strengthstrength) of a ) of a correlation is also importantcorrelation is also important–High magnitude =High magnitude = variables which vary variables which vary
closely together; fall close to the line of closely together; fall close to the line of best fitbest fit
–Low magnitude =Low magnitude = variables which do variables which do not vary as closely together; loosely not vary as closely together; loosely scattered around the line of best fitscattered around the line of best fit
9595
The Correlational MethodThe Correlational Method
Direction and magnitude of a correlation are Direction and magnitude of a correlation are often calculated statisticallyoften calculated statistically– Called the Called the “correlation coefficient,” “correlation coefficient,” symbolized by the symbolized by the
letter letter “r”“r” Sign (+ or -) indicates directionSign (+ or -) indicates direction Number (from 0.00 to 1.00) indicates magnitudeNumber (from 0.00 to 1.00) indicates magnitude
0.00 = no consistent relationship0.00 = no consistent relationship +1.00 = +1.00 = perfectperfect positive correlation positive correlation -1.00 = -1.00 = perfectperfect negative correlation negative correlation
Most correlations found in psychological Most correlations found in psychological research fall far short of research fall far short of “perfect”“perfect”
9696
The Correlational MethodThe Correlational Method
Correlations can be trusted based on Correlations can be trusted based on statistical probabilitystatistical probability– ““Statistical significance”Statistical significance” means that the finding means that the finding
is unlikely to have occurred by chanceis unlikely to have occurred by chance By convention, if there is less than a 5% probability By convention, if there is less than a 5% probability
that findings are due to chance (that findings are due to chance (pp < 0.05), < 0.05), results are results are considered considered “significant” “significant” and thought to reflect the and thought to reflect the larger populationlarger population
– Generally, Generally, confidenceconfidence increases with the increases with the size of size of the sample the sample and the and the magnitude of the correlationmagnitude of the correlation
9797
The Correlational MethodThe Correlational Method
Advantages of correlational studies:Advantages of correlational studies:
– Have high Have high external validityexternal validityCan generalize findingsCan generalize findings
– Can repeat (replicate) studies on other Can repeat (replicate) studies on other samplessamples
Difficulties with correlational studies:Difficulties with correlational studies:
– Lack Lack internal validityinternal validityResults describe but Results describe but do not do not explainexplain a a
relationshiprelationship9898
External & Internal ValidityExternal & Internal Validity
External ValidityExternal Validity
External validity addresses the ability to generalize External validity addresses the ability to generalize your study to other people and other situations.your study to other people and other situations.
Internal ValidityInternal ValidityInternal validity addresses the "true" Internal validity addresses the "true" causescauses of the of the
outcomes that you observed in your study. Strong outcomes that you observed in your study. Strong internal validity means that you not only have internal validity means that you not only have reliable measures reliable measures of your of your independent (predictors)independent (predictors) and and dependent variables (criterions) dependent variables (criterions) BUT a strong BUT a strong justification that justification that causallycausally linkslinks your independent your independent variables variables (IV) (IV) to your dependent variablesto your dependent variables (DV). (DV). 9999
The Correlational MethodThe Correlational MethodPearsonPearson
r=sp/√ssx.ssyr=sp/√ssx.ssy Original DataOriginal Data X YX Y 1 31 3 2 62 6 4 44 4 5 75 7
SPSP requires 2 sets of data requires 2 sets of dataSSSS requires only one set of data requires only one set of data
df=n-2df=n-2100100
The Correlational MethodThe Correlational MethodSpearmanSpearman
r=sp/√ssx.ssyr=sp/√ssx.ssy Original Data Original Data Ranks Ranks X Y X YX Y X Y 1 3 1 11 3 1 1 2 6 2 3 2 6 2 3 4 4 3 24 4 3 2 5 7 4 45 7 4 4
SPSP requires 2 sets of data requires 2 sets of dataSSSS requires only one set of data requires only one set of data
101101
Percentage of Variance Accounted for Percentage of Variance Accounted for by the Treatment (similar to Cohen’s d) by the Treatment (similar to Cohen’s d) is known as is known as ωω² Omega Squared ² Omega Squared also is also is called called Coefficient of DeterminationCoefficient of Determination
102102
22
2
tr
t df
Coefficient of DeterminationCoefficient of Determination
If r = 0.80 then, r ² = 0.64If r = 0.80 then, r ² = 0.64This means 64% of the This means 64% of the
variability in the Y scores can variability in the Y scores can be predicted from the be predicted from the relationship with X.relationship with X.
103103
ProblemsProblems Test the hypothesis for the following n=4 pairs of Test the hypothesis for the following n=4 pairs of
scores for a correlation. scores for a correlation. r=sp/√ssx.ssyr=sp/√ssx.ssy Original DataOriginal Data X YX Y 1 31 3 2 62 6 4 44 4 5 75 7
104104
ProblemsProblems
Step 1) Step 1) HH00 : : ρρ=0 =0 (There is no population correlation.)(There is no population correlation.)
HH11 : : ρ≠ρ≠0 0 (There is a real correlation.)(There is a real correlation.)
ΡΡ: probability or chances are…: probability or chances are…
We will set alpha at We will set alpha at α α =.01=.01
105105
STEP 2STEP 2
106106
107107
ProblemsProblems Test the hypothesis for the following set of n=5 Test the hypothesis for the following set of n=5
pairs of scores for a pairs of scores for a positive correlation. positive correlation. Original DataOriginal Data X YX Y 0 20 2 10 610 6 4 24 2 8 48 4 8 68 6
108108
ProblemsProblems
Step 1) Step 1) HH00 : : ρρ≤0 ≤0 ((The population correlation is ((The population correlation is not positive.)not positive.)
HH11 : : ρρ>0 >0 (The population correlation is (The population correlation is positivepositive.).)
ΡΡ: probability or chances are…: probability or chances are…
We will set alpha at We will set alpha at α α =.05=.05
109109
Bi-Variate Regression AnalysisBi-Variate Regression Analysis Bi-variate regression Bi-variate regression analysis extends analysis extends
correlation and attempts to measure the extent correlation and attempts to measure the extent to which a to which a predictorpredictor variable (X) can be used to variable (X) can be used to make a make a prediction prediction about a about a criterioncriterion measure measure (Y). (Y).
X Y
Bi-variateBi-variate regression uses a linear linear model model to predictpredict the criterioncriterion measure.The formula for the predicted score is:
Y' = a + bXY' = a + bX 110110
Bivariate RegressionBivariate Regression The components of the line of best The components of the line of best
fit (Y' = a + bX) are:fit (Y' = a + bX) are:–the the YY-intercept -intercept ((aa) ) ConstantConstant–the slope the slope (b)(b)–Variable Variable (X)(X)
111111
Bivariate RegressionBivariate Regression The The Y-interceptY-intercept is the average value of Y is the average value of Y
whenwhen X is zero.X is zero.–The The Y-interceptY-intercept is also called is also called constant.constant.–Because, this is the amount of Y that is Because, this is the amount of Y that is
constant or present when the influence constant or present when the influence of X is null (0).of X is null (0).
The slope The slope is average value of a is average value of a one unit one unit change in Y change in Y for a corresponding for a corresponding one unit one unit change in X.change in X.–Thus, the Thus, the slope represents the direction slope represents the direction
and intensity of the line.and intensity of the line.112112
Regression and PredictionRegression and Prediction
Y=bX+aY=bX+aRegression LineRegression Line
113113
114114
Bivariate RegressionBivariate Regression Line of Best Fit:Line of Best Fit: Y' = 2.635 + .204X Y' = 2.635 + .204X
With this equation a predicted score may be made for With this equation a predicted score may be made for any value ofany value of X X within the range of data.within the range of data.
a=2.635 a=2.635 and and b=.204b=.204
01
23
4
High School Grade Point Average
0
1
2
3
4
0
2
First Year Grade Point Average
Y-intercept
2.635
Slope = .204
115115
Multiple Regression AnalysisMultiple Regression Analysis Multiple regression analysis is Multiple regression analysis is
an extension of an extension of bi-variate bi-variate regression, regression, in whichin which severalseveral predictorpredictor variables are used to variables are used to predict predict one criterion one criterion measure measure (Y).(Y).– In general, this method is In general, this method is
considered to be considered to be advantageous; since seldom advantageous; since seldom can an outcome measure be can an outcome measure be accurately explained by one accurately explained by one predictor variablepredictor variable. .
YX 2
X 1
X 3
Y' = a + bY' = a + b11XX11 +b +b22XX22 +b+b33XX33
E
116116
117117
118118
Relationship of Statistical Relationship of Statistical TestsTests
Does this diagram make sense to you?Does this diagram make sense to you?
Structural Equation Modeling
Structural Model
Multiple Regression
ANOVA
t ratio
Correlation
Measurement Model
Confirmatory Factor Analysis
Exploratory Factor Analysis
PARAMETRIC PARAMETRIC AND NONPARAMETRIC AND NONPARAMETRIC STATISTICAL TESTSSTATISTICAL TESTS
Parametric tests are more accurate and Parametric tests are more accurate and have 3 assumptions:have 3 assumptions:
1. Random selection1. Random selection 2. Independent of observation2. Independent of observation 3. Sample is taken from a 3. Sample is taken from a normal normal
population population with a normal distribution
120120
NONPARAMETRIC NONPARAMETRIC STATISTICAL TESTSSTATISTICAL TESTS
CHI SQURE: CHI SQURE: It It is like frequency distributionis like frequency distribution
Is is used for comparative studies.Is is used for comparative studies. Ex. Of the two leading brands of cola, Ex. Of the two leading brands of cola,
which is preferred by most American?which is preferred by most American?
121121
CHI SQURE CHI SQURE
= = ΣΣ(fo-fe)²(fo-fe)² /fe/fe
df = C-1 df = C-1
fe = n/c fe = n/c
122122
CHI SQURE CHI SQURE
123123
df=C-1df=C-1
124124
125125
ProblemsProblems A psychologist examining art appreciation selected an A psychologist examining art appreciation selected an
abstract painting that had no obvious top or bottom. abstract painting that had no obvious top or bottom. Hangers were placed on the painting so that it could Hangers were placed on the painting so that it could be hung with any one of the four sides at the top. The be hung with any one of the four sides at the top. The painting was shown to a sample of n=50 participants, painting was shown to a sample of n=50 participants, and each was asked to hung the painting in the and each was asked to hung the painting in the orientation that looked correct. The following data orientation that looked correct. The following data indicate how many people choose each of the four indicate how many people choose each of the four sides to be placed at the top. sides to be placed at the top.
fofo
126126
Top up (correct)18
Bottom up17
Left side up7
Right side up8
ProblemsProblems The question for the hypothesis test is The question for the hypothesis test is
whether there are any preferences among whether there are any preferences among the four possible orientations. Are any of the the four possible orientations. Are any of the orientations selected more (or less) often orientations selected more (or less) often than would be expected simply by chance?than would be expected simply by chance?
We will set alpha at We will set alpha at α α =.05=.05
127127
ProblemsProblems
Step 1) Step 1) HH00 : fo=fe : fo=fe no preference for any specific orientationno preference for any specific orientation
HH11 : fo≠fe : fo≠fe preference for specific orientationpreference for specific orientation
128128
Step 2 Step 2 df=C-1df=C-1
129129