lesson 11 - 4 using inference to make decisions. knowledge objectives define what is meant by a type...
TRANSCRIPT
Lesson 11 - 4
Using Inference toMake Decisions
Knowledge Objectives
• Define what is meant by a Type I error.
• Define what is meant by a Type II error.
• Define what is meant by the power of a test.
• Identify the relationship between the power of a test and a Type II error.
• List four ways to increase the power of a test.
Construction Objectives
• Describe, given a real situation, what constitutes a Type I error and what the consequences of such an error would be.
• Describe, given a real situation, what constitutes a Type II error and what the consequences of such an error would be.
• Describe the relationship between significance level and a Type I error.
• Explain why a large value for the power of a test is desirable.
Vocabulary• Power of the test – value of 1 – β
• Power curve – a graph that shows the power of the test against values of the population mean that make the null hypothesis false.
• Level of Significance – probability of making a Type I error, α
Reality
H0 is True Ha is True
Conclusion
Do Not Reject H0Correct
ConclusionType II Error
Reject H0Type I Error
CorrectConclusion
H0: the defendant is innocentH1: the defendant is guilty
Type I Error (α): convict an innocent personType II Error (β): let a guilty person go free
Note: a defendant is never declared innocent; just not guilty
decrease α increase βincrease α decrease β
Hypothesis Testing: Four Outcomes
Hypothesis Testing: Four Outcomes
• We reject the null hypothesis when the alternative hypothesis is true (Correct Decision)
• We do not reject the null hypothesis when the null hypothesis is true (Correct Decision)
• We reject the null hypothesis when the null hypothesis is true (Incorrect Decision – Type I error)
• We do not reject the null hypothesis when the alternative hypothesis is true (Incorrect Decision – Type II error)
Example 1
You have created a new manufacturing method for producing widgets, which you claim will reduce the time necessary for assembling the parts. Currently it takes 75 seconds to produce a widget. The retooling of the plant for this change is very expensive and will involve a lot of downtime.
Ho :
Ha:
TYPE I:
TYPE II:
Example 1
Ho : µ = 75 (no difference with the new method)
Ha: µ < 75 (time will be reduced)
TYPE I: Determine that the new process reduces time when it actually does not. You end up spending lots of money retooling when there will be no savings. The plant is shut unnecessarily and production is lost.
TYPE II: Determine that the new process does not reduce when it actually does lead to a reduction. You end up not improving the situation, you don't save money, and you don't reduce manufacturing time.
Example 2
A potato chip producer wants to test the hypothesis
H0: p = 0.08 proportion of potatoes with blemishesHa: p < 0.08
Let’s examine the two types of errors that the producer could make and the consequences of each
Type I Error:
Type II Error:
Description: producer concludes that the p < 8% when its actually greater
Description: producer concludes that the p > 8% when its actually less
Consequence: producer accepts shipment with sub-standard potatoes; consumers may choose not to come back to the product after a bad bag
Consequence: producer rejects shipment with acceptable potatoes; possible damage to supplier relationship and to production schedule
Example 3
A city manager’s staff takes a random sample of 400 emergency call response times that yielded x-bar = 6.48 minutes with a standard deviation of 2 minutes. The manager wants to know if the response time decreased from last year’s mean of 6.7 min?
Parameter to be tested:
Test Type:
H0:
Ha:
left-tailed test
Mean response time, = 6.7 minutes
Mean response time, < 6.7 minutes
mean response time in min
Example 3
H0: Mean response time, = 6.7 minutes
Ha: Mean response time, < 6.7 minutes
Give the description and consequences of the two error types:
Type I:
Type II:
The manager concludes that the response times have improved, when they really have not. No additional funding for improvement; possible additional lives lost.
The manager concludes that the response times still need to be improved, when they have improved already. Additional funds spent unnecessarily and morale might be lowered.
Graphical View of Error Types
• As the critical value of x-bar moves right α increases and β decreases• As the critical value of x-bar moves left α decreases and β increases• Need to identify the differences in errors and their consequences in a
given problem
Area to the left of critical value under the right most curve is the Type I error
Area to the right of critical value under the left most curve is the Type II error
Finding P(Type II Error)• Determine the sample mean that separates the rejection region
from the non-rejection region x-bar = μ0 ± zα · σ/√n
• Draw a normal curve whose mean is a particular value from the alternative hypothesis, with the sample mean(s) found in step 1 labeled.
• The area described below represents β, the probability of not rejecting the null hypothesis when the alternative hypothesis is true. a. Left-tailed Test: Find the area under the normal curve drawn in step 2 to the right of x-bar b. Two-tailed Test: Find the area under the normal curve drawn in step 2 between xl and xu
c. Right-tailed Test: Find the area under the normal curve drawn in step 2 to the left of x-bar
Example 3
The current wood preservative (CUR) preserves the wood for 6.40 years under certain conditions. We have a new preservative (NEW) that we believe is better, that it will in fact work for 7.40 years
Ho :
Ha:
TYPE I:
TYPE II:
μ = 6.40 (our preservative is same as the current)
μ = 7.40 (new is significantly better than the current)
Example 3
• Our hypotheses H0: μ = 6.40 (our preservative is the same as the
current one)
H0H1
H1: μ = 7.40 (our preservative is significantly better than the current one)
Example 3
• Type I error – Assumes that H0 is true (that NEW is no better)
– Our experiment leads us to reject H0
Critical Value
H0H1
This area is the P(Type I error)
Example 3
• Type I errors– Assumes that H0 is true (that NEW is no better)
– Our experiment leads us to reject H0
– Result – we conclude that NEW is significantly better, when it actually isn’t
– This will lead to unrealistic expectations from our customers that our product actually works better
Example 3
• Type II errors– Assumes that H1 is true (that NEW is better)
– Our experiment leads us to not reject H0
Critical Value (the same as before)
H0H1
This area is the P(Type II error)
Example 3
• Type II errors– Assumes that H1 is true (that NEW is better)
– Our experiment leads us to not reject H0
– Result – we conclude that NEW is not significantly better, when it actually is
– This will lead to customers not getting a better treatment because we didn’t realize that it was better
Example 3
● Test Details We test our product on n = 60 wood planks We have a known standard deviation σ = 3.2 We use a significance level of α = 0.05
● The standard error of the mean is
● The critical value (for a right-tailed test) is
6.40 + 1.645 0.41 = 7.08
3.2----- = 0.4160
Example 3• The Type II error, β, is the probability of not rejecting
H0 when H1 is true
– H1 is that the true mean is 7.40
– The area where H0 is not rejected is where the sample mean is 7.08 or less
• The probability that the sample mean is 7.08 or less, given that it’s mean is 7.40, is
• Thus β, the Type II error, is 0.22• Power of the test is 1 – β = 0.78
7.08 – 7.40 β = P( z < ------------------) = P(z < -0.77) = 0.22 0.41
Power and Type II Error
• Probability of a Type II error is β• Power of the test is 1 – β• P-value describes what would happen supposing the
null hypothesis is true• Power describes what would happen supposing that
a particular alternative is true
Increasing the Power of a Test
• Four Main Methods:– Increase significance level, – Consider a particular alternative that is farther
away from – Increase the sample size, n, in the experiment– Decrease the population (or sample) standard
deviation, σ
• Only increasing the sample size and the significance level are under the control of the researcher
Comparisons
• P-value, (compared to ) assumes that H0 is true
• Power, (1 - ) assumes that some alternative Ha is true
Summary and Homework
• Summary– A Type I error occurs if we reject H0 when in fact
its true; P(Type I) = α– A Type II error occurs if we fail to reject HO when
in fact its false; P(Type II) = β– Power of a significance test measures its ability
to detect an alternative hypothesis and is = 1 - β – Increasing the power of a test can be done by
increasing sample size and by using a higher α
• Homework– pg 727 – 735; 11.50, 51, 59-61