a confidence interval provides additional information about variability
TRANSCRIPT
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
1/14
A Confidence interval provides additional information about variability. It consists of an
interval of numbers along with a probability that the interval contains the unknown parameter
A point estimate of a parameter is a single sample value of a statistic that estimates the value
of the parameter.
The level of confidence in a confidence interval is a probability that represents the
percentage of intervals that will contain if a large number of repeated samples are obtained.
The level of confidence is denotedFor example, a 95% level of confidence would mean that
if 100 confidence intervals were constructed, each based on a different sample from the same
population, we would expect 95 of the intervals to contain the population mean.
The general formula for all confidence intervals is: Point Estimate (Critical Value)(Standard Error)
The construction of a confidence interval for the population mean depends upon three
factors-:
The point estimate of the population The level of confidence The standard deviation of the sample mean
Confidence Interval for
( Known)
95% of all sample means are in the interval
where X is the point estimate
Z(1.96) is the normal distribution critical value for a probability of/2 in each tail
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
2/14
Levels Of Confidence
Sampling Distribution of the Mean
Ques 1 A sample of 11 circuits from a large normal population has a mean resistance of 2.20
ohms. We know from past testing that the population standard deviation is 0.35 ohms.
Determine a 95% confidence interval for the true mean resistance of the population
ConfidenceLevel
Confidence
Coefficient, Z value
1.28
1.645
1.96
2.332.58
3.08
3.27
0.80
0.90
0.95
0.980.99
0.998
0.999
80%
90%
95%
98%99%
99.8%
99.9%
1
/2 /21
2.40681.9932
0.20682.20
)11(0.35/1.962.20n
ZX
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
3/14
Interpretation
We are 95% confident that the true mean resistance is between 1.9932 and 2.4068 ohms
Although the true mean may or may not be in this interval, 95% of intervals formed in this
manner will contain the true mean
Confidence Interval for
( Unknown)
Note:
If the population standard deviation is unknown, we can substitute the sample
standard deviation, S This introduces extra uncertainty, since S is variable from
sample to sample.So we use the t distribution instead of the normal distribution
The t value depends on degrees of freedom (d.f.)
Number of observations that are free to vary after sample mean has been calculated
d.f. = n1
Example 2
A random sample of n = 25 taken from a normal population has X = 50 and S = 8. Form a95% confidence interval for
n
StX
1-n
2.0639t0.025,241n,/2 t
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
4/14
d.f. = n1 = 24, so
The confidence interval is
46.698 53.302
Confidence Intervals for the Population Proportion,
An interval estimate for the population proportion ( ) can be calculated by adding and
subtracting an allowance for uncertainty to the sample proportion ( p )
SD of Interval
Sample Size
Upper and lower confidence limits for the population proportion are calculated with
the formula-:
where
Z is the standard normal value for the level of confidence desired p is the sample proportion n is the sample size
n
)(1p
n
p)p(1
25
8(2.0639)50
n
S1-n/2, tX
n
p)p(1Zp
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
5/14
Example 4 A random sample of 100 people shows that 25 are left-handed. Form a 95%
confidence interval for the true proportion of left-handers
Interpretation
We are 95% confident that the true percentage of left-handers in the population is
between 16.51% and 33.49%.
Sampling Error is the amount of imprecision in the estimate of the population parameter
To determine the required sample size for the mean, you must know:
The desired level of confidence (1 - ), which determines the critical Z value The acceptable sampling error, e The standard deviation,
s
If unknown, can be estimated when using the required sample size formula Use a value for that is expected to be at least as large as the true
/1000.25(0.75)1.9625/100
p)/np(1Zp
0.33490.1651
(0.0433)1.960.25
2
22
e
Zn
219.195
(45)(1.645)2
22
2
22
e
Zn
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
6/14
Select a pilot sample and estimate with the sample standard deviation, S
To determine the required sample size for the proportion, you must know: The desired level of confidence (1 - ), which determines the critical Z value The acceptable sampling error, e The true proportion of successes,
can be estimated with a pilot sample, if necessary (or conservativelyuse = 0.5)
Example 5
How large a sample would be necessary to estimate the true proportion defective in a large
population within 3%, with 95% confidence?
(Assume a pilot sample yields p = 0.12)
For 95% confidence, use Z = 1.96
e = 0.03
p = 0.12, so use this to estimate
2
2
e
)(1Z
n
450.74(0.03)
0.12)(0.12)(1(1.96)e
)(1Zn2
2
2
2
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
7/14
Fundamentals of Hypothesis Testing: One-Sample Tests
A hypothesis is a claim (assumption) about a population parameter.
Population mean
Example: The mean monthly cell phone bill of this city is = $42
Population proportion
Example: The proportion of adults in this city with cell phones is = 0.68
Null Hypothesis states the claim or assertion to be tested
Example: The average number of TV sets in U.S. Homes is equal to three Ho: = 3
Begin with the assumption that the null hypothesis is true Similar to the notion of innocent until
proven guilty
May or may not be rejectedAlternative hypothesis HiIs the opposite of the null hypothesis
e.g., The average number of TV sets in U.S. homes is not equal to 3 ( H1: 3 )
May or may not be proven Is generally the hypothesis that the researcher is trying to prove
Level of Significance,
Defines the unlikely values of the sample statistic if the nullhypothesis is true
Defines rejection region of the sampling distribution
Is designated by , (level of significance).Typical values are 0.01, 0.05, or0.10
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
8/14
Type I Error
Reject a true null hypothesis Considered a serious type of error
The probability of Type I Error is ,called level of significance of the test Type II Error
Does Not reject a false null hypothesisThe probability of Type II Error is
Hypothesis Test for Mean
ActualSituation
Decision
Do NotReject
H0
No error
(1-
)
Type II Error
( )
RejectH0
Type I Error( )
H0
FalseH0
True
No Error( 1 - )
Known Unknown
Hypothesis
Tests for
(Z test) (t test)
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
9/14
When SD Is Known (Z test )
For a two-tail test for the mean, known: Convert sample statistic ( ) to test statistic (Z statistic ) Determine the critical Z values for a specified
level of significance from a table or computer Decision Rule: If the test statistic falls in the rejection region, reject H0 ; otherwise do
not reject H0
Example
Test the claim that the true mean # of TV sets in US homes is equal to 3.
(Assume = 0.8)
H0: = 3 H1: 3 (This is a two-tail test)
Suppose that = 0.05 and n = 100 are chosen for this test
is known so this is a Z test.
For = 0.05 the critical Z values are 1.96
Suppose the sample results are
n = 100, X = 2.84 ( = 0.8 is assumed known)
So the test statistic is:
Since Z = -2.0 < -1.96, we reject the null hypothesis and conclude that there is sufficientevidence that the mean number of TVs in US homes is not equal to 3
n
X
Z
2.0.08
.16
100
0.8
32.84
n
XZ
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
10/14
p-Value Approach to Testing
p-value: Probability of obtaining a test statistic more extreme ( or ) than the observedsample value given H0 is true
Also called observed level of significance. It is the smallest value of forwhich H0 can be rejected
1)Convert Sample Statistic (e.g., ) to Test Statistic (e.g., Z statistic )
2)Obtain the p-value from a table
3)Compare the p-value with If p-value < , reject H0 If p-value , do not reject H0
Example
How likely is it to see a sample mean of 2.84 (or something further from the mean, in
either direction) if the true mean is = 3.0?
X = 2.84 is translated to a Z score of Z = -2.0
p-value
= 0.0228 + 0.0228 = 0.0456
Compare the p-value with If p-value < , reject H0 If p-value , do not reject H0
Here: p-value = 0.0456
= 0.05Since 0.0456 < 0.05, we reject the null hypothesis
Connecting the above to confidence interval
For X = 2.84, = 0.8 and n = 100, the 95% confidence interval is:
2.6832 2.9968 Since interval does not contain hypothesized mean 3.0 we
reject the null hypothesis at a=0.05
0.02282.0)P(Z
0.02282.0)P(Z
100
0.8(1.96)2.84to
100
0.8(1.96)-2.84
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
11/14
One Tail Tests
H1: < 3
H0: 3 This is a lower/left-tail test since the alternative hypothesis is focused on the
lower/left tail below the mean of 3
H1: > 3
H0: 3 This is an upper/right-tail test since the alternative hypothesis is focused on the
upper/right tail above the mean of 3
Example Upper/Right-Tail Z Test
for Mean ( Known)
A phone industry manager thinks that customer monthly cell phone bills have
increased, and now average over $52 per month. The company wishes to test thisclaim. (Assume = 10 is known)
Suppose that = 0.10 is chosen for this test thus Rejection region would be +1.28= 0.8997
from table
Suppose a sample is taken with the following results: n = 64, X = 53.1 (=10 was assumed
known) Then the test statistic is:
Do not reject H0since Z = 0.88 1.28
i.e.: there is not sufficient evidence that the mean bill is over $ 52
Calculating P value
H0: 52 the average is not over $52 per month
H : > 52 the average is greater than $52 per month
0.88
64
10
5253.1
n
XZ
0.1894
0.810610.88)P(Z
6410/
52.053.1ZP
53.1)XP(
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
12/14
Do not reject H0 since p-value = 0.1894 > = 0.10
t Test of Hypothesis for the Mean ( Unknown)
Example -The average cost of a hotel room in New York is said to be $168 per night.
A random sample of 25 hotels resulted in X = $172.50 and
S = $15.40. Test at the H0: = 168
= 0.05 level. H1: 168
a = 0.05
n= 25
is unknown, so use a t statistic
Critical Value: t24 = 2.0639
Do not reject H0: not sufficient evidence that true mean cost is different than
$168
Connecting to confidence interval
For X = 172.5, S = 15.40 and n = 25, the 95% confidence interval is:172.5 - (2.0639) 15.4/ 25 to 172.5 + (2.0639) 15.4/ 25
166.14 178.86
Since this interval contains the Hypothesized mean (168), we do not reject the nullhypothesis at = 0.05
Proportions
Sample proportion in the success category is denoted by p
n
SXt 1-n
1.46
25
15.40
168172.50
n
S
Xt
1n
sizesample
sampleinsuccessesofnumber
n
Xp
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
13/14
When both n and n(1-) are at least 5, pcan be approximated by a normal distribution
with mean and standard deviation
The sampling distribution of p is approximately normal, so the test statistic is a Zvalue:
Example
A marketing company claims that it receives 8% responses from its mailing. To test this
claim, a random sample of 500 were surveyed with 25 responses. Test at the = 0.05
significance level.
Check:
n = (500)(.08) = 40 H0: = 0.08
n(1-) = (500)(.92) = 460 Hi:Y=/o.o8
There is sufficient evidence to reject the companys claim of 8% response rate.
p-Value Solution
Reject Ho since p-value=0.0136
-
7/29/2019 A Confidence Interval Provides Additional Information About Variability
14/14
University of theFraser Valley
Assignment Chapter 8&9
Presented To Prof Gagan Sharma
Presented By Shrey Budhiraja
200106769