chapter 8: estimation of the mean and proportion

Post on 21-Dec-2015


Chapter 8:

ESTIMATION OF THE MEAN AND PROPORTION

2

ESTIMATION: AN INTRODUCTION

Definition

The assignment of value(s) to a population parameter based on a value of the corresponding sample statistic is called estimation.

3

ESTIMATION: AN INTRODUCTION cont.

Definition

The value(s) assigned to a population parameter based on the value of a sample statistic is called an estimate.

The sample statistic used to estimate a population parameter is called an estimator.

4

ESTIMATION: AN INTRODUCTION cont.

The estimation procedure involves the following steps.

1. Select a sample.
2. Collect the required information from the members of the sample.
3. Calculate the value of the sample statistic.
4. Assign value(s) to the corresponding population parameter.

5

POINT AND INTERVAL ESTIMATES

A Point Estimate

An Interval Estimate

6

A Point Estimate

Definition

The value of a sample statistic that is used to estimate a population parameter is called a point estimate.

7

A Point Estimate cont.

Usually, whenever we use point estimation, we calculate the margin of error associated with that point estimate. The margin of error is calculated as follows:

\[ \text{Margin of error} = \pm 1.96\,\sigma_{\bar{x}} \quad \text{or} \quad \pm 1.96\,s_{\bar{x}} \]
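As a minimal sketch in Python, the margin of error can be computed from a standard deviation and a sample size. The function name `margin_of_error` is illustrative and does not come from the slides; the inputs are the figures used in Example 8-1 of this chapter.

```python
import math

def margin_of_error(std_dev, n, z=1.96):
    """Return z * std_dev / sqrt(n), the margin of error for a sample mean.

    std_dev is sigma when the population standard deviation is known,
    otherwise the sample standard deviation s (hypothetical helper).
    """
    return z * std_dev / math.sqrt(n)

# Figures from Example 8-1: sigma = $4.50, n = 36
print(round(margin_of_error(4.50, 36), 2))  # 1.47
```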

8

An Interval Estimation

Definition

In interval estimation, an interval is constructed around the point estimate, and it is stated that this interval is likely to contain the corresponding population parameter.

9

Figure 8.1 Interval estimation.

(Figure: an interval from $1130 to $1610 constructed around the point estimate \( \bar{x} \) = $1370.)

10

An Interval Estimation cont.

Definition

Each interval is constructed with regard to a given confidence level and is called a confidence interval. The confidence level associated with a confidence interval states how much confidence we have that this interval contains the true population parameter. The confidence level is denoted by (1 – α)100%.

11

INTERVAL ESTIMATION OF A POPULATION MEAN: LARGE SAMPLES

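Confidence Interval for μ for Large Samples

The (1 – α)100% confidence interval for μ is

\[ \bar{x} \pm z\,\sigma_{\bar{x}} \quad \text{if } \sigma \text{ is known} \]

\[ \bar{x} \pm z\,s_{\bar{x}} \quad \text{if } \sigma \text{ is not known} \]

where \( \sigma_{\bar{x}} = \sigma/\sqrt{n} \) and \( s_{\bar{x}} = s/\sqrt{n} \).

The value of z used here is read from the standard normal distribution table for the given confidence level.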
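The interval formula above can be sketched in Python as follows. The helper name `z_interval` is an assumption for illustration; the z value is passed in as read from the table, and the inputs are the figures from Example 8-2 of this chapter.

```python
import math

def z_interval(sample_mean, std_dev, n, z):
    """Return (lower, upper) for sample_mean ± z * std_dev / sqrt(n).

    std_dev is sigma if it is known, otherwise the sample value s.
    """
    std_err = std_dev / math.sqrt(n)   # standard deviation of the sample mean
    max_error = z * std_err            # quantity added and subtracted
    return sample_mean - max_error, sample_mean + max_error

# Example 8-2 figures: mean = 15,528, s = 4200, n = 400, z = 2.58
lower, upper = z_interval(15528, 4200, 400, 2.58)
print(f"{lower:.2f} to {upper:.2f}")  # 14986.20 to 16069.80
```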

12

INTERVAL ESTIMATION OF A POPULATION MEAN: LARGE SAMPLES cont.

Definition

The maximum error of estimate for μ, denoted by E, is the quantity that is subtracted from and added to the value of \( \bar{x} \) to obtain a confidence interval for μ. Thus,

\[ E = z\,\sigma_{\bar{x}} \quad \text{or} \quad E = z\,s_{\bar{x}} \]

13

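Figure 8.2 Finding z for a 95% confidence level.

(Figure: normal curve centered at μ, with z = -1.96 and z = 1.96 marked on the z axis. The area between 0 and each of these values is .4750, so the total shaded area is .9500 or 95%.)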

14

Figure 8.3 Area in the tails.

(Figure: standard normal curve with area (1 – α) between -z and z and area α/2 in each tail.)
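The table lookup illustrated in Figures 8.2 and 8.3 can also be done numerically. The sketch below uses `statistics.NormalDist` from the Python standard library; the function name `z_for_confidence` is illustrative. Note that the slides use table values rounded to two decimals (1.65 and 2.58), so results computed this way can differ slightly in the last digit.

```python
from statistics import NormalDist

def z_for_confidence(confidence):
    """Return the z value that leaves alpha/2 in each tail."""
    alpha = 1 - confidence
    # Area to the left of z is (1 - alpha) plus the left tail alpha/2.
    return NormalDist().inv_cdf(1 - alpha / 2)

for level in (0.90, 0.95, 0.99):
    print(level, round(z_for_confidence(level), 3))
# 0.9 1.645
# 0.95 1.96
# 0.99 2.576
```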

15

Example 8-1

A publishing company has just published a new college textbook. Before the company decides the price at which to sell this textbook, it wants to know the average price of all such textbooks in the market. The research department at the company took a sample of 36 comparable textbooks and collected information on their prices. This information produces a mean price of $70.50 for this sample. It is known that the standard deviation of the prices of all such textbooks is $4.50.

16

Example 8-1

(a) What is the point estimate of the mean price of all such textbooks? What is the margin of error for the estimate?

(b) Construct a 90% confidence interval for the mean price of all such college textbooks.

17

Solution 8-1

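a) n = 36, \( \bar{x} \) = $70.50, and σ = $4.50

Point estimate of μ = \( \bar{x} \) = $70.50

\[ \sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} = \frac{4.50}{\sqrt{36}} = \$.75 \]

\[ \text{Margin of error} = \pm 1.96\,\sigma_{\bar{x}} = \pm 1.96(.75) = \pm \$1.47 \]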

18

Solution 8-1

b) Confidence level is 90% or .90.

z = 1.65.

\[ \bar{x} \pm z\,\sigma_{\bar{x}} = 70.50 \pm 1.65(.75) = 70.50 \pm 1.24 \]

\[ = (70.50 - 1.24) \text{ to } (70.50 + 1.24) = \$69.26 \text{ to } \$71.74 \]

19

Solution 8-1

We can say that we are 90% confident that the mean price of all such college textbooks is between $69.26 and $71.74.

20

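Figure 8.4 Confidence intervals.

(Figure: the sampling distribution of \( \bar{x} \), with the intervals \( \bar{x}_1 \pm 1.65\,\sigma_{\bar{x}} \), \( \bar{x}_2 \pm 1.65\,\sigma_{\bar{x}} \), and \( \bar{x}_3 \pm 1.65\,\sigma_{\bar{x}} \) drawn around three sample means \( \bar{x}_1 \), \( \bar{x}_2 \), and \( \bar{x}_3 \).)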

21

Example 8-2

According to a report by the Consumer Federation of America, National Credit Union Foundation, and the Credit Union National Association, households with negative assets carried an average of $15,528 in debt in 2002 (CBS.MarketWatch.com, May 14, 2002). Assume that this mean was based on a random sample of 400 households and that the standard deviation of debts for households in this sample was $4200. Make a 99% confidence interval for the 2002 mean debt for all such households.

22

Solution 8-2

Confidence level is 99% or .99.

The sample is large (n > 30). Therefore, we use the normal distribution.

z = 2.58

\[ s_{\bar{x}} = \frac{s}{\sqrt{n}} = \frac{4200}{\sqrt{400}} = \$210 \]

23

Solution 8-2

\[ \bar{x} \pm z\,s_{\bar{x}} = 15{,}528 \pm 2.58(210) = 15{,}528 \pm 541.80 = \$14{,}986.20 \text{ to } \$16{,}069.80 \]

Thus, we can state with 99% confidence that the 2002 mean debt for all households with negative assets was between $14,986.20 and $16,069.80.

24

INTERVAL ESTIMATION OF A POPULATION MEAN: SMALL SAMPLES

The t Distribution

Confidence Interval for μ Using the t Distribution

25

The t Distribution

Conditions Under Which the t Distribution Is Used to Make a Confidence Interval About μ

The t distribution is used to make a confidence interval about μ if

1. The population from which the sample is drawn is (approximately) normally distributed
2. The sample size is small (that is, n < 30)
3. The population standard deviation, σ, is not known

26

The t Distribution cont.

The t distribution is a specific type of bell-shaped distribution with a lower height and a wider spread than the standard normal distribution. As the sample size becomes larger, the t distribution approaches the standard normal distribution. The t distribution has only one parameter, called the degrees of freedom (df). The mean of the t distribution is equal to 0 and its standard deviation is \( \sqrt{df/(df-2)} \).

27

Figure 8.5 The t distribution for df = 9 and the standard normal distribution.

(Figure: both curves centered at μ = 0. The standard deviation of the standard normal distribution is 1.0. The standard deviation of the t distribution is \( \sqrt{9/(9-2)} = 1.134 \).)
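The standard deviation formula for the t distribution is easy to check numerically. This is a small sketch with an illustrative function name, `t_std_dev`; the guard reflects the fact that the expression under the square root only makes sense when df is greater than 2.

```python
import math

def t_std_dev(df):
    """Return sqrt(df / (df - 2)), the standard deviation of a t distribution."""
    if df <= 2:
        raise ValueError("the formula sqrt(df / (df - 2)) requires df > 2")
    return math.sqrt(df / (df - 2))

print(round(t_std_dev(9), 3))  # 1.134, the value quoted for Figure 8.5
```

For larger df the result moves toward 1.0, in line with the statement that the t distribution approaches the standard normal distribution.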

28

Example 8-3

Find the value of t for 16 degrees of freedom and .05 area in the right tail of a t distribution curve.

29

Table 8.1 Determining t for 16 df and .05 Area in the Right Tail

        Area in the Right Tail Under the t Distribution Curve
df      .10      .05      .025      …    .001
1       3.078    6.314    12.706    …    318.309
2       1.886    2.920    4.303     …    22.327
3       1.638    2.353    3.182     …    10.215
.       …        …        …         …    …
16      1.337    1.746    2.120     …    3.686

The required value of t for 16 df and .05 area in the right tail is 1.746 (row df = 16, column .05).

30

Figure 8.6 The value of t for 16 df and .05 area in the right tail.

(Figure: t distribution curve with df = 16. The area to the right of t = 1.746 is .05; this is the required value of t.)

31

Figure 8.7 The value of t for 16 df and .05 area in the left tail.

(Figure: t distribution curve with df = 16. The area to the left of t = -1.746 is .05.)

32

Confidence Interval for μ Using the t Distribution

The (1 – α)100% confidence interval for μ is

\[ \bar{x} \pm t\,s_{\bar{x}} \quad \text{where} \quad s_{\bar{x}} = \frac{s}{\sqrt{n}} \]

The value of t is obtained from the t distribution table for n – 1 degrees of freedom and the given confidence level.
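A hedged Python sketch of the t interval follows. Python's standard library does not provide t distribution quantiles, so this illustrative helper (`t_interval`, a name not taken from the slides) expects the t value looked up in the table for n – 1 degrees of freedom. The inputs are the figures from Example 8-4 of this chapter.

```python
import math

def t_interval(sample_mean, s, n, t):
    """Return (df, lower, upper) for sample_mean ± t * s / sqrt(n).

    t must be the table value for df = n - 1 and the chosen confidence level.
    """
    df = n - 1
    std_err = s / math.sqrt(n)
    return df, sample_mean - t * std_err, sample_mean + t * std_err

# Example 8-4 figures: mean = 186, s = 12, n = 25, t = 2.064 (24 df, 95%)
df, lower, upper = t_interval(186, 12, 25, 2.064)
print(df, f"{lower:.2f} to {upper:.2f}")  # 24 181.05 to 190.95
```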

33

Example 8-4

Dr. Moore wanted to estimate the mean cholesterol level for all adult men living in Hartford. He took a sample of 25 adult men from Hartford and found that the mean cholesterol level for this sample is 186 with a standard deviation of 12. Assume that the cholesterol levels for all adult men in Hartford are (approximately) normally distributed. Construct a 95% confidence interval for the population mean μ.

34

Solution 8-4

Confidence level is 95% or .95.

df = n – 1 = 25 – 1 = 24

Area in each tail = .5 – (.95/2) = .5 – .4750 = .025

The value of t in the right tail is 2.064.

\[ s_{\bar{x}} = \frac{s}{\sqrt{n}} = \frac{12}{\sqrt{25}} = 2.40 \]

35

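Figure 8.8 The value of t.

(Figure: t distribution curve with df = 24. The area between 0 and each of t = -2.064 and t = 2.064 is .4750, and the area in each tail is .025.)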

36

Solution 8-4

\[ \bar{x} \pm t\,s_{\bar{x}} = 186 \pm 2.064(2.40) = 186 \pm 4.95 = 181.05 \text{ to } 190.95 \]

Thus, we can state with 95% confidence that the mean cholesterol level for all adult men living in Hartford lies between 181.05 and 190.95.

37

Example 8-5

Twenty-five randomly selected adults who buy books for general reading were asked how much they usually spend on books per year. The sample produced a mean of $1450 and a standard deviation of $300 for such annual expenses. Assume that such expenses for all adults who buy books for general reading have an approximate normal distribution. Determine a 99% confidence interval for the corresponding population mean.

38

Solution 8-5

Confidence level is 99% or .99.

df = n – 1 = 25 – 1 = 24

Area in each tail = .5 – (.99/2) = .5 – .4950 = .005

The values of t are 2.797 and -2.797.

\[ s_{\bar{x}} = \frac{s}{\sqrt{n}} = \frac{300}{\sqrt{25}} = \$60 \]

39

Solution 8-5

The 99% confidence interval for μ is

\[ \bar{x} \pm t\,s_{\bar{x}} = \$1450 \pm 2.797(60) = \$1450 \pm \$167.82 = \$1282.18 \text{ to } \$1617.82 \]

40

INTERVAL ESTIMATION OF A POPULATION PROPORTION: LARGE SAMPLES

Estimator of the Standard Deviation of \( \hat{p} \)

The value of \( s_{\hat{p}} \), which gives a point estimate of \( \sigma_{\hat{p}} \), is calculated as

\[ s_{\hat{p}} = \sqrt{\frac{\hat{p}\hat{q}}{n}} \]

41

INTERVAL ESTIMATION OF A POPULATION PROPORTION: LARGE SAMPLES cont.

Confidence Interval for the Population Proportion, p

The (1 – α)100% confidence interval for the population proportion, p, is

\[ \hat{p} \pm z\,s_{\hat{p}} \]

The value of z used here is obtained from the standard normal distribution table for the given confidence level, and \( s_{\hat{p}} = \sqrt{\hat{p}\hat{q}/n} \).
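The proportion interval can be sketched in Python along the same lines. The function name `proportion_interval` is an assumption for illustration. The large-sample check mirrors the condition stated in Solution 8-6 of this chapter (n p̂ and n q̂ both greater than 5), and the inputs are the Example 8-6 figures.

```python
import math

def proportion_interval(p_hat, n, z):
    """Return (lower, upper) for p_hat ± z * sqrt(p_hat * q_hat / n)."""
    q_hat = 1 - p_hat
    # Large-sample condition used in this chapter: both counts exceed 5.
    if n * p_hat <= 5 or n * q_hat <= 5:
        raise ValueError("sample is too small for the normal approximation")
    std_err = math.sqrt(p_hat * q_hat / n)
    return p_hat - z * std_err, p_hat + z * std_err

# Example 8-6 figures: p_hat = .20, n = 1000, z = 2.58 (99% confidence)
lower, upper = proportion_interval(0.20, 1000, 2.58)
print(f"{lower:.3f} to {upper:.3f}")  # 0.167 to 0.233
```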

42

Example 8-6

According to a 2002 survey by FindLaw.com, 20% of Americans needed legal advice during the past year to resolve such thorny issues as family trusts and landlord disputes (CBS.MarketWatch.com, August 6, 2002). Suppose a recent sample of 1000 adult Americans showed that 20% of them needed legal advice during the past year to resolve such family-related issues.

43

Example 8-6

a) What is the point estimate of the population proportion? What is the margin of error of this estimate?

b) Find, with a 99% confidence level, the percentage of all adult Americans who needed legal advice during the past year to resolve such family-related issues.

44

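Solution 8-6

n = 1000, \( \hat{p} \) = .20, and \( \hat{q} \) = .80

\[ s_{\hat{p}} = \sqrt{\frac{\hat{p}\hat{q}}{n}} = \sqrt{\frac{(.20)(.80)}{1000}} = .01264911 \]

Note that \( n\hat{p} \) and \( n\hat{q} \) are both greater than 5.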

45

Solution 8-6

a)

Point estimate of p = \( \hat{p} \) = .20

Margin of error = \( \pm 1.96\,s_{\hat{p}} \) = ±1.96(.01264911) = ±.025 or ±2.5%

46

Solution 8-6

b) The confidence level is 99%, or .99.

The z value for .4950 is approximately 2.58.

\[ \hat{p} \pm z\,s_{\hat{p}} = .20 \pm 2.58(.01264911) = .20 \pm .033 = .167 \text{ to } .233 \text{ or } 16.7\% \text{ to } 23.3\% \]

47

Example 8-7

According to the analysis of a CNN–USA TODAY–Gallup poll conducted in October 2002, “Stress has become a common part of everyday life in the United States. The demands of work, family, and home place an increasing burden on the average American.” According to this poll, 40% of Americans included in the survey indicated that they had a limited amount of time to relax (Gallup.com, November 8, 2002). The poll was based on a randomly selected national sample of 1502 adults aged 18 and older. Construct a 95% confidence interval for the corresponding population proportion.

48

Solution 8-7

Confidence level = 95% or .95

The value of z for .95 / 2 = .4750 is 1.96.

\[ s_{\hat{p}} = \sqrt{\frac{\hat{p}\hat{q}}{n}} = \sqrt{\frac{(.40)(.60)}{1502}} = .01264069 \]

49

Solution 8-7

\[ \hat{p} \pm z\,s_{\hat{p}} = .40 \pm 1.96(.01264069) = .40 \pm .025 = .375 \text{ to } .425 \text{ or } 37.5\% \text{ to } 42.5\% \]

50

DETERMINING THE SAMPLE SIZE FOR THE ESTIMATION OF THE MEAN

Given the confidence level and the standard deviation of the population, the sample size that will produce a predetermined maximum error E of the confidence interval estimate of μ is

\[ n = \frac{z^2 \sigma^2}{E^2} \]

where E is

\[ E = z\,\sigma_{\bar{x}} = z \cdot \frac{\sigma}{\sqrt{n}} \]
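The sample size formula can be sketched in Python as below. The function name `sample_size_for_mean` is illustrative. The result is rounded up to the next whole number, since a fractional sample is not possible and rounding down would exceed the allowed error; the small `round` call is only a guard against floating-point noise. The inputs are the figures from Example 8-8 of this chapter.

```python
import math

def sample_size_for_mean(z, sigma, max_error):
    """Return the smallest whole n with z * sigma / sqrt(n) <= max_error."""
    raw = (z * sigma / max_error) ** 2   # n = z^2 * sigma^2 / E^2
    # round() removes tiny floating-point error before rounding up.
    return math.ceil(round(raw, 8))

# Example 8-8 figures: z = 2.58 (99%), sigma = $11,800, E = $800
print(sample_size_for_mean(2.58, 11800, 800))  # 1449
```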

51

Example 8-8

An alumni association wants to estimate the mean debt of this year’s college graduates. It is known that the population standard deviation of the debts of this year’s college graduates is $11,800. How large a sample should be selected so that the estimate with a 99% confidence level is within $800 of the population mean?

52

Solution 8-8

\[ n = \frac{z^2 \sigma^2}{E^2} = \frac{(2.58)^2 (11{,}800)^2}{(800)^2} = 1448.18 \approx 1449 \]

53

DETERMINING THE SAMPLE SIZE FOR THE ESTIMATION OF PROPORTION

Given the confidence level and the values of p and q, the sample size that will produce a predetermined maximum error E of the confidence interval estimate of p is

\[ n = \frac{z^2 p q}{E^2} \]

where E is

\[ E = z\,\sigma_{\hat{p}} = z \sqrt{\frac{pq}{n}} \]

54

DETERMINING THE SAMPLE SIZE FOR THE ESTIMATION OF PROPORTION cont.

In case the values of p and q are not known:

1. Take the most conservative estimate of the sample size n by using p = .5 and q = .5. For a given E, these values of p and q will give the largest sample size in comparison to any other pair of values of p and q, since the product of p = .5 and q = .5 is greater than the product of any other pair.

55

DETERMINING THE SAMPLE SIZE FOR THE ESTIMATION OF PROPORTION cont.

2. Take a preliminary sample of arbitrarily determined size and calculate \( \hat{p} \) and \( \hat{q} \) from this sample. Then use them to find n.
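Both approaches to the sample size for a proportion can be sketched with one Python helper. The name `sample_size_for_proportion` is an assumption for illustration; the default p = 0.5 corresponds to the most conservative estimate, and passing a value of p from a preliminary sample corresponds to the second approach. The inputs are the figures from Examples 8-9 and 8-10 of this chapter.

```python
import math

def sample_size_for_proportion(z, max_error, p=0.5):
    """Return the whole-number sample size n = z^2 * p * q / E^2, rounded up.

    With the default p = 0.5 the product p * q is at its maximum (0.25),
    which gives the most conservative (largest) sample size.
    """
    q = 1 - p
    raw = z ** 2 * p * q / max_error ** 2
    # round() removes tiny floating-point error before rounding up.
    return math.ceil(round(raw, 8))

print(sample_size_for_proportion(1.96, 0.02))        # 2401 (Example 8-9)
print(sample_size_for_proportion(1.96, 0.02, 0.07))  # 626 (Example 8-10)
```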

56

Example 8-9

Lombard Electronics Company has just installed a new machine that makes a part that is used in clocks. The company wants to estimate the proportion of these parts produced by this machine that are defective. The company manager wants this estimate to be within .02 of the population proportion for a 95% confidence level. What is the most conservative estimate of the sample size that will limit the maximum error to within .02 of the population proportion?

57

Solution 8-9

The value of z for a 95% confidence level is 1.96.

p = .50 and q = .50

\[ n = \frac{z^2 p q}{E^2} = \frac{(1.96)^2 (.50)(.50)}{(.02)^2} = 2401 \]

Thus, if the company takes a sample of 2401 parts, there is a 95% chance that the estimate of p will be within .02 of the population proportion.

58

Example 8-10

Consider Example 8-9 again. Suppose a preliminary sample of 200 parts produced by this machine showed that 7% of them are defective. How large a sample should the company select so that the 95% confidence interval for p is within .02 of the population proportion?

59

Solution 8-10

\( \hat{p} \) = .07 and \( \hat{q} \) = .93

\[ n = \frac{z^2 \hat{p}\hat{q}}{E^2} = \frac{(1.96)^2 (.07)(.93)}{(.02)^2} = \frac{(3.8416)(.07)(.93)}{.0004} = 625.22 \approx 626 \]
