chapter 8: producing data sampling

13
CHAPTER 8: Producing Data Sampling ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation

Upload: pamela-green

Post on 02-Jan-2016

46 views

Category:

Documents


4 download

DESCRIPTION

CHAPTER 8: Producing Data Sampling. ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation. Chapter 8 Concepts. Population vs. Sample Bad Sampling method Simple Random Samples (SRS) Inference About the Population. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CHAPTER 8: Producing Data Sampling

CHAPTER 8:Producing DataSampling

ESSENTIAL STATISTICSSecond Edition

David S. Moore, William I. Notz, and Michael A. Fligner

Lecture Presentation

Page 2: CHAPTER 8: Producing Data Sampling

Chapter 8 Concepts

Population vs. Sample

Bad Sampling method

Simple Random Samples (SRS)

Inference About the Population

2

Page 3: CHAPTER 8: Producing Data Sampling

Chapter 8 Objectives

Identify the population and sample in a survey

Identify bad sampling methods

Select a simple random sample

Describe different sampling methods

3

Page 4: CHAPTER 8: Producing Data Sampling

The distinction between population and sample is basic to statistics. To make sense of any sample result, you must know what population the sample represents.

4Population and Sample

The population in a statistical study is the entire group of individuals about which we want information.

A sample is the part of the population from which we actually collect information. We use information from a sample to draw conclusions about the entire population.

The population in a statistical study is the entire group of individuals about which we want information.

A sample is the part of the population from which we actually collect information. We use information from a sample to draw conclusions about the entire population.

Population

Sample

Collect data from a representative Sample...

Make an Inference about the Population.

Page 5: CHAPTER 8: Producing Data Sampling

Population and Sample

■Researchers often want to answer questions about some large group of individuals (this group is called the population)

■ Often the researchers cannot measure (or survey) all individuals in the population, so they measure a subset of individuals that is chosen to represent the entire population (this subset is called a sample) ■ The researchers then use statistical techniques to make conclusions about the population based on the sample

5

Page 6: CHAPTER 8: Producing Data Sampling

6Bad Sampling Designs

The design of a sample is biased if it systematically favors certain outcomes.

Voluntary response samplingallowing individuals to choose to be in the sample.Voluntary response samples show bias because people with strong opinions (often in the same direction) are most likely to respond.

Voluntary response samplingallowing individuals to choose to be in the sample.Voluntary response samples show bias because people with strong opinions (often in the same direction) are most likely to respond.

Convenience Samplingselecting individuals that are easiest to reachConvenience Samplingselecting individuals that are easiest to reach

Page 7: CHAPTER 8: Producing Data Sampling

7Simple Random Samples

Random sampling, the use of chance to select a sample, is the central principle of statistical sampling.

■ Each individual in the population has the same chance of being chosen for the sample.■ Each group of individuals (in the population) with size n has the same chance of being selected to be the sample

■ Each individual in the population has the same chance of being chosen for the sample.■ Each group of individuals (in the population) with size n has the same chance of being selected to be the sample

In practice, people use random numbers generated by a computer or calculator to choose samples. If you don’t have technology handy, you can use a table of random digits.

Page 8: CHAPTER 8: Producing Data Sampling

8How to Choose a SRS

A table of random digits is a long string of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 with these properties:

◙ Each entry in the table is equally likely to be any of the 10 digits 0–9.

◙ The entries are independent of each other. That is, knowledge of one part of the table gives no information about any other part.

◙ each pair of entries is equally likely to be any of the 100 pairs 00, 01,…, 99

◙ each triple of entries is equally likely to be any of the 1000 values 000, 001, …, 999

A table of random digits is a long string of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 with these properties:

◙ Each entry in the table is equally likely to be any of the 10 digits 0–9.

◙ The entries are independent of each other. That is, knowledge of one part of the table gives no information about any other part.

◙ each pair of entries is equally likely to be any of the 100 pairs 00, 01,…, 99

◙ each triple of entries is equally likely to be any of the 1000 values 000, 001, …, 999

Step 1: Label. Give each member of the population a numerical label of the same length.

Step 2: Table. Read consecutive groups of digits of the appropriate length from Table B.

Your sample contains the individuals whose labels you find.

Step 1: Label. Give each member of the population a numerical label of the same length.

Step 2: Table. Read consecutive groups of digits of the appropriate length from Table B.

Your sample contains the individuals whose labels you find.

How to Choose an SRS Using Table BHow to Choose an SRS Using Table B

Page 9: CHAPTER 8: Producing Data Sampling

9How to Choose a SRSThe table B is just a long list of randomly chosen digits

It listed in groups of five and in numbered rows, the groups and rows have no meaning

The table B is just a long list of randomly chosen digits

It listed in groups of five and in numbered rows, the groups and rows have no meaning

Step 1: Label. Label each individual in the population of the same length in the population.

Step 2: Table. Use Table B to select labels at random.

Your sample contains the individuals whose labels you find.

Step 1: Label. Label each individual in the population of the same length in the population.

Step 2: Table. Use Table B to select labels at random.

Your sample contains the individuals whose labels you find.

How to Choose an SRS Using Table BHow to Choose an SRS Using Table B

Page 10: CHAPTER 8: Producing Data Sampling

10SRS Example

01 Aloha Kai 08 Captiva 15 Palm Tree 22 Sea Shell02 Anchor Down 09 Casa del Mar 16 Radisson 23 Silver Beach03 Banana Bay 10 Coconuts 17 Ramada 24 Sunset Beach04 Banyan Tree 11 Diplomat 18 Sandpiper 25 Tradewinds05 Beach Castle 12 Holiday Inn 19 Sea Castle 26 Tropical Breeze06 Best Western 13 Lime Tree 20 Sea Club 27 Tropical Shores07 Cabana 14 Outrigger 21 Sea Grape 28 Veranda

69051 64817 87174 09517 84534 06489 87201 9724569051 64817 87174 09517 84534 06489 87201 97245

69 05 16 48 17 87 17 40 95 17 84 53 40 64 89 87 20

Our SRS of four hotels for the editors to contact is: 05 Beach Castle, 16 Radisson, 17 Ramada, and 20 Sea Club.

Use the random digits provided to select an SRS of four hotels.

Page 11: CHAPTER 8: Producing Data Sampling

11Inference

The purpose of a sample is to give us information about a larger population.

The process of drawing conclusions about a population on the basis of sample data is called inference.

Why should we rely on random sampling?

1.To eliminate bias in selecting samples from the list of available individuals.

2.The laws of probability allow trustworthy inference about the population.

• Results from random samples come with a margin of error that sets bounds on the size of the likely error.

• Larger random samples give better information about the population than smaller samples.

Why should we rely on random sampling?

1.To eliminate bias in selecting samples from the list of available individuals.

2.The laws of probability allow trustworthy inference about the population.

• Results from random samples come with a margin of error that sets bounds on the size of the likely error.

• Larger random samples give better information about the population than smaller samples.

Page 12: CHAPTER 8: Producing Data Sampling

12Cautions About Sample Surveys

Good sampling technique includes the art of reducing all sources of error.

Undercoverage occurs when some groups in the population are left out of the process of choosing the sample.

Nonresponse occurs when an individual chosen for the sample can’t be contacted or refuses to participate.

A systematic pattern of incorrect responses in a sample survey leads to response bias.

The wording of questions is the most important influence on the answers given to a sample survey.

Undercoverage occurs when some groups in the population are left out of the process of choosing the sample.

Nonresponse occurs when an individual chosen for the sample can’t be contacted or refuses to participate.

A systematic pattern of incorrect responses in a sample survey leads to response bias.

The wording of questions is the most important influence on the answers given to a sample survey.

Page 13: CHAPTER 8: Producing Data Sampling

Chapter 8 Objectives Review

Identify the population and sample in a survey

Identify bad sampling methods

Select a simple random sample

Describe different sampling methods

Recognize cautions about sample surveys

Describe how to make an inference about the population from a sample

13