sampling and sample size determination. terms sample sample population population population element...

42
Sampling and Sample Sampling and Sample Size Determination Size Determination

Post on 20-Dec-2015

256 views

Category:

Documents


8 download

TRANSCRIPT

Page 1: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Sampling and Sample Sampling and Sample Size DeterminationSize Determination

Page 2: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

TermsTerms

SampleSample PopulationPopulation Population elementPopulation element CensusCensus

Page 3: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Why use a sample?Why use a sample?

CostCost SpeedSpeed AccuracyAccuracy Destruction of test unitsDestruction of test units

Page 4: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

StepsSteps

Definition of target populationDefinition of target population Selection of a sampling frame (list)Selection of a sampling frame (list) Probability or Nonprobability Probability or Nonprobability

samplingsampling Sampling UnitSampling Unit ErrorError

– – Random sampling error (chance Random sampling error (chance fluctuations)fluctuations)– Nonsampling error (design errors)Nonsampling error (design errors)

Page 5: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Target Population Target Population (step 1)(step 1) Who has the information/data you Who has the information/data you

need?need? How do you define your target How do you define your target

population?population?- Geography - Geography - Demographics- Demographics- Use- Use- Awareness- Awareness

Page 6: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Operational DefinitionOperational Definition

A definition that gives meaning to A definition that gives meaning to a concept by specifying the a concept by specifying the activities necessary to measure it.activities necessary to measure it.- Eg. Student, employee, user, area, Eg. Student, employee, user, area,

major news paper.major news paper.

What variables need further definition?What variables need further definition?

(Items per construct)(Items per construct)

Page 7: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Sampling Frame Sampling Frame (step 2)(step 2) List of elementsList of elements

Sampling Frame errorSampling Frame error– Error that occurs when certain Error that occurs when certain

sample elements are not listed or sample elements are not listed or available and are not represented in available and are not represented in the sampling framethe sampling frame

Page 8: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Probability or Probability or Nonprobability (step Nonprobability (step 3)3)Probability Sample: Probability Sample:

– A sampling technique in which every A sampling technique in which every member of the population will have member of the population will have a known, nonzero probability of a known, nonzero probability of being selectedbeing selected

Page 9: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Non-Probability Sample: Non-Probability Sample: – Units of the sample are chosen on Units of the sample are chosen on

the basis of personal judgment or the basis of personal judgment or convenienceconvenience

– There are NO statistical techniques There are NO statistical techniques for measuring random sampling for measuring random sampling error in a non-probability sample. error in a non-probability sample. Therefore, generalizability is Therefore, generalizability is nevernever statistically appropriate.statistically appropriate.

Page 10: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Classification of Sampling Methods

SamplingMethods

ProbabilitySamples

SimpleRandom

Cluster

Systematic Stratified

Non-probability

QuotaJudgment

Convenience Snowball

Page 11: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Probability Sampling Methods

Simple Random Sampling the purest form of probability sampling. Assures each element in the population

has an equal chance of being included in the sample

Random number generators

Probability of Selection = Sample Size

Population Size

Page 12: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages minimal knowledge of population needed External validity high; internal validity

high; statistical estimation of error Easy to analyze data

Disadvantages High cost; low frequency of use Requires sampling frame Does not use researchers’ expertise Larger risk of random error than stratified

Page 13: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Systematic Sampling An initial starting point is selected by a

random process, and then every nth number on the list is selected

n=sampling intervalThe number of population elements

between the units selected for the sample

Error: periodicity- the original list has a systematic pattern

?? Is the list of elements randomized??

Page 14: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages Moderate cost; moderate usage External validity high; internal validity

high; statistical estimation of error Simple to draw sample; easy to verify

Disadvantages Periodic ordering Requires sampling frame

Page 15: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Stratified Sampling Sub-samples are randomly drawn from

samples within different strata that are more or less equal on some characteristic

Why? Can reduce random error

More accurately reflect the population by more proportional representation

Page 16: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages minimal knowledge of population needed External validity high; internal validity

high; statistical estimation of error Easy to analyze data

Disadvantages High cost; low frequency of use Requires sampling frame Does not use researchers’ expertise Larger risk of random error than stratified

Page 17: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Systematic Sampling An initial starting point is selected by a

random process, and then every nth number on the list is selected

n=sampling intervalThe number of population elements

between the units selected for the sample

Error: periodicity- the original list has a systematic pattern

?? Is the list of elements randomized??

Page 18: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages Moderate cost; moderate usage External validity high; internal validity

high; statistical estimation of error Simple to draw sample; easy to verify

Disadvantages Periodic ordering Requires sampling frame

Page 19: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Stratified Sampling Sub-samples are randomly drawn from

samples within different strata that are more or less equal on some characteristic

Why? Can reduce random error

More accurately reflect the population by more proportional representation

Page 20: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

How?1.Identify variable(s) as an efficient basis

for stratification. Must be known to be related to dependent variable. Usually a categorical variable

2.Complete list of population elements must be obtained

3.Use randomization to take a simple random sample from each stratum

Page 21: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Types of Stratified SamplesProportional Stratified Sample:

The number of sampling units drawn from each stratum is in proportion to the relative population size of that stratum

Disproportional Stratified Sample: The number of sampling units drawn

from each stratum is allocated according to analytical considerations e.g. as variability increases sample size of stratum should increase

Page 22: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Types of Stratified Samples…Optimal allocation stratified sample:

The number of sampling units drawn from each stratum is determined on the basis of both size and variation.

Calculated statistically

Page 23: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages Assures representation of all groups in

sample population needed Characteristics of each stratum can be

estimated and comparisons made Reduces variability from systematic

Disadvantages Requires accurate information on

proportions of each stratum Stratified lists costly to prepare

Page 24: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Cluster Sampling The primary sampling unit is not the

individual element, but a large cluster of elements. Either the cluster is randomly selected or the elements within are randomly selected

Why? Frequently used when no list of population available or because of cost

Ask: is the cluster as heterogeneous as the population? Can we assume it is representative?

Page 25: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Cluster Sampling example You are asked to create a sample of all

Management students who are working in Lethbridge during the summer term

There is no such list available Using stratified sampling, compile a list of

businesses in Lethbridge to identify clusters

Individual workers within these clusters are selected to take part in study

Page 26: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Types of Cluster SamplesArea sample:

Primary sampling unit is a geographical area

Multistage area sample: Involves a combination of two or more

types of probability sampling techniques. Typically, progressively smaller geographical areas are randomly selected in a series of steps

Page 27: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages Low cost/high frequency of use Requires list of all clusters, but only of

individuals within chosen clusters Can estimate characteristics of both cluster and

population For multistage, has strengths of used methods

Disadvantages Larger error for comparable size than other

probability methods Multistage very expensive and validity depends

on other methods used

Page 28: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Classification of Sampling Methods

SamplingMethods

ProbabilitySamples

SimpleRandom

Cluster

Systematic Stratified

Non-probability

QuotaJudgment

Convenience Snowball

Page 29: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Non-Probability Sampling Methods

Convenience Sample The sampling procedure used to obtain

those units or people most conveniently available

Why: speed and cost External validity? Internal validity Is it ever justified?

Page 30: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages Very low cost Extensively used/understood No need for list of population elements

Disadvantages Variability and bias cannot be measured

or controlled Projecting data beyond sample not

justified.

Page 31: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Judgment or Purposive Sample The sampling procedure in which an

experienced research selects the sample based on some appropriate characteristic of sample members… to serve a purpose

Page 32: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages Moderate cost Commonly used/understood Sample will meet a specific objective

Disadvantages Bias! Projecting data beyond sample not

justified.

Page 33: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Quota Sample The sampling procedure that ensure that

a certain characteristic of a population sample will be represented to the exact extent that the investigator desires

Page 34: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages moderate cost Very extensively used/understood No need for list of population elements Introduces some elements of stratification

Disadvantages Variability and bias cannot be measured

or controlled (classification of subjects0 Projecting data beyond sample not

justified.

Page 35: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Snowball sampling The sampling procedure in which the

initial respondents are chosen by probability or non-probability methods, and then additional respondents are obtained by information provided by the initial respondents

Page 36: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Advantages low cost Useful in specific circumstances Useful for locating rare populations

Disadvantages Bias because sampling units not

independent Projecting data beyond sample not

justified.

Page 37: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Determining Sample Determining Sample SizeSize What data do you need to considerWhat data do you need to consider

– Variance or heterogeneity of Variance or heterogeneity of populationpopulation

– The degree of acceptable error The degree of acceptable error (confidence interval)(confidence interval)

– Confidence levelConfidence level

– Generally, we need to make Generally, we need to make judgments on all these variablesjudgments on all these variables

Page 38: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Determining Sample Determining Sample SizeSize Variance or heterogeneity of Variance or heterogeneity of

populationpopulation– Previous studies? Industry Previous studies? Industry

expectations? Pilot study?expectations? Pilot study?– Sequential samplingSequential sampling– Rule of thumb: the value of standard Rule of thumb: the value of standard

deviation is expected to be 1/6 of deviation is expected to be 1/6 of the range.the range.

Page 39: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Determining Sample Determining Sample SizeSize

Formulas:Formulas:

Means Means n = (ZS/E) n = (ZS/E) 22

ProportionsProportions n = Zn = Z22 pq/ E pq/ E22

PercentilesPercentiles n = pc (100 – pc) Zn = pc (100 – pc) Z22/ E/ E22

Z at 95% confidence = 1.96Z at 95% confidence = 1.96

Z at 99% confidence = 2.58Z at 99% confidence = 2.58

Page 40: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Sample Size (Mean)Sample Size (Mean)

Exercise 1Exercise 1 We are about to go on a recruitment drive to We are about to go on a recruitment drive to

hire some auditors at the entry level. We need hire some auditors at the entry level. We need to decide on a competitive salary offer for to decide on a competitive salary offer for these new auditors. From talking to some HR these new auditors. From talking to some HR professionals, I’ve made a rough estimate that professionals, I’ve made a rough estimate that most new hires are getting starting salaries in most new hires are getting starting salaries in the $38-42,000 range and the average (mean) the $38-42,000 range and the average (mean) is around $39,000. The standard deviation is around $39,000. The standard deviation seems to be around $3000.seems to be around $3000.

I want to be 95% confident about the average I want to be 95% confident about the average salary and I’m willing to tolerate an estimate salary and I’m willing to tolerate an estimate that is within $500 (plus or minus) of the true that is within $500 (plus or minus) of the true estimate. If we’re off, we can always adjust estimate. If we’re off, we can always adjust salaries at the end of the probation period.salaries at the end of the probation period.

What sample size should we use?What sample size should we use?

Page 41: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Sample Size Sample Size (Proportion)(Proportion)Exercise 2Exercise 2 We’ve just started a new educational TV We’ve just started a new educational TV

program that teaches viewers all about program that teaches viewers all about research methods!!research methods!!

We know from past educational TV programs We know from past educational TV programs that such a program would likely capture 2 out that such a program would likely capture 2 out of 10 viewers on a typical night.of 10 viewers on a typical night.

Let’s say we want to be 99% confident that Let’s say we want to be 99% confident that our obtained sample proportion of viewers will our obtained sample proportion of viewers will differ from the true population proportions by differ from the true population proportions by not more than 5%.not more than 5%.

What sample size do we need?What sample size do we need?

Page 42: Sampling and Sample Size Determination. Terms Sample Sample Population Population Population element Population element Census Census

Sample size Sample size (Percentage)(Percentage)Exercise 3Exercise 3 We wish to determine the required sample We wish to determine the required sample

size with 95% confidence and 5% error size with 95% confidence and 5% error tolerance that the percentage of tolerance that the percentage of Canadians preferring the federal Liberal Canadians preferring the federal Liberal party.party.

A recent poll showed that 40% of A recent poll showed that 40% of Canadians questioned preferred the Canadians questioned preferred the Liberals.Liberals.

What is the required sample size?What is the required sample size?