inferential statistics © 2012 project lead the way, inc.introduction to engineering design

20
Inferential Statistics © 2012 Project Lead The Way, Inc. Introduction to Engineering Design

Upload: alberta-taylor

Post on 30-Dec-2015

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Inferential Statistics

© 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Page 2: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

• Often we do not have information on the entire population of interest

• Population versus sample– Population = all members of a group– Sample = part of a population

• Inferential statistics involves estimating or forecasting an outcome based on an incomplete set of data– use sample statistics

Research and Statistics

Page 3: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Population versus Sample Standard Deviation

– Population Standard Deviation• The measure of the spread of data within a

population. • Used when you have a data value for every

member of the entire population of interest.

– Sample Standard Deviation• An estimate of the spread of data within a larger

population.• Used when you do not have a data value for every

member of the entire population of interest.• Uses a subset (sample) of the data to generalize

the results to the larger population.

Page 4: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Population Standard Deviation

SampleStandard Deviation

A Note about Standard Deviation

σ = population standard deviation

xi = individual data value ( x1, x2, x3, …)

μ = population mean

N = size of population

σ=√∑ (x i−   μ )2

Ns=√∑ (x i−   x )2

n−1

s = sample standard deviation

xi = individual data value ( x1, x2, x3, …)

= sample mean

n = size of sample

Page 5: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Sample Standard Deviation Variation

Procedure:

1. Calculate the sample mean,.

2. Subtract the mean from each value and then square each difference.

3. Sum all squared differences.

4. Divide the summation by the number of data values minus one, n - 1.

5. Calculate the square root of the result.

s=√∑ (x i−   x )2

n−1

Page 6: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Sample Mean Central Tendency

= sample mean

xi = individual data value

= summation of all data values

n = # of data values in the sample

x  = ∑ x in

Essen

tially

the

sam

e ca

lculat

ion a

s

popu

lation

mea

n

Page 7: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Sample Standard Deviation

2, 5, 48, 49, 55, 58, 59, 60, 62, 63, 63

Estimate the standard deviation for a population for which the following data is a sample.

524

111. Calculate the sample mean

2. Subtract the sample mean from each data value and square the difference.

(2 - )2 = 2082.6777 (5 - )2 = 1817.8595(48 - )2 = 0.1322(49 - )2 = 1.8595(55 - )2 = 54.2231(58 - )2 = 107.4050

(59 - )2 = 129.1322(60 - )2 = 152.8595(62 - )2 = 206.3140(63 - )2 = 236.0413(63 - )2 = 236.0413

s=√∑ (x i−x )2

n  − 1

¿ 47.63x  = ∑ x in

(x i−    x )2

Page 8: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Sample Standard Deviation Variation

= 5,024.5455

=

= = 502.4545

√∑ (x i   −   x )2

n  − 1=√502.4545  = 22.4

3. Sum all squared differences.

4. Divide the summation by the number of sample data values minus one.

5. Calculate the square root of the result.

2082.6777 + 1817.8595 + 0.1322 + 1.8595 + 54.2231 + 107.4050 + 129.1322 + 152.8595 + 206.3140 + 236.0413 + 236.0413

Page 9: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

• General Rule: Don’t round until the final answer– If you are writing intermediate results you may

round values, but keep unrounded number in memory

• Mean – round to one more decimal place than the original data

• Standard Deviation: round to one more decimal place than the original data

A Note about Rounding in Statistics

Page 10: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Population Standard Deviation

SampleStandard Deviation

A Note about Standard Deviation

σ = population standard deviation

xi = individual data value ( x1, x2, x3, …)

μ = population mean

N = size of population

σ=√∑ (x i−   μ )2

Ns=√∑ (x i−   x )2

n  − 1

s = sample standard deviation

xi = individual data value ( x1, x2, x3, …)

= sample mean

n = size of sample

As n → N, s → σ

Page 11: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Population Standard Deviation

SampleStandard Deviation

A Note about Standard Deviation

σ = population standard deviation

xi = individual data value ( x1, x2, x3, …)

μ = population mean

N = size of population

σ=√∑ (x i−   μ )2

Ns=√∑ (x i−   x )2

n  − 1

s = sample standard deviation

xi = individual data value ( x1, x2, x3, …)

= sample mean

n = size of sample

Given the ACT score of every student in your

class, use the population standard deviation formula to find the standard deviation of

ACT scoresin the class.

Page 12: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Population Standard Deviation

SampleStandard Deviation

A Note about Standard Deviation

σ = population standard deviation

xi = individual data value ( x1, x2, x3, …)

μ = population mean

N = size of population

σ=√∑ (x i−   μ )2

Ns=√∑ (x i−   x )2

n  − 1

s = sample standard deviation

xi = individual data value ( x1, x2, x3, …)

= sample mean

n = size of sample

Given the ACT scores of every student in your

class, use the sample standard deviation formula to estimate the standard

deviation of the ACT scores of all students at

your school.

Page 13: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

A distribution of all possible values of a variable with an indication of the likelihood that each will occur– A probability distribution can be represented

by a probability density function• Normal Distribution – most commonly used

probability distribution

Probability Distribution Distribution

http://en.wikipedia.org/wiki/File:Normal_Distribution_PDF.svg

Page 14: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

“Is the data distribution normal?”• Translation: Is the histogram/dot plot bell-

shaped?

Normal Distribution Distribution

• Does the greatest frequency of the data values occur at about the mean value?

• Does the curve decrease on both sides away from the mean?

• Is the curve symmetric about the mean?

Page 15: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Fre

qu

ency

Data Elements

0 1 2 3 4 5 6-1-2-3-4-5-6

Bell shaped curve

Normal Distribution Distribution

Page 16: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Fre

qu

ency

Data Elements

0 1 2 3 4 5 6-1-2-3-4-5-6

Mean Value

Normal Distribution Distribution

Does the greatest frequency of the data values occur at about the mean value?

Page 17: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Fre

qu

ency

Data Elements

0 1 2 3 4 5 6-1-2-3-4-5-6

Mean Value

Normal Distribution Distribution

Does the curve decrease on both sides away from the mean?

Page 18: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

Fre

qu

ency

Data Elements

0 1 2 3 4 5 6-1-2-3-4-5-6

Mean Value

Normal Distribution Distribution

Is the curve symmetric about the mean?

Page 19: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

What if the data is not symmetric?

Histogram Interpretation: Skewed (Non-Normal) Right

Page 20: Inferential Statistics © 2012 Project Lead The Way, Inc.Introduction to Engineering Design

What if the data is not symmetric?

A normal distribution is a reasonable assumption.