statistics as a tool a set of tools for collecting, organizing, presenting and analyzing numerical...

Statistics as a Tool

A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Similar Concepts Across Cultures

Http://www.uptodate.com

Descriptive Statistics

Numerical facts or observations that are organized describe the frequencies, measures of central tendency, and degree of dispersion of variables in a sample of a larger population.

Environmental Studies: Air/Water

Levels of Measurement

Reflects type of information measured and helps determine what descriptive statistics and which statistical test can be used.

Four Levels of Measurement

NOIR -- no one is ready

Nominal – lowest level, categories, no rank

Ordinal – second lowest, ranked categories

Interval – next to highest, ranked categories with known units between rankings

Ratio – highest level, ranked categories with known intervals and an absolute zero

Descriptives for nominal and ordinal data

Frequencies and percentages Frequencies – absolute number of cases Percentages – relative number of cases

Descriptives for interval/ratio (scale) variables

Measures of central tendency– Mean -- sum of all cases divided by number of

cases– Median – case for which half of all other cases are

above and half of all other cases are below.– Mode – most frequently occurring case

Descriptives for scale variables

Measures of dispersion– Range – Value of cases from minimum to maximum– Standard Deviation – number which when added or

taken away from each case adds up to zero.– Variance – Standard deviation squared

Inferential statistics

Procedures used to make inferences from sample data and generalize findings to the population

Probability

Statistical significance – the probability that the difference or the association found in the sample would be present in the population.

Three common probabilities used <.05 <.01 <.001

Sampling bias

The systematic differences between sample in study and the larger population of interest.

The use of inferential statistics allows us to calculate the odds that what is found in the sample is due to sampling bias.

Statistical significance (p-levels)

When p < .05, the degree of difference or association being tested would only occur by chance alone five times out of a hundred.

When p < .01, the difference or association being observed would only occur by chance alone one time out of a hundred.

When p < .001……

Testing for statistically significant differences

When you want to see if there is a difference in outcome by group membership, or by treatment approach.

SPSS– Analyze

Compare means– Independent t-test

Planned Parenthood

Earth is in need of another one/third earth addition: 8 billion

Is there a significant difference in months of service and type of outcome?

Group Statistics

23 11.2174 5.11643 1.06685

14 10.3571 4.73298 1.26494

Was outcome positive?yes

no

MONTHSN Mean Std. Deviation

Std. ErrorMean

Independent Samples Test

.123 .728 .510 35 .613 .8602 1.68725 -2.56506 4.28556

.520 29.309 .607 .8602 1.65476 -2.52258 4.24307

Equal variancesassumed

Equal variancesnot assumed

MONTHSF Sig.

Levene's Test forEquality of Variances

t df Sig. (2-tailed)Mean

DifferenceStd. ErrorDifference Lower Upper

95% ConfidenceInterval of the

Difference

t-test for Equality of Means

The Need for Pure Water

What cvan be done? Ans: wells/solar heat

Emerging

Common Vectors

39

• Mosquitoes serve as vectors for Malaria, Dengue fever, Yellow fever, and Chikungunya

• Ticks can serve as vectors for Lyme disease, Rickettsia, and Babesiosis

Statistically significant differences i.v. nominal and d.v. interval/ratio

Analyze – Univariate (One d.v.; multiple predictors)– Multivariate (Multiple d.v.; multiple predictors)– Repeated measures (time series of dependent

measures; one predictor.

Statistically significant associations at higher levels of measurement

Analyze– Correlate

Bi-variate– Pearson’s (two interval/ratio variables)– Kendall’s tau (two ordinal variables)– Spearman’s rho (two ordinal variables)

Test of Pearson Correlation Coefficient (r)

Correlations

1 .069

. .685

37 37

.069 1

.685 .

37 37

Pearson Correlation

Sig. (2-tailed)

N

Pearson Correlation

Sig. (2-tailed)

N

MONTHS

Percent of positive casesfor each referral reasons

MONTHS

Percent of positive

cases foreach referral

reasons

Person-Doctor==Public Health Group===Population==Mamny

Independent t-test to determine statistical significance

Independent Samples Test

.464 .509 -13.516 12 .000 -13.0000 .96186 -15.09571 -10.90429

-13.516 10.955 .000 -13.0000 .96186 -15.11809 -10.88191

Equal variancesassumed

Equal variancesnot assumed

MONTHSF Sig.

Levene's Test forEquality of Variances

t df Sig. (2-tailed)Mean

DifferenceStd. ErrorDifference Lower Upper

95% ConfidenceInterval of the

Difference

t-test for Equality of Means

Differences between groups at lower levels of measurement

Analyze– Descriptives…– Crosstabs

– Identify variable in row and column– Select statistics

Two nominal (dichotomized) Chi-Square Nominal by ordinal Kendal’s tau-b Nominal by interval Eta

Difference in LOS by referral

Group Statistics

7 5.2857 2.05866 .77810

7 18.2857 1.49603 .56544

Reason for referralmental illness

sexual abuse

MONTHSN Mean Std. Deviation

Std. ErrorMean

Crosstabs to determine difference between groups

Reason for referral * Was outcome positive? Crosstabulation

6 1 7

85.7% 14.3% 100.0%

5 2 7

71.4% 28.6% 100.0%

5 3 8

62.5% 37.5% 100.0%

3 3 6

50.0% 50.0% 100.0%

4 5 9

44.4% 55.6% 100.0%

23 14 37

62.2% 37.8% 100.0%

Count

% within Reasonfor referral

Count


Count


Count


Count


Count


mental illness

sexual abuse

physical abuse

neglect

parentreturn

Reasonfor referral

Total

yes no

Was outcomepositive?

Total

Chi-Square tests

Chi-Square Tests

3.485a 4 .480

3.696 4 .449

3.334 1 .068

37

Pearson Chi-Square

Likelihood Ratio

Linear-by-LinearAssociation

N of Valid Cases

Value dfAsymp. Sig.

(2-sided)

9 cells (90.0%) have expected count less than 5. Theminimum expected count is 2.27.

a.

Which test to use when?

Decision is made by what the question is, the level of measurement of the variable and the extent to which assumptions of parametric statistics are met.

Question: Difference or Association? Level of measurement: NOIR? Sample size and distribution (normal?)

Tests comparing difference between 2 or more groups

Test Dependent variable

Independent variable

Paired (dependent) t-test

Interval/ratio pre and post tests

Nominal

Unpaired (independent t-test

Interval/ratio Nominal (2 grps)

ANOVA F-test Interval/ratio Nominal (>2 grps)

Chi-Square

(Nonparametric)

Nominal (Dichotomous)

Nominal

Tests demonstrating association between two groups

Test Dependent var. Independent var.

Spearman rho Ordinal Ordinal

Mann-Whitney U

Non-parametric

Ordinal Nominal

Pearson’s r Interval/ratio Interval/ratio

Tests demonstrating association between two groups, controlling for third variable

Test Dependent Independent

Logistic regression

Nominal Nominal

Linear regression Interval/ratio Interval/ratio

Pearson partial r Interval/ratio Interval/ratio

Kendall’s partial r Ordinal Ordinal

Mission,Manpower,Math,Maintain

How do you handle malnutrition?

Many healthcare task forces

How do you handle AIDS?

Different Theories About HIV/AIDS

Is Health a Right or is Health a Commodity?

USA Health Care Business Barriers

Some countries will lose

What can be done?

statistics as a tool a set of tools for collecting, organizing, presenting and analyzing numerical...

Documents