chapter 2 graphical summaries of data. definitions a frequency distribution is a table that...

Post on 30-Dec-2015

241 Views

Category:

Documents

3 Downloads

Preview:

Click to see full reader

TRANSCRIPT

CHAPTER 2

GRAPHICAL SUMMARIES OF DATA

DEFINITIONS

A frequency distribution is a table that represents the frequency for each category.

The relative frequency of a category is the frequency of the category divided by the sum of all the frequencies.

Difference between frequency and relative frequency:

• The frequency of a category is the number of items in the category.

• The relative frequency of a category is the proportion of items in the category.

FREQUENCY DISTRIBUTION FOR QUALITATIVE DATA

Type of Computer Frequency Relative Frequency

Desktop 11 0.22

Laptop 23 0.46

Notebook 9 0.18

Tablet 7 0.14

BAR GRAPH

A bar graph is a graphical representation of a frequency distribution. A bar graph consists of rectangles of equal width, with one rectangle for each category. The heights of the rectangles represent the frequencies or relative frequencies of the categories.

FREQUENCY BAR GRAPH

Laptop Desktop Notebook Tablet0

0.2

0.4

0.6

0.8

1

1.2

Types of Computers Sold

#REF!

RELATIVE FREQUENCY BAR GRAPH

Laptop Desktop Notebook Tablet0

0.050.1

0.150.2

0.250.3

0.350.4

0.450.5

Relative Frequency

Relative Frequency

PARETO CHART

Sometimes it is desirable to construct a bar graph in which the categories are presented in order of frequency or relative frequency, with the largest frequency or relative frequency on the left and the smallest one on the right Such a graph is called a Pareto Chart.

Laptop Desktop Notebook Tablet0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

Relative Frequency

Relative Frequency

SIDE-BY-SIDE BAR GRAPHS

Sometimes we want to compare two bar graphs that have the same categories. The best way to do this is to construct both bar graphs on the same axes, putting bars that correspond to the same category next to each other. The result is called a side-by-side bar graph.

PIE CHARTS

A pie chart is an alternative to the bar graph for displaying relative frequency information. A pie chart is a circle. The circle is divided into sections, one for each category.

Relative Frequency

Laptop

Desktop

Notebook

Tablet

FREQUENCY DISTRIBUTIONS FOR QUANTITATIVE DATA

Classes Frequency

0.00 – 0.99 9

1.00 – 1.99 26

2.00 – 2.99 11

3.00 – 3.99 13

4.00 – 4.99 3

5.00 – 5.99 1

6.00 – 6.99 2

Classes Frequency Relative Frequency

0.00 – 0.99 9 0.138

1.00 – 1.99 26 0.400

2.00 – 2.99 11 0.169

3.00 – 3.99 13 0.200

4.00 – 4.99 3 0.046

5.00 – 5.99 1 0.015

6.00 – 6.99 2 0.031

DEFINITIONS• The lower class limit (LCL) of a class is the smallest value

that can appear in that class.

• The upper class limit (UCL) of a class is the largest value that can appear in that class.

• The class width is the difference between consecutive lower class limits.

REQUIREMENTS FOR CHOOSING CLASSES• Every observation must fall into one of the classes.

• The classes must not overlap (they must be mutually exclusive).

• The classes must be of equal width.

• There must be no gaps between classes. Even if there are no observations within a class, that class must still appear on the frequency distribution.

HOW TO CONSTRUCT A FREQUENCY DISTRIBUTION

1. Decide how many classes are needed.

2. Compute the class width.

• Choose a starting point – either the minimum data value or some convenient number slightly smaller.

• Computer the lower class limits of the remaining classes by adding the class width to the previous LCL.

• Determine the upper class limits by looking at data and remembering that the classes need to be mutually exclusive.

• Make sure you have included the largest and the smallest observations in your classes.

• Tally the observations into each class.

Choosing the number of classes:

• There is no single right way to choose the number of classes.• Too many classes will produce a frequency distribution and

histogram that has too much detail.• Too few classes will produce a frequency distribution and

histogram that does not have enough detail.• For most data, the number of classes should be between 5

and 20.

GRAPHS• Histograms are related to bar graphs and are appropriate

for quantitative data.

OTHER GRAPHS• Stem-and-leaf plots are a simple way to display small data

sets.

• Dot plots

• Time-series plots

STEMPLOT (OR STEM-AND-LEAF PLOT)

Represents data by separating each value into two parts: the stem (such as the leftmost digit) and the leaf (such as the rightmost digit)

HOW TO LIE WITH STATISTICS• Check the vertical scale

• Pictographs – using pictures to compare amounts

• Three-dimensional graphs and perspective

MISUSE # 1- BAD SAMPLESVoluntary response sample (or self-selected sample)

one in which the respondents themselves decide whether to be included

In this case, valid conclusions can be made only about the specific group of people who agree to participate.

MISUSE # 2- SMALL SAMPLES

Conclusions should not be based on samples that are far too small.

Example: Basing a school suspension rate on a sample of only three students

To correctly interpret a graph, you must analyze the numerical information given in the graph, so as not to be misled by the graph’s shape.

MISUSE # 3- GRAPHS

Part (b) is designed to exaggerate the difference by increasing each dimension in proportion to the actual amounts of oil consumption.

MISUSE # 4- PICTOGRAPHS

MISUSE # 5- PERCENTAGES

Misleading or unclear percentages are sometimes used. For example, if you take 100% of a quantity, you take it all. 110% of an effort does not make sense.

Loaded Questions

Order of Questions

Refusals

Correlation & Causality

Self Interest Study

Precise Numbers

Partial Pictures

Deliberate Distortions

Other Misuses of Statistics

top related