AP Statistics Summer Assignment 2014 Welcome to AP Statistics. This assignment will require internet access. If you do not think you can get to a computer with internet access or otherwise need help from me, please e-mail me at [email protected].
1. Why Statistics?
Write a one page paper explaining why high school students should take a Statistics class.
First, use evidence from the following sources to make your case:
“What is Statistics”, Against All Odds video #1
http://www.learner.org/resources/series65.html
TED talk with Arthur Benjamin entitled, “Teach Statistics before Calculus”,
http://tinyurl.com/nw8uyo
Clive Thompson: “Why we should learn the language of data.”
http://www.wired.com/magazine/2010/04/st_thompson_statistics/
TED talk with Hans Rosling entitled, “The Best Stats You’ve Ever Seen”,
http://www.ted.com/talks/lang/en/hans_rosling_shows_the_best_stats_you_ve_ever_seen.html
Then, write an additional paragraph explaining what you hope to gain from taking a class in
Statistics. What are your reasons for signing up for this class?
Type your write-up and submit via e-mail to [email protected] BEFORE 8/11/14.
2. Summer Reading – DUE 8/11/14 In order to accomplish this, you will choose a book from the list provided to read and report out on. You will be required to submit a typed assignment broken down as follows:
Pre-reading (one page): What does the term statistics mean to you? What impact do statistics have on your life and the world around you? What would you hope to learn from a course on Statistics?
Post-reading (between one & two pages): Write a summary of the book you chose to read. Be sure to address how statistics were involved, and how the study of statistics was defined, dealt with, discussed, etc. Return to your pre-reading assignment and discuss whether your views and ideas concerning statistics were confirmed or challenged. If any of your previous ideas have changed, please elaborate.
Post-viewing (between one & two pages): Choose the movie of the book you chose to read and compare and contrast the similarities and differences from book to film. Again be sure to address how statistics were involved, and how the study of statistics was defined, dealt with, discussed, etc.
Moneyball: The Art of Winning an Unfair Game, by M. Lewis, W. W. Norton & Company, 2003 (tells how the Oakland A’s revamped their baseball team using basic stats) ISBN-10: 0393324818 ISBN-13: 978-0393324815 OR ISBN-10: 0393338398 ISBN-13: 978-0393338393
MOVIE: Moneyball, Perf. Pitt, B., Wright, R., Hill, J. Sony Pictures, 2012 ASIN: B0060ZJ7BC
OR Bringing Down the House: How Six Students Took Vegas for Millions, by Ben Mezrich, Free Press ISBN-10: 0743249992 ISBN-10: 1417665637 ISBN-13: 978-1417665631
MOVIE: 21, Perf. Sturgess, J., Bosworth, K. Sony Pictures, 2008 ASIN: B0018CWW5K
1. Descriptive statistics:
o Mean, median, mode, variance, standard deviation, range, quartile 1, quartile 3,
minimum, and maximum
o For resources on this topic please view the videos on
http://www.khanacademy.org/math/probability/descriptive-statistics
(click on the tabs on the right for more topics)
2. Statistical Displays:
o Box-and-whisker plot, scatter plot, bar graph, histogram, pie graph, and stem and leaf
plot
o For resources on this topic please view the videos on
http://www.khanacademy.org/math/arithmetic/interpreting-data-topic/reading_data/
(click on the tabs on the left for more topics)
3. Elementary probability and logic:
o For resources on this topic please view the videos on
http://www.khanacademy.org/math/trigonometry/prob_comb/combinatorics_precalc/v/p
ermutations
o http://www.khanacademy.org/math/trigonometry/prob_comb/basic_prob_precalc/v/basi
c-probability (Click through the tabs on the left for additional topics).
4. How to use your calculator: You will need either purchase or borrow a graphing calculator that you can use for the entire year. A TI-84 Plus or TI-Nspire is preferred. The manual included with your calculator (TI Manual) is a very good resource to learn how to use the statistical functions on your calculator. You Tube is an excellent resource for tutorials on how to use your calculator. Listed below are a few good You Tube tutorials to help you get started with your calculator.
I. How to use the stat editor to enter a data list, and finding one variable statistics http://www.youtube.com/watch?v=UHpGzGGalx0
II. Entering a data list, and calculating one variable statistics http://www.youtube.com/watch?v=F9GxGW8gchw
III. How to plot a histogram. http://www.youtube.com/watch?v=_SM6JegQmig
IV. How to create a boxplot http://www.youtube.com/watch?v=A85vmvnlQ1w
V. How to create a normal probability plot http://www.youtube.com/watch?v=cZjKdm4TzDo
VI. How to create a scatterplot http://www.youtube.com/watch?v=yNuq-xX8PEI
VII. TI-Nspire interactive tutorials http://www.atomiclearning.com/ti_nspire
Please save
your points if
you purchase
a TI calculator
Upon your return to school on August 11th, I will expect you to turn in your summer reading report and this completed packet. You are expected to complete each part of the problems and to construct all data displays NEATLY. This assignment will be graded for correctness. If you have any questions, please feel free to contact me by e-mail me at [email protected] .
Categorical or Quantitative Data Stats Dictionary: http://stattrek.com/statistics/dictionary.aspx Determine if the variables listed below are quantitative or categorical. 1. Time it takes to get to school ________________________________
2. Number of people under 18 living in a household ________________________________
3. Hair color ________________________________
4. Temperature of a cup of coffee ________________________________
5. Teacher salaries ________________________________
6. Gender________________________________
7. Smoking ________________________________
8. Height ________________________________
9. Amount of oil spilled ________________________________
10. Age of Oscar winners________________________________
11. Type of blood pressure medication ________________________________
12. Jellybean flavors ________________________________
13. Country of origin ________________________________
14. Type of meat ________________________________
15. Number of shoes owned ________________________________
Statistic – what is that?
A statistic is a number calculated from data. Quantitative data has many different statistics that can
be calculated. Determine the given statistics from the data below on the number of home runs Mark
McGuire has hit in each season from 1982 – 2001.
70 52 22 49 3 32 58 39
39 65 42 29 9 32 9 33
Mean
Minimum
Maximum
Q1
Q3
Range
IQR
Create a box and whisker plot for your given data. Be sure to give your boxplot a title and
label your scale.
Example:
Accidental Deaths
In 1997 there were 92,353 deaths from accidents in the United States. Among these were
42,340 deaths from motor vehicle accidents, 11,858 from falls, 10,163 from poisoning, 4051
from drowning, and 3601 from fires. The rest were listed as other causes.
a. Find the percent of accidental deaths from each of the causes, rounded to the nearest
percent.
b. What percent of accidental deaths were from “other” causes?
c. NEATLY create a well labeled bar graph of the distribution of causes of accidental
deaths. Be sure to include an “other” causes bar. Make sure you include appropriate
labels, title and scale.
d. A pie chart is another graphical display used to show all the categories in a
categorical variable relative to each other. Create a pie chart for the accidental death
percentages. You may try using a software or internet source to make one and paste
it in the space below. (Microsoft Excel works well).
It’s a Twista
The data below give the number of hurricanes that happened each year from 1944 through
2000 as reported by Science magazine.
3 2 1 4 3 7 2 3 3 2
5 2 2 4 2 2 6 0 2 5
1 3 1 0 3 2 1 0 1 2
3 2 1 2 2 2 3 1 1 1
3 0 1 3 2 1 2 1 1 0
5 6 1 3 5 3
Make a dot plot to display these data. Make sure you include appropriate labels, title and
scale.
Example
Shopping Spree!
A marketing consultant observed 50 consecutive shoppers at a supermarket. One variable
of interest was how much each shopper spent in the store. Below are the data (rounded to
the nearest dollar), arranged in increasing order.
3 9 9 11 13 14 15 16 17 17
18 18 19 20 20 20 21 22 23 24
25 25 26 26 28 28 28 28 32 35
36 39 39 41 43 44 45 45 47 49
50 53 55 59 61 70 83 86 86 93
Create a stemplot using tens of dollars as the stem and dollars as the leaves.
Be sure you include appropriate labels, title and key.
Example:
Where do older folks live?
This table gives the percentages of residents aged 65 or older in each of the 50 states.
State Percent State Percent State Percent
Alabama
Alaska
Arizona
Arkansas
California
Colorado
Connecticut
Delaware
Florida
Georgia
Hawaii
Idaho
Illinois
Indiana
Iowa
Kansas
Kentucky
13.1
5.5
13.2
14.3
11.1
10.1
14.3
13.0
18.3
9.9
13.3
11.3
12.4
12.5
15.1
13.5
12.5
Louisiana
Maine
Maryland
Massachusetts
Michigan
Minnesota
Mississippi
Missouri
Montana
Nebraska
Nevada
New Hampshire
New Jersey
New Mexico
New York
North Carolina
North Dakota
11.5
14.1
11.5
14.0
12.5
12.3
12.2
13.7
13.3
13.8
11.5
12.0
13.6
11.4
13.3
12.5
14.4
Ohio
Oklahoma
Oregon
Pennsylvania
Rhode Island
South Carolina
South Dakota
Tennessee
Texas
Utah
Vermont
Virginia
Washington
West Virginia
Wisconsin
Wyoming
13.4
13.4
13.2
15.9
15.6
12.2
14.3
12.5
10.1
8.8
12.3
11.3
11.5
15.2
13.2
11.5
Histograms are a way to display groups of quantitative data into bins (the bars). These bins
have the same width and scale and are touching because the number line is continuous. To
make a histogram you must first decide on an appropriate bin width and count how many
observations are in each bin.
The bins for percentage of residents aged 65 or older have been started below for you.
Finish the chart of bin widths and
then create a histogram using those
bins on the grid below. Make sure
you include appropriate
labels, title and scale.
Example:
SSHA Scores
Here are the scores on the Survey of Study Habits and Attitudes (SSHA) for 18 first year college women:
154 109 137 115 152 140 154 178 101
103 126 126 137 165 165 129 200 148
and for 20 first year college men:
108 140 114 91 180 115 126 92 169 146
109 132 75 88 113 151 115 70 187 104
Put the data values in order for each gender. Compute numeral summaries for each gender.
Women Men
Mean Mean
Minimum Minimum
Q1 Q1
Median Median
Q3 Q3
Maximum Maximum
Range Range
IQR IQR
Using the minimum, Q1, Median, Q3, and Maximum from each gender, make parallel boxplots to compare
the distributions. Include your title, labels and scale.
Algebra The prerequisite for AP Statistics is Advanced Algebra. You will not find very much equation solving in this course, but some quick review of Advanced Algebra content will be helpful. Previously you learned about methods of sampling design. For more review, refer to www.stattrek.com “Survey Sampling Methods”. The 7 types of sampling designs are: A. voluntary response B. convenience C. simple random D. stratified E. cluster F. multistage G. systematic The Maryland division of Weight Watchers is doing research to determine how many people on the Weight Watchers diet cheat at least once a week. They decide that anonymous surveys will give them an accurate representation but do not have time to get responses from ALL the Maryland Weight Watchers people. Read the scenarios below and determine which of the 7 sampling methods best describes it. ____ Randomly select 10 members from each of the WW centers in the Maryland division. ____ Use an alphabetical listing of all Maryland division members. Randomly choose a starting
person on the list. Then select every 20th person thereafter. ____ Randomly select 2 or 3 branches of the Maryland division and survey every member of that
center. ____ Send out the survey to every member of the Maryland division. Place drop boxes in each WW
center. Anyone who returns a survey will be in the sample. ____ The Maryland regional office is in Baltimore so they survey members at the WW center in Baltimore. ____ From a numbered list of all Maryland division members use a computer to randomly select 100 numbers and survey all members with those corresponding numbers. What is the population of interest in the above weight watchers scenario?
Here is a formula that is often used in AP Statistics.
If z = 2.5, x = 102 and µ = 100, what is ? Show your work.
If z = - 3.35, x = 60 and = 4, what is µ? Show your work.
Linear Functions 1. The USDA reported that in 1990 each person in the United States consumed an average
of 133 pounds of natural sweeteners. They also claim this amount has decreased by about 0.6 pounds each year.
a. Write a linear equation that relates years since 1990 to the average consumption of natural sweeteners. Define your variables. b. What is the slope and what is the y-intercept?
c. Predict the average consumption of sweeteners per person for the year 2005.
2. The following equation can be used to predict the average height of boys anywhere
between birth and 15 years old: y = 2.79x + 25.64, where x is the age (in years) and y is the height (in inches).
a. What does the slope represent in this problem? Interpret it in context. b. What does the y-intercept represent in this problem? Interpret it in context.
Probability
1. A special lottery is to be held to select the student who will live in the only deluxe room in a dormitory. There are 100 seniors, 150 juniors, and 200 sophomores who applied. Each senior's name is placed in the lottery 3 times; each junior's name, 2 times; and each sophomore's name, 1 time. What is the probability that a senior's name will be chosen?
2. If a coin is tossed twice, what is the probability that on the first toss the coin lands heads and on the second toss the coin lands tails?
3. If a coin is tossed twice what is the probability that it will land either heads both times or tails both times?
4. Calculate the following probabilities and arrange them in order from least to greatest.
The probability that a fair die will produce an even number. ___________
A random digit from 1 to 9 (inclusive) is chosen, with all digits being equally likely. The probability that when it is squared it will end with the digit 1. ___________
The probability that a letter chosen from the alphabet will be a vowel. ___________
A random number between 1 and 20 (inclusive) is chosen.
The probability that its square root will not be an integer. ___________
ORDER: ____________, ______________, ______________, ______________
5. Which of the following has a probability closest to 0.5?
A. The sun will rise tomorrow.
B. It will rain tomorrow.
C. You will see a dog with only three legs when you leave the room.
D. A fair die will come up with a score of 6 four times in a row.
E. There will be a plane crash somewhere in the world within the next five minutes.
ADDITIONAL SUMMER READING – EXTRA CREDIT
There are many good books that deal with deep statistical ideas without getting too technical. Read and type up a one page reflection for extra credit. Your reflection should include a brief summary of the book and a description of how the concepts in the book relate to Statistics class and everyday life.
200% of Nothing, A. K. Dewdney, John Wiley and Sons, 1993.
A Mathematician Reads the Newspaper, J. A. Paulos, Basic Books, 1995.
Aging With Grace: What the Nun Study Teaches Us About Leading Longer, Healthier, and More Meaningful Lives, D. Snowden, Bantom Press, 2002
Beyond Numeracy, J. A. Paulos, Vintage, 1992.
The Black Swan: The Impact of the Highly Improbable, N. Taleb, Thomson, 2007
Calculated Risks: How to Know When Numbers Deceive You, G. Gigerenzer, Simon & Schuster, 2002
The Cartoon Guide to Statistics. L. Gonick & W. Smith. Harper, 1993
Damned Lies and Statistics: Untangling Numbers from the Media, Politicians, and Activists, J. Best, University of California Press, 2001. The Drunkard's Walk: How Randomness Rules Our Lives by Leonard Mlodinow, Pantheon Books, 2008
Envisioning Information. Edward R. Tufte. Graphics Press, 1990
Fooled by Randomness: The Hidden Role of Chance in Life and in the Markets, N. Taleb, Thomson, 2004 Freakonomics A Rogue Economist Explores the Hidden Side of Everything, S. D. Levitt & S. J. Dubner, Harper Collins, 2005. The Ghost Map: The Story of London's Most Terrifying Epidemic—and How It Changed Science, Cities, and the Modern World, S. Johnson
How to Lie with Statistics, D. Huff, W. W. Norton, 1952.
How to Think About Statistics, 6th Edition. J. L. Phillips, Jr., W. H. Freeman, 2000
Innumeracy: Mathematical Illiteracy and Its Consequences, J. A. Paulos, Vintage, 1990.
The Lady Tasting Tea, D. Salsburg, Owl Books, 2001.
Microtrends: The Small Forces Behind Tomorrow's Big Changes, Mark J. Penn, 2007
More Damned Lies and Statistics: How Numbers Confuse Public Issues, J. Best, University of California Press, 2004 News & Numbers: A Guide to Reporting Statistical Claims and Controversies in Health and Other Fields, V. Cohn.
Randomness, D. Bennett, Harvard University Press, 1998
State of Fear, M. Crichton, Avon Books, 2004.
Tainted Truth: The Manipulation of Fact In America, C. Crossen
The Tipping Point: How Little Things Can Make a Big Difference, M. Gladwell, Little, Brown & Co., 2002
The Visual Display of Quantitative Information. Edward R. Tufte. Graphics Press, 2001