AP Test  Statistics
Topics Covered
The AP Statistics course imitates a one semester long noncalculus based college statistics class. The emphasis is placed on conceptual understanding and interpretation rather than complicated arithmetic computations. There are four basic themes. Exploring data analysis (describing patterns and departures from patterns) covers 2030% of the exam. Sampling and experimentation (planning and conducting a study) covers 1015% of the exam. Probability and random variables (producing models using probability and anticipating patterns) covers 2030% of the exam. The last theme, statistical inference (estimating population parameters and testing hypotheses), covers 3040% of the exam. See reference below for a description of content covered in each theme.
Section 
Type 
# of Qs 
% of Final Grade 
Time Limit 

1 
Multiple Choice 
40 
50% 
90 minutes 
2 
Free Response 
6 
0.75*50% 
90 minutes 
Investigative Task 
1* 
0.25*50% 
30 minutes** 
*The investigative task is the 6th freeresponse question, not a separate entity.
**The 30 minutes is a portion of the total time given for Section 2.
The AP Statistics exam is 3 hours long and contains 2 sections – multiple choice questions and free response questions. Each section of the exam is worth 50% of the final exam grade. Section 1 is 90 minutes long and contains 40 multiple choice questions. Section 2 is 90 minutes long as well and contains 6 free response questions, one of which is the investigative task. The first five questions are worth 75% of the grade for section 2. The sixth question is worth 25% of the grade for section 2, so students should allocate more time to it.
Graphing calculators are allowed. The calculator memory will not be cleared, although it can only contain programs, not notes. A formula sheet is provided, seen below.
For more information, refer to the long official handout: http://apcentral.collegeboard.com/apc/public/repository/apstatisticscoursedescription.pdf
In section 1, one point is added for a correct answer. One quarter of a point is deducted for an incorrect answer. The student’s raw score is multiplied by 1.25 to get a maximum of 50 points. In section 2, each of the 6 free response question is scored on a scale from 0 to 4. The questions are scored holistically, so a student’s answer does not have to be perfect. The raw score for questions 15 is multiplied by 1.875 and the raw score for question 6 is multiplied by 3.125. Sum everything together to get the composite score.
Composite Scoring Range 
AP Grade 

60100 
5 
4559 
4 
3244 
3 
2331 
2 
022 
1 
The grade distribution has not changed in recent years. This table shows the percentage of students who received a 1, 2, 3, 4, or 5 over the past 4 years.
Year 
2010 
2011 
2012 
2013


5 
0.128 
0.121 
0.126 
0.122 
4 
0.224 
0.213 
0.202 
0.209 
3 
0.235 
0.251 
0.25 
0.257 
2 
0.182 
0.177 
0.188 
0.181 
1 
0.231 
0.238 
0.234 
0.231 
Mean Grade 
2.84 
2.8 
2.81 
2 
 Youtube Channel ProfRobBob is very helpful in discussing the topics taught in AP Statistics. Link to the playlist: http://www.youtube.com/watch?v=CUuWMwJ1Juw&list=PLC8478000586FA6F9
 Learning through handson projects is extremely helpful.
 The questions are not presented in any order of difficulty, so use your personal preference to differentiate the easy questions and do them first.
 You have about two minutes for each multiplechoice question, 1213 minutes for Questions 15 of the free response section, and 2530 minutes for the investigative task. Avoid spending too much time on any one question.
 Note that the investigative task (Question #6) may contain material you’ve never studied, since the goal of such a question is to see how well you reason statistically.
 Mark down the date of the exam (May 9, 2014).
Link: http://www.education.com/studyhelp/article/tipsexam1/?page=3
 When asked to describe a onevariable data set, always discuss shape, center, and spread.
 Understand how skewness can be used to differentiate between the mean and the median.
 Know how transformations of a data set affect summary statistics (mean, median, mode, interquartile range, standard deviation, skewness, etc).
 “Normal” refers to a specific distribution. Instead of writing “normal,” use “approximately normal” and “bellshaped” instead if you were not given the specific distribution.
 Correlation is not causation. A lack of correlation does not mean that there is no relationship (it might be linear).
 Use residual plot to determine if a linear model is appropriate.
 Interpret the slope and yintercept of a leastsquares regression line
 Read computer regression output.
 Know the definition of a simple random sample (SRS).
 An experiment that uses blocking cannot be a completely randomized design.
 Know the differences between why one uses randomization vs blocking.
 Know what blinding and confounding variables are.
 Know how to create a simulation for a probability problem.
 Know to differentiate between independent events and mutually exclusive events. Know why mutually exclusive events can’t be independent (look at the definitions).
 Find the mean and standard deviation of a discrete random variable.
 Recognize binomial and geometric situations.
 Hypotheses are about parameters, never about statistics.
 Know the four steps of any inference procedure.
 In inference problems, show, not declare, that the conditions necessary to do the procedure are present.
 Know Type I and Type II errors and the power of a test.
 For confidence interval questions, you need three things: justify that the conditions necessary to construct the interval are present, construct the interval, and interpret the interval in context.
 Label your graphs.
SECTION 
TOPIC AREA/ 
TOPICS 

I. 
Exploring Data 
A. Graphics display of distributions of one variable data (dot plot, stem plot, histogram, ogive). 
II. 
Sampling and 
A. Methods of data collection (census, survey, Experiment, observational study). 
III. 
Anticipating Patterns 
A. Probability (relative frequency, law of large numbers, addition and multiplication rules, conditional probability, independence, random variables, simulation, mean and standard deviation of a random variable). 
IV. 
Statistical Inference 
A. Estimation (population parameters, margin of error, point estimators, confidence interval for a proportion, confidence interval for the difference between two proportions, confidence interval for a mean, confidence interval for the difference between two means, confidence interval for the slope of a leastsquares regression line). 
 In the scatterplot of y versus x shown above, the least squares regression line is superimposed on the plot. Which of the following points has the largest residual?
 A
 B
 C
 D
 E
 A
 candy company claims that 10 percent of its candies are blue. A random sample of 200 of these candies is taken, and 16 are found to be blue. Which of the following tests would be most appropriate for establishing whether the candy company needs to change its claim?
 Matched pairs ttest
 One Sample proportion ztest
 Twosample ttest
 Twosample Proportion ztest
 Chisquare test of association
 Matched pairs ttest
 In a test of H0: µ=8, a sample of size 220 leads to a pvalue of 0.034. Which of the following must be true?
 A 95% confidence interval for µ calculated from these data will not include µ=8
 At the 5% level if H0 is rejected, the probability of a Type II error is 0.034
 The 95% confidence interval for µ calculated from these data will be centered at µ=8
 The null hypothesis should be rejected at the 5% level
 The sample size is insufficient to draw a conclusion with 95% confidence interval
 A 95% confidence interval for µ calculated from these data will not include µ=8
 Courtney has constructed a cricket out of paper and rubber bands. According to the insturctions for making the cricket, when it jumps it will land on its feet half of the time and on its back the other half of the time. In the first 50 jumps, Courtney’s cricket landed on its feet 35 times. In the next 10 jumps, it landed on its feet only twice. Based on this experience, Courtney can conclude that
 The cricket was due to land on its feet less than half the time during the final 10 jumps, since it had landed too often on its feet during the first 50 jumps.
 A confidence interval for estimating the cricket’s true probability of landing on its feet is wider after the final 10 jumps than it was before the final 10 jumps
 A confidence interval for estimating the cricket’s true probability of landing on its feet after the final 10 jumps is exactly the same as it was before the final 10 jumps
 A confidence interval for estimating the cricket’s true probability of landing on its feet is more narrow after the final 10 jumps than it was before the final 10 jumps
 A confidence interval for estimating the cricket’s true probability of landing on its feet based on the initial 50 jumps does not include 0.2, so there must be a defect in the cricket’s construction to account for the poor showing in the final 10 jumps.
 The cricket was due to land on its feet less than half the time during the final 10 jumps, since it had landed too often on its feet during the first 50 jumps.
 Link to Free Response Questions Tests 19982013: http://apcentral.collegeboard.com/apc/members/exam/exam_information/8357.html
Each full carton of Grade A eggs consists of 1 randomly selected empty cardboard container and 12 randomly selected eggs. The weights of such full cartons are approximately normally distributed with a mean of 840 grams and a standard deviation of 7.9 grams.
 What is the probability that a randomly selected full carton of Grade A eggs will weigh more than 850 grams?
 What is the probability that a randomly selected full carton of Grade A eggs will weigh more than 850 grams?
 The weights of the empty cardboard containers have a mean of 20 grams and a standard deviation of
1.7 grams. It is reasonable to assume independence between the weights of the empty cardboard containers and the weights of the eggs. It is also reasonable to assume independence among the weights of the 12 eggs that are randomly selected for a full carton .
Let the random variable X be the weight of a single randomly selected Grade A egg.
 What is the mean of X?
 What is the mean of X?
 What is the standard deviation of X?
 Tropical storms in the Pacific Ocean with sustained winds that exceed 74 miles per hour are called typhoons. Graph A below displays the number of recorded typhoons in two regions of the Pacific Ocean—the Eastern Pacific and the Western Pacific—for the years from 1997 to 2010.
 Compare the distributions of yearly frequencies of typhoons for the two regions of the Pacific Ocean for the years from 1997 to 2010
 For each region, describe how the yearly frequencies changed over the time period from 1997 to 2010.
 A moving average for data collected at regular time increments is the average of data values for two or more consecutive increments. The 4year moving averages for the typhoon data are provided in the table below. For example, the Eastern Pacific 4year moving average for 2000 is the average of 22, 16, 15, and 21, which is equal to 18.50. Show how to calculate the 4year moving average for the year 2010 in the Western Pacific. Write your value in the appropriate place in the table.
YearNumber of
Typhoons in the
Eastern PacificEastern Pacifi
4year moving averageNumber of
Typhoons in the
Western PacificWestern Pacific
4year moving
average19972233199816271999153620002118.503733.2520011917.753734.2520021918.503937.2520031719.003035.7520041718.003435.0020051717.502632.2520062519.003431.0020071919.502830.5020082020.252728.7520092321.752829.2520101820.0018
 Graph B below shows both yearly frequencies (connected by dashed lines) and the respective 4year moving averages (connected by solid lines). Use your answer in part c to complete the graph.
 Consider graph B
 What information is more apparent from the plots of the 4year moving averages than from the plots of the yearly frequencies of typhoons?
 What information is more apparent from the plots of the 4year moving averages than from the plots of the yearly frequencies of typhoons?
 hat information is less apparent from the plots of the 4year moving averages than from the plots of the yearly frequencies of typhoons?
 Compare the distributions of yearly frequencies of typhoons for the two regions of the Pacific Ocean for the years from 1997 to 2010
Sample Questions (Multiple Choice)
1. a. A
2. b. One Sample proportion ztest
3. a. A 95% confidence interval for µ calculated from these data will not include µ=8
4. d. A confidence interval for estimating the cricket’s true probability of landing on its feet is more narrow after the final 10 jumps than it was before the final 10 jumps
Sample Questions (Free Response)
a
Let W denote the weight of a randomly selected full carton of eggs. W~N(840, 7.92).
The zscore for a weight of 850 grams is z = (850840)/(7.9) = 1.27
The ztable shows that P[W>850] = P[Z>1.27] = 1P[Z<1.27] = 10.8980 = 0.1020
b
Let W represent the weight of a randomly selected full carton of eggs, P the weight of the packaging, and Xi the weight of the ith egg, for I = 1, 2,…, 12.
Note that W = P + X1 + X2 +…+ X12
E(W) = E(P) + E(X1) +…+ E(X12) by the linearity property of expectations.
Since X1 = X2 =…= X12, E(W) = E(P) + 12*E(Xi)
Given that E(W)=840 and E(P)=20, so 840 = 20 + 12*E(Xi) → E(Xi) = 68.33
c
Because of independence, Var(W) = Var(P) + Var(X1) + Var(X2) +…+ Var(X12).
Since Var(X1) =…= Var(X12), given SD(W) = 7.9 and SD(P) = 1.7 →Var(W) = 7.92 and Var(P) =1.72.
→7.92 = 1.72 +12*Var(Xi) →Var(Xi) = 4.96 → SD(Xi) = √(4.96) = 2.23
Sample Question (Investigative Task)
1a
The Western Pacific Ocean had more typhoons than the Eastern Pacific Ocean in all but one of these years. The average seems to have been about 31 typhoons per year in the Western Pacific Ocean, which is higher than the average of about 19 typhoons per year in the Eastern Pacific Ocean. The Western Pacific Ocean also saw more variability (in number of typhoons per year) than the Eastern Pacific Ocean; for example, the range of the frequencies for the Western Pacific is about 21 typhoons and only 10 typhoons for the Eastern Pacific.
1b
The Western Pacific Ocean had a decreasing trend in number of typhoons per year over this time period, especially from about 2001 through 2010. In contrast, the Eastern Pacific Ocean was fairly consistent in the number of typhoons per year over this time period, with a slight increasing trend in the later years from 2005 through 2010.
1c
The four year moving average for the year 2010 in the Western Pacific Ocean is: (28+27+28+18)/4 = 25.25.
The values written in the table is as follows:
2008  20  20.25  27  28.75 

2009  23  21.75  28  29.25 
2010  18  20  18  25.25 
1d
1e.i
The overall trends across this time period were more apparent with the moving averages than with the original frequencies. The moving averages reduce variability, making more apparent the overall decreasing trend in number of typhoons in the Western Pacific Ocean and the slight increasing trend in the number of typhoons in the Eastern Pacific Ocean.
1e.ii
The yeartoyear variability in number of typhoons is less apparent with the moving averages than with the original frequencies.