We will work our way through the concepts of Standard Deviation (SD) today. Let’s take a look at how you calculate standard deviation first:

Ai – The numbers in the list

Aavg – Arithmetic mean of the list

n – Number of numbers in the list

Say you have 3 numbers : 11, 13 and 15. Their standard deviation is the “square root of the average of their squared deviations from the arithmetic mean.” Let’s see what we mean by this.

Mean of 11, 13 and 15 is 13.

Focus on these words: “deviations from mean”

The important point to note is that SD is a measure of dispersion or deviation from the mean (the mean is approximately the middle of the list if there are no outliers). In other words, SD is a measure of whether the numbers are very far away from the mean or close together. Since GMAT isn’t calculation intensive, you probably won’t need to calculate the actual SD in the test. The calculations are shown here only to illustrate the concept. But you must have a feel for how the numbers are distributed around the mean and what that implies for the SD.

Your statistics book explains how to visualize SD using the number line in detail, therefore, I am not going to delve deep into it but will quickly recap so that we can move ahead. Recall that if you plot the numbers on the number line, it gives you a sense of how far the numbers are from the mean. The farther the numbers, higher is the SD.

Let’s check out a few different cases to internalize the SD concept. Do not calculate anything in these questions. Just look at the number line for each case and figure out whether it makes sense to you.

Question: Which set, S or T, has higher SD?

Case 1: S = {3, 3, 3} or T = {0, 10, 20}

Case 2: S = {3, 4, 5} or T = {5, 6, 7}

Case 3: S = {3, 4, 5, 6} or T = {2, 3, 4, 5, 6, 7}

Case 4: S = {1, 3, 5} or T = {1, 1, 3, 5, 5}

Case 5: S = {1, 3, 5} or T = {1, 3, 3, 5}

Case 6: S = {6, 8, 10} or T = {12, 16, 20}

Case 7: S = {6, 8, 10} or T = {3, 4, 5}

Let me represent the first four cases on the number line. Check them out and then think which set should have the higher SD.

Let’s discuss each of these four cases now.

Case 1: S = {3, 3, 3} or T = {0, 10, 20}

T has higher SD. We will obtain the SD of T by calculating as shown in the example above. But we don’t really need to calculate it because we see that for set S, SD = 0. Each number is at the mean and hence has 0 deviation from the mean. Since SD cannot be negative, whatever the SD of T, it will be higher than the SD of S which is 0.

Case 2: S = {3, 4, 5} or T = {5, 6, 7}

Both sets have the same SD. We can see from the number line that they are equally dispersed around their respective means.

Case 3: S = {3, 4, 5, 6} or T = {2, 3, 4, 5, 6, 7}

Set T has higher SD. T has two extra numbers which are farther from the mean. Hence these 2 numbers will add to the total deviation. (There is a caveat here which we will discuss next week.)

Case 4: S = {1, 3, 5} or T = {1, 1, 3, 5, 5}

T has higher SD. It has two extra numbers far from the mean. (There is a caveat here too!)

What do you think about cases 5, 6, and 7? I will give you the answers to these three cases next week!

*Karishma, a Computer Engineer with a keen interest in alternative Mathematical approaches, has mentored students in the continents of Asia, Europe and North America. She teaches the **GMAT** for Veritas Prep and regularly participates in content development projects such as this blog!*

## 4 thoughts on “Quarter Wit, Quarter Wisdom: Dealing with Standard Deviation”

Comments are closed.

awesome post, thanks a bunch. learned a lot today in SD…

Hi Karishma,

Could you please explain the 4th Set in Detail.

Thanks

H

Plot the case 4 on the number line. Set S has 2 elements, 1 and 5, away from the mean.

Set T has 4 elements 1, 1, 5, 5 away from the mean. So possibly, the SD of Set T is higher. I say possibly, because the numerator of the SD increases but the denominator increases as well. The numerator increases much more while the denominator increases to a small extent. So overall, the SD will increase.

If instead, set T’s elements were {1, 2.9, 3, 3.1, 5}, the SD would have decreased. The numerator will increase a little bit while the denominator will increase to a large extent.

The numbers given will always be extreme so that you don’t have any doubts and don’t need to calculate. Check out the next SD post for more on this.

Case 5 : 1,3,5 and 1,3,3,5 sd^2 = variance here mean is 3 in both the cases and variance will be same also but as T have 4 elements vs 3 elements in S, SD of S will be greater than SD of T, as 1,3,5 is separated by difference of 2 while 1,3,3,5 are separated by 2,0,0,2 hence SD of S > SD of T.

Case 6 : 6,8,10 vs 12,16,20 obviously latter has higher SD as SD for S is 2 and SD in T is 4 or like case 3 dependent on constant multiplied or divide.

case 7 : S = 6,8,10 and T is 3,4,5 here SD in S > SD in T because if d =1 in set T it is d=2 in set S, so d in T and 2d in S. As we are multiplying or dividing by a constant that affects Standard Deviation.