If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

Main content

Concept check: Standard deviation

Six questions that will help you understand standard deviation more deeply.

Introduction

The questions below are designed to help you think deeply about standard deviation and its formula.
Unlike most questions on Khan Academy, some of these questions aren't graded by a computer. You'll learn the most if you try answering each question yourself before clicking "explain".

The formula (for reference)

The formula for standard deviation (SD) is
SD=|xx¯|2n
where means "sum of", x is a value in the data set, x¯ is the mean of the data set, and n is the number of values in the data set.

Part 1

Consider the simple data set {1,4,7,2,6}.
How does the standard deviation change when 7 is replaced with 12?
Choose 1 answer:

How can we see this in the formula for standard deviation?

Part 2

Is it possible to create a data set with 4 data points that has a standard deviation of 0?
Choose 1 answer:

If it is possible, do it! Can you create two different data sets? How about three?

Part 3

Can standard deviation be negative?
Choose 1 answer:

Why or why not?
Hint: Think about the formula.

Part 4

Standard deviation is a measure of spread of a data distribution.
What do you think deviation means?

Part 5

Here are the formulas for standard deviation (SD) and the formula for mean absolute deviation (MAD), both of which are measures of spread:
SD=|xx¯|2n
MAD=|xx¯|n
What are the similarities between the formulas? What are the differences?

Part 6

Here's the formula that we've been using to calculate standard deviation:
|xx¯|2n
Here's the formula that statisticians actually use:
(xx¯)2n
Are the two formulas equivalent?
Choose 1 answer:

Want to join the conversation?

  • leafers ultimate style avatar for user Spencer Black
    I agree with snowball1984...what makes this the preferred measure? It seems that there could also be another way to calculate deviation based on any way to calculate mean. Could we say that there is:
    1) a linear mean deviation, [or mean absolute deviation,]
    2) a standard deviation, [using the squared differences.]
    3) a geometric deviation, [using the cubed differences,]
    4) a harmonic deviation, [using a harmonic mean...
    (23 votes)
    Default Khan Academy avatar avatar for user
  • aqualine ultimate style avatar for user Tom
    It was said that a sample variance is calculated by dividing by n-1. Why is the sample standard deviation here not dividing by n-1?
    (11 votes)
    Default Khan Academy avatar avatar for user
    • leaf blue style avatar for user Dr C
      It should be.

      That being said, in this case, it doesn't matter so much, as Sal is not asking about how the SD will behave when changing a number. So, we're really just interested in whether the numerator is increasing or decreasing, dividing by n or by n-1 is only scaling that numerator. So, yes, the formula should have an n-1 in the denominator, but having just n there will not change the answer
      (6 votes)
  • blobby green style avatar for user Jason Farrell
    It appears this and the previous module are not using the correct variance equation. They should use the population variance equation and not the sample variance equation. N and mu, not n and x-bar.
    (11 votes)
    Default Khan Academy avatar avatar for user
  • female robot grace style avatar for user pierdolonymistrzmatematyki
    No way, dude. That's crazy. 💀💀💀💀
    (4 votes)
    Default Khan Academy avatar avatar for user
  • leaf green style avatar for user Jasmine Corona
    when using the formula of standard deviation that statitians use how do you know what the sign (negative or positive the answer to the devation is going to be?
    (2 votes)
    Default Khan Academy avatar avatar for user
  • mr pink orange style avatar for user SkulldyvanKhan
    Does it make sense to calculate the standard deviation of a collection of complex numbers? That final question in Part 6 makes me wonder.
    (3 votes)
    Default Khan Academy avatar avatar for user
  • purple pi purple style avatar for user The first integral proponent
    So the formula for MAD is (∑|x-mean|)/n ? Why not x_i ? I mean, the ith value in the data set.
    (2 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user Anna Cherdakova
    I am a bit confused: is SD an average of how far the point of data are from each other or from the mean? if second - what is the formula for "how far the points of data are from each other on average"? thank you!
    (2 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user ilya112358
    Part 3
    Can standard deviation be negative?
    Hint: Think about the formula.

    I thought about the formula and came up with the following.

    Since symbol √ always means non-negative square root then SD cannot be negative.

    The content of the Explain section then is irrelevant. In fact, it explains why variance cannot be negative which is a different question after all.

    Correct me if I am wrong.
    (2 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user Cullen ONeill
    I don't think the explanation in the section 'No, standard deviation cannot be negative!' is useful. It explains very well why what is under the square root sign cannot be negative, but that is not the same thing as the StdDev itself not being negative (just means it is not imaginary I think). I am of the impression that the reason the StdDev will not be negative is because we select the principal (non-negative) square root. Am I missing something, or am I correct in thinking that this section is not quite showing the right thing?
    (2 votes)
    Default Khan Academy avatar avatar for user