If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

Main content

Median, mean and skew from density curves

High level analysis of density curves. A focus on median, mean, left-skew and right-skew.

Want to join the conversation?

  • starky tree style avatar for user Sanjeeth K
    I've heard of terms such as Positively and Negatively Skewed and that's why I watched this video. Can anyone explain what those terms mean?
    (7 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user Anoosheh Kalantari
    Thank you for your great channel! I am puzlled by your definition of mean and median here because essentially they are the same you mentioned:

    Median is the point where the area under the density curve to the left is equal to the area to the right. That is the same meaning for the point representing the mean as when you multiply each point by its probability you get the area under and the balancing point is the point in which areas on both sides balance each other! so for the left-skewed and right-skewed I do not understand how you differentiate between median and mean to be different!
    (2 votes)
    Default Khan Academy avatar avatar for user
    • cacteye blue style avatar for user Jerry Nilsson
      It's true that we can view the mean as the balancing point, but that does not imply that the area to the left of the mean is equal to the area to the right.

      For example, suppose that Archie weighs significantly more than Betty.
      Then, with Archie on one end of a seesaw and Betty on the other end, the balancing point will definitely be closer to Archie but not directly underneath him!

      The center of mass ("median") is not necessarily the same as the center of momentum ("mean").
      (10 votes)
  • leaf green style avatar for user Zhizhong Pu
    at , why we can use the physical method to find the mean?
    (4 votes)
    Default Khan Academy avatar avatar for user
  • blobby green style avatar for user innrmylife
    what is an example of symmetrical distribution?
    (1 vote)
    Default Khan Academy avatar avatar for user
    • primosaur seed style avatar for user Ian Pulizzotto
      An extremely common example of a symmetrical distribution is the normal distribution (bell-shaped curve). So the mean and median of a normal distribution are the same. Since a normal distribution is also symmetric about its highest peak, the mode (as well as the mean and median) are all equal in a normal distribution.

      Have a blessed, wonderful day!
      (5 votes)
  • blobby green style avatar for user Amine Zitoun
    hi how you doing, one question didn't you say that median is the best for getting the central tendency of data but in getting the median from the density curve in the median mislead us in getting the central tendency so when to use media as a way to get the central tendency thank :D
    (2 votes)
    Default Khan Academy avatar avatar for user
  • male robot hal style avatar for user Fernando
    what do the different skews of data look like.
    (2 votes)
    Default Khan Academy avatar avatar for user
    • blobby green style avatar for user daniella
      he different skews of data can be visualized as follows:

      Symmetric Distribution: In a symmetric distribution, the data is evenly distributed around the mean, with equal tails on both sides. The mean and median are equal, and the curve appears balanced.
      Right Skewed (Positive Skew): In a right-skewed distribution, the tail of the curve extends towards higher values, creating a longer tail on the right side. The mean is greater than the median, and the curve appears to have a longer right tail.
      Left Skewed (Negative Skew): In a left-skewed distribution, the tail of the curve extends towards lower values, creating a longer tail on the left side. The mean is less than the median, and the curve appears to have a longer left tail.
      (1 vote)
  • aqualine ultimate style avatar for user YANGXIAORUZOE
    @KhanAcademy, can you please do a video only based on skewed lines plz, thx😆😆😆
    (2 votes)
    Default Khan Academy avatar avatar for user
  • starky ultimate style avatar for user Charizard-Max
    so, does this mean that the MEDIAN has 50%(area on the left) on it's left side and 50%(area on the right) on it's right side?
    (1 vote)
    Default Khan Academy avatar avatar for user
    • leafers sapling style avatar for user green_ninja
      Yes. The last two graphs can seem confusing, since they aren't symmetrical distributions. Try to imagine the graph as a plank with rocks of increasingly heavier weight, the heaviest ones on one side and the lightest ones on the other. The median is a fulcrum you can put under the plank. Where could you put the fulcrum so that the weight on both sides of the plank would be equal?

      In this instance, you would put the fulcrum (median) closer to the heavier rocks, so that there would be enough of the lighter rocks to compensate for the weight difference.

      Hope this helps!
      (3 votes)
  • hopper happy style avatar for user ShaanPatel
    In left and right skewed, I thought the median is not affected by outliers and stays in the bell part. Am I wrong?
    (1 vote)
    Default Khan Academy avatar avatar for user
    • blobby green style avatar for user daniella
      In left and right skewed distributions, the median can still be influenced by outliers, but it is generally less affected compared to the mean. The median represents the middle value when the data is arranged in ascending or descending order, so extreme values (outliers) can have less impact on it compared to the mean, which takes into account the value of each data point. However, if the skewness is extreme or if there are influential outliers, the median may still be shifted away from the center of the bulk of the data.
      (3 votes)
  • blobby green style avatar for user mc200400580
    what can you say of the skewness in each of the following cases?
    Mode=32.1 and Mean=35.4
    median=1459 and Mean=1403
    Median=50 and Mode=50
    (1 vote)
    Default Khan Academy avatar avatar for user
    • leafers sapling style avatar for user green_ninja
      Problem 1: I believe that the mode would be the highest peak in the density graph, since it is the most common number. Since the mean is larger than it (and hence to the "right"), the graph should be right-skewed.

      Problem 2: The graph would be left-skewed since the mean is smaller than the median and hence to the "left".

      Problem 3: Using similar logic as problem 1, the mode is the peak of the density curve. Since the median is the "middle number" and it's equal to the mode, the mode would also be in the middle of the graph. I believe this would result in a symmetric curve.

      Hope this helps!😄
      (1 vote)

Video transcript

- [Instructor] In other videos we introduce ourselves to the idea of a density curve, which is a summary of a distribution, a distribution of data, and in the future we'll also look at things like probability density. But what I want to talk about in this video is think about what we can glean from them, the properties, how we can describe density curves and the distributions they represent. And we have four of them right over here, and the first thing I want to think about is if we can approximate what value would be the middle value or the median for the data set described by these density curves. So just to remind ourselves, if we have a set of numbers and we order them from least to greatest, the median would be the middle value, or the midway between the middle two values. In a case like this, we want to find the value for which half of the values are above that value and half of the values are below. So in looking at a density curve, you'd want to look at the area, and you'd want to say, OK, at what value do we have equal area above and below that value? And so for this one, just eyeballing it, this value right over here would be the median. And in general, if you have a symmetric distribution like this, the median will be right along that line of symmetry. Here we have a slightly more unusual distribution, this would be called a bimodal distribution where you have two major lumps right over here, but it is symmetric. And that point of symmetry is right over here, and so this value, once again, would be the median. Another way to think about it is, the area to the left of that value is equal to the area to the right of that value, making it the median. But what if we're dealing with non-symmetric distributions? Well we'd want to do the same principle. We would want to think, at what value is the area on the right and the area on the left equal? And once again this isn't going to be super exact, but I'm going to try to approximate it. You might be tempted to go right at the top of this lump right over here, but if I were to do that it's pretty clear, even eyeballing it, that the right area right over here is larger than the left area. So that would not be the median. If I move the median a little bit over to the right, this maybe right around here, this looks a lot closer. Once again, I'm approximating it, but it's reasonable to say that the area here looks pretty close to the area right over there. And if that is the case, then this is going to be the median. Similarly, on this one right over here, maybe right over here, and once again I'm just approximating it, but that seems reasonable, that this area is equal to that one, even though this is longer it's much lower, this part of the curve is much higher even though it goes on less to the right. So that's the median for well behaved continuous distributions like this, it's going to be the value for which the area to the left and the area to the right are equal. But what about the mean? Well the mean is, you take each of the possible values and you weight it by their frequencies, you weight it by their frequencies and you add all of that up. And so for symmetric distributions your mean and your median are actually going to be the same. So this is going to be your mean as well, this is going to be your mean as well. If you want to think about it in terms of physics, the mean would be your balancing point, the point at which you would want to put a little fulcrum and you would want to balance the distribution. And so you could put a little fulcrum here and you could imagine that this thing would balance, this thing would balance. And that all comes out of this idea of the weighted average of all of these possible values. What about for these less symmetric distributions? Well let's think about it over here. Where would I have to put the fulcrum, or what does our intuition say if we wanted to balance this? Well, we have equal areas on either side, but when you have this long tail to the right it's going to pull the mean to the right of the median in this case. And so our balance point is probably going to be something closer to that. And once again, this is me approximating it, but this would roughly be our mean. It would sit, in this case, to the right of our median. Let me make it clear, this median is referring to that, the mean is referring to this. In this case, because I have this long tail to the left, it's likely that I would have to balance it out right over here. So the mean would be this value, right over there. And there's actually a term for these non-symmetric distributions where the mean is varying from the median. Distributions like this are referred to as being skewed. And this distribution, where you have the mean to the right of the median, where you have this long tail to the right, this is called right skewed. Now, the technical idea of skewness can get quite complicated, but generally speaking, you can spot it out when you have a long tail on one direction, that's the direction in which it will be skewed, or if the mean is to that direction of the median. So the mean is to the right of the median, so generally speaking, that's going to be a right skewed distribution. So the opposite of that, here the mean is to the left of the median and we have this long tail on the left of our distribution, so generally speaking we will describe these as left skewed distributions.