Main content

## Summarizing spread of distributions

Current time:0:00Total duration:6:12

## Video transcript

- [Instructor] Let's get some practice calculating interquartile ranges and I've taken some exercises from the Khan Academy exercises here. I'm just gonna solve it on my scratch pad. The following data points represent the number of animal crackers
in each kid's lunch box. Sort the data from least to greatest and then find the interquartile
range of the data set and I encourage you to do this
before I take a shot at it. Alright, so let's first sort it and if we were actually doing this on the Khan Academy exercise,
you could just drag these, you could just click and
drag these numbers around to sort 'em but I'll just do it by hand. So let's see, the lowest number
here looks like it's a four. So I have that four
then I have another four and then I have another
four and let's see, are there any fives? No fives but there is a six. So then there is a six
and then there's a seven. There doesn't seem to
be an eight or a nine but then we get to a 10 and then we get to 11, 12. No 13 but then we got 14 and
then finally we have a 15. So the first thing we wanna do is figure out the median here. So the median's the middle number. I have one, two, three, four, five, six, seven, eight, nine numbers so there's going to be
just one middle number. I have an odd number of numbers here and it's going to be the number that has four to the left
and four to the right and that middle number, the
median is going to be 10. Notice I have four to the
left and four to the right and the interquartile range is all about figuring out the difference between the middle of the first half and the middle of the second half. It's a measure of spread, how far apart all of these data points are and so let's figure out the
middle of the first half. So we're gonna ignore the median here and just look at these first four numbers and so out of these first four numbers, since I have an even number of numbers, I'm gonna calculate the median
using the middle two numbers so I'm gonna look at the
middle two numbers here and I'm gonna take their average. So the average of four and six, halfway between four and six is five or you could say four plus six is, four plus six is equal to 10 but then I wanna divide that by two so this is going to be equal to five. So the middle of the first half is five. You can imagine it right over there and then the middle of the second half I'm gonna have to do the same thing. I have four numbers. I'm gonna look at the middle two numbers. The middle two numbers are 12 and 14. The average of 12 and
14 is going to be 13, is going to be 13. If you took 12 plus 14 over two, that's going to be 26 over
two which is equal to 13 but an easier way for numbers like this, you say hey, 13 is right exactly
halfway between 12 and 14. So there you have it. I have the middle of
the first half is five. I have the middle of the second half, 13. To calculate the interquartile range, I just have to find the difference
between these two things. So the interquartile range
for this first example is going to be 13 minus five. The middle of the second half minus the middle of the first half which is going to be equal to eight. Let's do some more of these. This is strangely fun. Find the interquartile range of the data in the dot plot below. Songs on each album in Shane's collection and so let's see what's going on here and like always, I encourage
you to take a shot at it. So this is just representing
the data in a different way but we could write this
again as an ordered list so let's do that. We have one song or we have
one album with seven songs I guess you could say. So we have a seven. We have two albums with nine
songs so we have two nines. Let me write those, we have two nines then we have three 10s. Cross those out. So 10, 10, 10 then we have an 11. We have an 11. We have two 12s, two 12s and then finally, we
have, I used those already and then we have an album with 14 songs. 14. So all I did here is I
wrote this data like this so we could see, okay,
this album has seven songs, this album has nine, this album has nine and the way I wrote it,
it's already in order so I could immediately get, I can immediately start
calculating the median. Let's see, I have one, two, three, four, five, six, seven, eight, nine, 10 numbers. I have an even number of numbers
so to calculate the median, I'm gonna have to look at
the middle two numbers. So the middle two numbers look
like it's these two 10s here because I have four to the left of them and then four to the right of them and so since I'm calculating
the median using two numbers, it's going to be halfway between them. It's going to be the average
of these two numbers. Well, the average of 10 and
10 is just going to be 10. So the median is going to be 10. Median is going to be 10 and in a case like this
where I calculated the median using the middle two numbers, I can now include this
left 10 in the first half and I can include this
right 10 in the second half. So let's do that. So the first half is going
to be those five numbers and then the second half is
going to be these five numbers and it makes sense 'cause
I'm literally just looking at first half it's gonna be five numbers, second half is gonna be five numbers. If I had a true middle number
like the previous example then we ignore that when we look at the first and second half or at least that's the
way that we're doing it in these examples but what's
the median of this first half if we look at these five numbers? Well, if you have five numbers, if you have an odd number of numbers, you're gonna have one middle number and it's going to be the one
that has two on either sides. This has two to the left
and it has two to the right. So the median of the first half, the middle of the first
half is nine right over here and the middle of the second half, I have one, two, three, four, five numbers and this 12 is right in the middle. You have two to the left
and two to the right so the median of the second half is 12. Interquartile range is
just going to be the median of the second half, 12 minus the median of the first half, nine which is going to be equal to three. So if I was doing this
on the actual exercise, I would fill out a three right over there.