If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

### Unit 6: Lesson 2

Shape of data distributions

# Clusters, gaps, peaks & outliers

AP.STATS:
UNC‑1 (EU)
,
UNC‑1.H (LO)
,
UNC‑1.H.1 (EK)
,
UNC‑1.H.2 (EK)
,
UNC‑1.H.5 (EK)
,
UNC‑1.H.6 (EK)
CCSS.Math:
Examples looking at different features of distributions, such as clusters, gaps, peaks, and outliers for distributions.

## Want to join the conversation?

• What is an outlier?
What is a range?
What is an interquartile range?
What is mean?
What is median?
What is mode?
What is a lower quartile?
What is an upper quartile?

I have them all mixed and am so confused.
• Outlier - a data value that is way different from the other data.
Range - the Highest number minus the lowest number
Interquarticel range - Q3 minus Q1
Mean- the average of the data (add up all the numbers then divide it by the total number of values that you originally added)
Median - the number in the middle of the data. If the numbers are all in order, whichever number is in the middle
Mode - whichever number there is the most of
Lower Quartile - Q1 - the middle of the bottom half of the data, if you take the median, it's the middle of the data on the right of the median(it's basically the number at the 1st quarter.
Upper Quartile - Q3 - the middle of the data above the median, the value at the 3rd quarter of the data.
• What is cluster? explain please.
• It is data is is clustered like 2 or 3 groups together like if it was 4 - 9 and 6-8 had 3 dots then the cluster would be 6-9
• I still didn't get what is a gap
• a gap is a space in the dada. like when you have a plum that expiyes in two days but then you dont have any plums that expier in 2,3,4,5 days. but then all of a suden you have a plum that expiers in 6 days! big gap or space in the dada or numbers.
• The last example in the video seems to answer this, but clarifying question: it is possible to have outliers without having a gap in the data, correct?
• At . No, because an outlier is a group of data that is much bigger or smaller than the rest of the data and to have an outlier, there must be a gap in the data. A big gap that is like 2 or more gaps from the data set.
• Whats a outlier
• An outlier is a piece of data that is far away from other data.
• In statistics this is a measure of the variation of the data. For example, the range (difference between maximum and minimum values), the mean absolute deviation (average distance between each point and the median), and interquartile range (distance between the lower and upper quartiles).
• outlier is a small set of data separated from all the big clusters? Right?
• It's usually only one data point (I think)
• Can you have multiple peaks?
• I think so, since you can have multiple clusters.
• a few questions about outlier

1.lets say there is are two clusters on the graph with a huge gap in between
would data in one cluster be considered as an outlier wrt another cluster ? or does this not have any outlier at all

2.lets say that this time there is a cluster on one side of the graph . but after the cluster the data points are just low but no gaps . but after a while for one value there is an abnormally high no.of data points
is this consider an outlier ?
(shown below)
.________.____
..._______.____
..._______.____
.......................__

. -> data point
-> space/blank (the comment isnt taking more than one space when given , thats why i used underscore)