Learn what an outlier is and how to find one!
What are outliers in scatter plots?
Scatter plots often have a pattern. We call a data point an outlier if it doesn't fit the pattern.
Consider the scatter plot above, which shows data for students on a backpacking trip. (Each point represents a student.)
Notice how two of the points don't fit the pattern very well. These points have been labeled Brad and Sharon, which are the names of the students they represent.
Sharon could be considered an outlier because she is carrying a much heavier backpack than the pattern predicts.
Brad could be considered an outlier because he is carrying a much lighter backpack than the pattern predicts.
Key idea: There is no special rule that tells us whether or not a point is an outlier in a scatter plot. When doing more advanced statistics, it may become helpful to invent a precise definition of "outlier", but we don't need that yet.
To fully wrap our minds around why certain data points might be considered outliers, let's try a couple of practice problems.
Problem 1: Computer shopping
Michelle was researching different computers to buy for college. She looked up the prices and quality ratings for a sample of computers. Her data is shown in the scatter plot to the right, where each point is a computer.
Michele wants to buy a computer whose quality rating is far higher than the pattern would predict based on its price.
Which of the labeled points represents a computer that Michele wants to buy?
Problem 2: Test scores
Some high school students in the U.S. take a test called the SAT before applying to colleges. The scatter plot to the right shows what percent of each state's college-bound graduates took the SAT in , along with that state's average score on the math section.
The three labeled points could be considered outliers.
Why might these points be considered outliers?
Want to join the conversation?
- I still dont get it. What if there are many outliers? How am I supposed to plot them?(7 votes)
- are you guys having a good day?(7 votes)
- Is there an easier way to do this.(6 votes)
- No, it is not complicated at all, you can tell if it is a Outlier because it is either higher or lower than all the rest of the points. see, easy(6 votes)
- Do scatter plots need to have an outlier?(4 votes)
- A scatterplot would be something that does not confine directly to a line but is scattered around it. It can have exceptions or outliers, where the point is quite far from the general line. but no it does not need to have an outlier to be a scatterplot, It simply cannot confine directly with the line.(4 votes)
- How do you find the outlier in a set of data?(3 votes)
- why? why dose this have to be a thing?(2 votes)
- How many outliers can there be?(2 votes)
- A data set can have any number of outliers. An outlier is simply any number than doesn't closely relate to the rest of the set.(4 votes)
- Can there be more than one outlier?(2 votes)
- Of course! However, if they are next to each other you might have to question if they really are outliers.(2 votes)
- Is there any sort of mathematical tests to determine whether or not a data point is an outlier? For instance, in a paper I am writing I am graphing market capitalization against population growth. How would I mathematically (with a formula) determine that a data point (ie Market Capitalization, Annual Population Growth (perhaps it is 4336.2, 2110)) is an outlier?(2 votes)
- Yes, if you have the IQR, 1st and 3rd Q, or have the ability to calculate these, you can multiply the IQR*1.5 and either add or subtract the product from the 1st and 3rd Q, respectively. Anything below the lower difference or above the upper sum is an outlier.(2 votes)