If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

Main content

Introduction to residuals

Build a basic understanding of what a residual is. 
We run into a problem in stats when we're trying to fit a line to data points in a scatter plot. The problem is this: It's hard to say for sure which line fits the data best.
For example, imagine three scientists, Andrea, Jeremy, and Brooke, are working with the same data set. If each scientist draws a different line of fit, how do they decide which line is best?
A graph plots points on an x y plane. Points are rising diagonally in a weak scatter between (1 half, 1 half) and (10, 7). Three different colored lines are plotted. The red line passes through (1, 3) and (10 and 1 half, 5 and 1 half). The green line passes through (1, 2) and (10 , 6). The blue line passes through (0, 1 half) and (10 and 1 half, 7 and 1 half). All values are estimated.
If only we had some way to measure how well each line fit each data point...

Residuals to the rescue!

A residual is a measure of how well a line fits an individual data point.
Consider this simple data set with a line of fit drawn through it
A graph plots points on an x y plane. Points are at (1, 2), (2, 8), (4, 3), (6, 7), and (8, 8). A line increases diagonally from the point (0, 3) through the point (10, 8). All values are estimated.
and notice how point (2,8) is 4 units above the line:
A graph plots points on an x y plane. Points are at (1, 2), (2, 8), (4, 3), (6, 7), and (8, 8). A line increases diagonally from the point (0, 3) through the point (10, 8). An green arrow labeled 4 extends vertically from the line up to the point at (2, 8). All values are estimated.
This vertical distance is known as a residual. For data points above the line, the residual is positive, and for data points below the line, the residual is negative.
For example, the residual for the point (4,3) is 2:
A graph plots points on an x y plane. Points are at (1, 2), (2, 8), (4, 3), (6, 7), and (8, 8). A line increases diagonally from the point (0, 3) through the point (10, 8). An green arrow labeled 4 extends up vertically from the line up to the point at (2, 8). A red arrow labeled negative 2 extends down vertically from the line to the point at (4, 3). All values are estimated.
The closer a data point's residual is to 0, the better the fit. In this case, the line fits the point (4,3) better than it fits the point (2,8).

Try to find the remaining residuals yourself

What is the residual of the point (6,7) in the graph above?
  • Your answer should be
  • an integer, like 6
  • a simplified proper fraction, like 3/5
  • a simplified improper fraction, like 7/4
  • a mixed number, like 1 3/4
  • an exact decimal, like 0.75
  • a multiple of pi, like 12 pi or 2/3 pi

What is the residual of the point (8,8) in the graph above?
  • Your answer should be
  • an integer, like 6
  • a simplified proper fraction, like 3/5
  • a simplified improper fraction, like 7/4
  • a mixed number, like 1 3/4
  • an exact decimal, like 0.75
  • a multiple of pi, like 12 pi or 2/3 pi

What is the residual of the point (1,2) in the graph above?
  • Your answer should be
  • an integer, like 6
  • a simplified proper fraction, like 3/5
  • a simplified improper fraction, like 7/4
  • a mixed number, like 1 3/4
  • an exact decimal, like 0.75
  • a multiple of pi, like 12 pi or 2/3 pi

Want to join the conversation?