Statistics
Scatter Plots and Residuals
Lesson
A scatter plot shows pairs of values as dots. When a roughly linear pattern appears, we fit a line and use it to predict.
Regression line
(read “y-hat”) is the predicted y for a given x.
Residual
Positive residual: actual is ABOVE the line. Negative residual: actual is BELOW the line.
For a least-squares regression line, the residuals always sum to 0.
Worked example 1 — prediction
Line , predict at :
Worked example 2 — residual
Line , data point .
A residual plot tells a story
If residuals are scattered randomly around zero, the linear model is a good fit. If there’s a clear pattern (curve, fan shape), a linear model misses something — try a different model.
Practice
Work through these. Stuck? Click Get a hint.
Warm-Up
Quick problems to get going.
Problem 1
Problem 2
Problem 3
Problem 4
Practice
Standard problems matching the lesson.
Problem 5
Problem 6
yhat=3x, actual (4,15). Residual?
Problem 7
Problem 8
Problem 9
Problem 10
yhat=4x+5, (6,30). Residual?
Problem 11
Problem 12
Problem 13
Problem 14
yhat=3x+1, (3,11). Residual?
Problem 15
Problem 16
yhat=5x+10, (2,25). Residual?
Problem 17
C-hat = 3x + 50. Cost at x=20?
Problem 18
Predicted 110, actual 105. Residual?
Challenge
Harder problems — edge cases, trickier numbers, multiple steps.
Problem 19
Problem 20
yhat=4x+1, (5,24). Residual?
Problem 21
yhat=2x+3, (4,12). Residual?
Problem 22
yhat=0.25x+1, (8,4). Residual?
Problem 23
r=0.9: strong (1) or weak (0)?
Problem 24
Problem 25
C-hat = 4x+20, actual at x=5 is 45. Residual?
Ask the tutor
Stuck on a concept? Want another example? Ask anything about this topic.
Type your own question below, or tap one of the suggestions. The tutor can re-explain the lesson, work through a specific problem with you, generate fresh practice tuned to where you are, or check your reasoning.
Quiz
Test yourself on this topic →
10 questions, no hints. About 5 minutes.