Subject

Statistics

Describe data with summary statistics, reason about probability, and work with the normal distribution and regression.

Begin at I if you’re new to Statistics.

Mean, Median, Mode
Three ways to describe the middle of a data set. Mean averages, median picks the middle, mode finds what repeats most.
Begin →
Range and IQR
Two simple measures of spread. Range covers the full extent; IQR covers the middle 50%, robust against outliers.
Begin →
Standard Deviation
Variance measures the average squared deviation from the mean; standard deviation is its square root, in the original units.
Begin →
Five-Number Summary
Describe a data set with five values: min, Q1, median, Q3, max. The foundation of the box plot.
Begin →
Graphical Displays
Read histograms, box plots, dot plots, and stem-and-leaf displays. Each makes a different feature of the data obvious.
Begin →
Distribution Shape
Classify a distribution as symmetric, skewed right, skewed left, or uniform — from a picture or from mean/median.
Begin →
Sampling Methods
Identify the sampling design: simple random, stratified, systematic, cluster, or convenience.
Begin →
Observational vs Experimental Studies
If the researcher assigned the treatment, it's an experiment. If they just measured, it's observational. Only experiments can show cause.
Begin →
Probability Fundamentals
When outcomes are equally likely, probability is a counting problem: favorable outcomes divided by total outcomes. Plus the complement rule.
Begin →
Independent Events
When two events don't affect each other, the probability they both happen is the product of their probabilities. The multiplication rule.
Begin →
Conditional Probability
Probability of A given B already happened. Restrict the sample space to just the B outcomes, then count favorable A within that.
Begin →
Two-Way Tables
Marginal, joint, and conditional probabilities — all read directly from the cells of a two-way table.
Begin →
Permutations and Combinations
Count arrangements (order matters) and selections (order doesn't). Factorial, P(n,k), and C(n,k) are the three core tools.
Begin →
Discrete vs Continuous Variables
Counting? It's discrete. Measuring on a real scale? It's continuous. The litmus test: could you have 4.7 of it?
Begin →
Expected Value
The long-run average of a discrete random variable. Multiply each value by its probability, then add. The key tool for analyzing games and decisions.
Begin →
Normal Distribution and z-scores
The bell curve, described by its mean and standard deviation. A z-score converts any raw value to standard-deviations-from-the-mean.
Begin →
Empirical Rule (68-95-99.7)
Quick percentages for normal data: 68% within 1 SD, 95% within 2 SD, 99.7% within 3 SD. Combined with symmetry, you can read off almost any region.
Begin →
Binomial Probability
Probability of exactly k successes in n independent trials with success probability p. P(X = k) = C(n, k) · p^k · (1-p)^(n-k).
Begin →
Correlation Coefficient
r measures the strength and direction of a linear relationship between two variables, from -1 to 1. r² is the fraction of variance explained.
Begin →
Linear Regression
The line of best fit y = mx + b. Compute slope and intercept from data; use the line to predict y from any x.
Begin →
Scatter Plots and Residuals
Use the regression line to predict y. Residual = actual − predicted. A pattern in residuals signals a poor linear fit.
Begin →
Confidence Intervals and Hypothesis Tests
Turn an estimate into a range with a confidence interval. Test a claim with a z-statistic. The framework underlying all modern inference.
Begin →

Statistics

Mean, Median, Mode

Range and IQR

Standard Deviation

Five-Number Summary

Graphical Displays

Distribution Shape

Sampling Methods

Observational vs Experimental Studies

Probability Fundamentals

Independent Events

Conditional Probability

Two-Way Tables

Permutations and Combinations

Discrete vs Continuous Variables

Expected Value

Normal Distribution and z-scores

Empirical Rule (68-95-99.7)

Binomial Probability

Correlation Coefficient

Linear Regression

Scatter Plots and Residuals

Confidence Intervals and Hypothesis Tests

Exams

Statistics Midterm

Statistics Final Exam