← Home

Subject

Statistics

Describe data with summary statistics, reason about probability, and work with the normal distribution and regression.

Begin at I if you’re new to Statistics.

  1. Mean, Median, Mode

    Three ways to describe the middle of a data set. Mean averages, median picks the middle, mode finds what repeats most.

    Begin →

  2. Range and IQR

    Two simple measures of spread. Range covers the full extent; IQR covers the middle 50%, robust against outliers.

    Begin →

  3. Standard Deviation

    Variance measures the average squared deviation from the mean; standard deviation is its square root, in the original units.

    Begin →

  4. Five-Number Summary

    Describe a data set with five values: min, Q1, median, Q3, max. The foundation of the box plot.

    Begin →

  5. Graphical Displays

    Read histograms, box plots, dot plots, and stem-and-leaf displays. Each makes a different feature of the data obvious.

    Begin →

  6. Distribution Shape

    Classify a distribution as symmetric, skewed right, skewed left, or uniform — from a picture or from mean/median.

    Begin →

  7. Sampling Methods

    Identify the sampling design: simple random, stratified, systematic, cluster, or convenience.

    Begin →

  8. Observational vs Experimental Studies

    If the researcher assigned the treatment, it's an experiment. If they just measured, it's observational. Only experiments can show cause.

    Begin →

  9. Probability Fundamentals

    When outcomes are equally likely, probability is a counting problem: favorable outcomes divided by total outcomes. Plus the complement rule.

    Begin →

  10. Independent Events

    When two events don't affect each other, the probability they both happen is the product of their probabilities. The multiplication rule.

    Begin →

  11. Conditional Probability

    Probability of A given B already happened. Restrict the sample space to just the B outcomes, then count favorable A within that.

    Begin →

  12. Two-Way Tables

    Marginal, joint, and conditional probabilities — all read directly from the cells of a two-way table.

    Begin →

  13. Permutations and Combinations

    Count arrangements (order matters) and selections (order doesn't). Factorial, P(n,k), and C(n,k) are the three core tools.

    Begin →

  14. Discrete vs Continuous Variables

    Counting? It's discrete. Measuring on a real scale? It's continuous. The litmus test: could you have 4.7 of it?

    Begin →

  15. Expected Value

    The long-run average of a discrete random variable. Multiply each value by its probability, then add. The key tool for analyzing games and decisions.

    Begin →

  16. Normal Distribution and z-scores

    The bell curve, described by its mean and standard deviation. A z-score converts any raw value to standard-deviations-from-the-mean.

    Begin →

  17. Empirical Rule (68-95-99.7)

    Quick percentages for normal data: 68% within 1 SD, 95% within 2 SD, 99.7% within 3 SD. Combined with symmetry, you can read off almost any region.

    Begin →

  18. Binomial Probability

    Probability of exactly k successes in n independent trials with success probability p. P(X = k) = C(n, k) · p^k · (1-p)^(n-k).

    Begin →

  19. Correlation Coefficient

    r measures the strength and direction of a linear relationship between two variables, from -1 to 1. r² is the fraction of variance explained.

    Begin →

  20. Linear Regression

    The line of best fit y = mx + b. Compute slope and intercept from data; use the line to predict y from any x.

    Begin →

  21. Scatter Plots and Residuals

    Use the regression line to predict y. Residual = actual − predicted. A pattern in residuals signals a poor linear fit.

    Begin →

  22. Confidence Intervals and Hypothesis Tests

    Turn an estimate into a range with a confidence interval. Test a claim with a z-statistic. The framework underlying all modern inference.

    Begin →

Exams