Statistics Formulas
These are fundamental statistics formulas that are crucial for basic statistical analysis:
Descriptive Statistics
Descriptive Statistics
- Mean (Average): µ = Σx / N (for population), x̄ = Σx / n (for sample)
- Median: Middle value in an ordered data set
- Mode: Most frequent value in a data set
- Range: Max(x) - Min(x)
- Variance: σ² = Σ[(x - µ)²] / N (for population), s² = Σ[(x - x̄)²] / (n - 1) (for sample)
- Standard Deviation: σ = √σ² (for population), s = √s² (for sample)
- Probability of an Event: P(E) = Number of favorable outcomes / Total outcomes
- Complement Rule: P(E') = 1 - P(E)
- Addition Rule: P(E or F) = P(E) + P(F) - P(E and F)
- Multiplication Rule: P(E and F) = P(E) * P(F|E)
- Binomial Coefficient (Combinations): C(n, k) = n! / [k!(n-k)!]
- Binomial Probability: P(X=k) = C(n, k) * p^k * (1-p)^(n-k)
- Standard Normal Distribution: Z = (X - µ) / σ
- Empirical Rule: ~68% of data within 1σ, ~95% within 2σ, ~99.7% within 3σ
- Z-score for Hypothesis Test: Z = (x̄ - µ₀) / (σ / √n)
- T-score for Hypothesis Test: T = (x̄ - µ₀) / (s / √n)
- Confidence Interval (Z-distribution): CI = x̄ ± Z * (σ/√n)
- Confidence Interval (T-distribution): CI = x̄ ± T * (s/√n)
- Pearson's Correlation Coefficient: r = Σ[(xi - x̄)(yi - ȳ)] / √[Σ(xi - x̄)² * Σ(yi - ȳ)²]
- Slope of Regression Line: b = r * (Sy / Sx)
- Y-Intercept of Regression Line: a = ȳ - b*x̄
ANOVA (Analysis of Variance)
- Total Sum of Squares: SST = Σ(yi - ȳ)²
- Between-group Sum of Squares: SSB = Σn*(ȳ - ȳ)²
- Within-group Sum of Squares: SSW = Σ(yi - ȳ)²
- F-statistic: F = (SSB / dfb) / (SSW / dfw)