Mastering Fundamental Probability Distributions: A Complete Guide with 30 Practical Examples
Probability distributions form the backbone of data science and statistical analysis. This guide walks you through ten essential distributions using 30 real-world problems that demonstrate when and how to apply each one.
The Essential Distributions
Bernoulli Distribution: Single Trial, Two Outcomes
When to use: One trial with success/failure outcomes
A customer visits your e-commerce site with a 3% purchase probability. The Bernoulli distribution models this single trial where X = 1 (purchase) with probability 0.03, or X = 0 (no purchase) with probability 0.97.
Key formulas:
- Expected value: E(X) = p
- Variance: Var(X) = p(1-p)
Example: Basketball player attempts one free throw with 75% success rate. The probability of success equals 0.75, and the expected value also equals 0.75.
Poisson Distribution: Counting Events Over Time
When to use: Counting random events in fixed intervals
A call center receives 15 calls per hour. To find the probability of exactly 3 calls in 15 minutes, first scale the rate: λ = 15 × (15/60) = 3.75 calls per 15 minutes.
Using P(X = k) = (λ^k × e^(-λ))/k!, we get: P(X = 3) = (3.75³ × e^(-3.75))/3! ≈ 0.2066 or 21%
Applications: Website page views, typos per page, customer arrivals
Normal Distribution: The Bell Curve
When to use: Continuous data clustering around a mean
Male heights follow a normal distribution with mean 175 cm and standard deviation 6 cm. To find the probability a male is between 170-185 cm:
- Calculate Z-scores: Z₁ = (170-175)/6 = -0.83, Z₂ = (185-175)/6 = 1.67
- Find P(-0.83 < Z < 1.67) = Φ(1.67) - Φ(-0.83) = 0.9525 - 0.2033 = 74.92%
Binomial Distribution: Fixed Trials, Counting Successes
When to use: Fixed number of independent trials with constant success probability
Flipping a fair coin 10 times, the probability of exactly 7 heads: P(X = 7) = C(10,7) × 0.5⁷ × 0.5³ = 120 × 0.5¹⁰ ≈ 0.1172 or 11.72%
Marketing example: Email campaign sent to 20 subscribers with 25% open rate. Expected opens = 20 × 0.25 = 5 subscribers.
Exponential Distribution: Waiting Times
When to use: Time between events in a Poisson process
If calls arrive at 15 per hour, the probability the next call comes within 5 minutes:
- Convert rate: λ = 15/60 = 0.25 calls per minute
- P(X < 5) = 1 - e^(-0.25×5) = 1 - e^(-1.25) ≈ 0.7135 or 71.35%
Key property: Memoryless - past waiting time doesn’t affect future probability.
Geometric Distribution: Trials Until First Success
When to use: Counting trials needed for first success
A salesperson with 15% close rate expects their first sale on which call? Expected value = 1/p = 1/0.15 ≈ 6.67 calls
Probability first sale occurs on call 3: P(X = 3) = (1-p)^(k-1) × p = 0.85² × 0.15 ≈ 0.1084 or 10.84%
Uniform Distribution: Equal Probabilities
When to use: All outcomes equally likely
Discrete: Rolling a die - each number has probability 1/6 Continuous: Random number generator between 0 and 1
Probability of rolling 2, 3, or 4 on a die = 3/6 = 50%
Lognormal Distribution: Right-Skewed Positive Data
When to use: Data that grows multiplicatively (income, stock prices, organism sizes)
If ln(income) ~ Normal(11.2, 0.7²), find percentage earning over $150,000:
- Transform: ln(150,000) ≈ 11.918
- Z-score: (11.918 - 11.2)/0.7 ≈ 1.026
- P(Z > 1.026) = 1 - 0.8474 = 15.26%
Negative Binomial Distribution: Trials Until r Successes
When to use: Counting trials needed for a fixed number of successes
Cold caller with 10% success rate needs 3 sales. Expected calls = r/p = 3/0.1 = 30 calls.
Multinomial Distribution: Multiple Categories
When to use: Fixed trials with more than two outcomes
Restaurant customers order coffee (60%), tea (30%), or juice (10%). With 10 customers, probability of exactly 6 coffee, 3 tea, 1 juice:
P = (10!)/(6!×3!×1!) × 0.6⁶ × 0.3³ × 0.1¹ ≈ 0.1058 or 10.58%
Choosing the Right Distribution
Ask these questions:
- How many trials? (One = Bernoulli, Fixed = Binomial/Multinomial, Until success = Geometric)
- What are you counting? (Events = Poisson, Time = Exponential, Successes = Binomial)
- How many outcomes? (Two = Bernoulli/Binomial, Multiple = Multinomial/Uniform)
- What’s the data shape? (Bell curve = Normal, Right-skewed = Lognormal)
Common Applications by Industry
E-commerce: Bernoulli (conversions), Poisson (page views), Exponential (session duration) Finance: Lognormal (stock prices), Normal (returns), Poisson (trades) Manufacturing: Binomial (quality control), Exponential (machine lifetime) Healthcare: Binomial (treatment success), Geometric (time to recovery)
Next Steps
Master these distributions by practicing with your own data. Start with simple cases like coin flips or customer conversions, then progress to complex scenarios combining multiple distributions. Understanding when to apply each distribution transforms raw data into actionable insights.