Probability Playground: The T-Distribution

Student's t-distribution is a sampling distribution derived from the normal distribution. Suppose we take a random sample of size n from a normal distribution with mean μ and unknown variance. The sample mean and variance are X̅ and S². The distribution of (X̅ − μ)/(S/√n) is a t-distribution with ν = n − 1 degrees of freedom.

The function Γ(s) in the pdf denotes the gamma function. The function B(a, b) in the cdf denotes the beta function, while the function B(x; a, b) in the cdf denotes the incomplete beta function.

Parameter	Range	Description
ν	ν > 0	Degrees of freedom

Probability Density Function

f (x; ν) = \frac{Γ (\frac{ν + 1}{2})}{Γ (\frac{ν}{2})} \frac{1}{\sqrt{(ν π)}} \frac{1}{{(1 + \frac{x^{2}}{ν})}^{(ν + 1) / 2}}

F (x; ν) = 1 - \frac{1}{2} \frac{B (\frac{ν}{x^{2} + ν}; \frac{ν}{2}, \frac{1}{2})}{B (\frac{ν}{2}, \frac{1}{2})}

Support

- \infty < x < \infty

Mean

Variance

Example	ν
The length of human pregnancies is normally distributed with mean 266 days. A random sample of four new mothers found a mean pregnancy length of Y̅ with sample variance S². Then X = (Y̅ − 266)/(S/2) is a t(3) random variable.	3.000
Y₁, Y₂, ..., Y₉ is a sample taken from a normal(0, 4) distribution, with sample mean Y̅ and sample variance S². Then X = Y̅/(S/3) is a t(8) random variable.	8.000
100 students take the math SAT, which has a normal distribution with a mean score of 500. Their scores have mean Y̅ and variance S². Then X = (Y̅ − 500)/(S/10) is a t(99) random variable.	99.00

X ∼ T(ν)

ν =

E(X) = , Var(X) =

Student's t-distribution can be used to estimate a confidence interval for the true population mean μ of a normally distributed population based on a sample mean X̅ and sample standard deviation S.

Since X̅ and S are independent, the t-distribution is constructed as the ratio of two independent random variables. The mean and variance of such ratio distributions is often undefined. In this case, the mean is only defined for ν > 1, and the variance is only defined for ν > 2.

Student's t-distribution can be used to estimate a confidence interval for the true population mean μ of a normally distributed population based on a sample mean X̅ and sample standard deviation S.

The illustration above shows a sample of n points chosen independently and at random from a normal(μ, σ²) distribution. The random variable X = (X̅ − μ)/(S/√n) has a t(n − 1) distribution, where X̅ is the sample mean and S is the sample standard deviation.

The simulation above shows a sample of n points (marked red) chosen independently and at random from a normal(μ, σ²) distribution. The light blue line shows the value of X = (X̅ − μ)/(S/√n), where X̅ is the sample mean and S is the sample standard deviation. The random variable X has a t(n − 1) distribution. The histogram accumulates the results of each simulation.

Y = ν/(ν + X²) ∼ Beta(ν/2, 1/2) X² ∼ F(1, ν) T(1) ∼ Standard Cauchy lim_{ν→ ∞} X ∼ Standard Normal

E(Y) = , Var(Y) =

Proof

X ∼ T(ν)

Y = ν/(ν + X²) ∼ Beta(ν/2, 1/2) X² ∼ F(1, ν) T(1) ∼ Standard Cauchy limν→ ∞ X ∼ Standard Normal

Y = ν/(ν + X²) ∼ Beta(ν/2, 1/2) X² ∼ F(1, ν) T(1) ∼ Standard Cauchy lim_{ν→ ∞} X ∼ Standard Normal