Читать книгу Applied Univariate, Bivariate, and Multivariate Statistics - Daniel J. Denis - Страница 60
2.20.2 t‐Tests for Two Samples
ОглавлениеJust as the t‐test for one sample is a generalization of the z‐test for one sample, for which we use s2 in place of σ2, the t‐test for two independent samples is a generalization of the z‐test for two independent samples. Recall the z‐test for two independent samples:
where and denote the expectations of the sample means and respectively (which are equal to μ1 and μ2).
When we do not know the population variances and , we shall, as before, obtain estimates of them in the form of and . When we do so, because we are using these estimates instead of the actual variances, our new ratio is no longer distributed as z. Just as in the one‐sample case, it is now distributed as t:
on degrees of freedom v = n1 − 1 + n2 − 1 = n1 + n2 − 2.
The formulization of t in (2.6) assumes that n1 = n2. If sample sizes are unequal, then pooling variances is recommended. To pool, we weight the sample variances by their respective sample sizes and obtain the following estimated standard error of the difference in means:
which can also be written as
Notice that the pooled estimate of the variance is nothing more than an averaged weighted sum, each variance being weighted by its respective sample size. This idea of weighting variances as to arrive at a pooled value is not unique to t‐tests. Such a concept forms the very fabric of how MS error is computed in the analysis of variance as we shall see further in Chapter 3 when we discuss the ANOVA procedure in some depth.