Statistics Using Excel Succinctly®
by Charles Zaiontz

CHAPTER 9

Analysis of Variance

Introduction

Essentially, ANOVA extends two-sample hypothesis testing for comparing means (when variances are unknown) to more than two samples.

We start with the one factor case. We will define the concept of factor later, but for now we simply view this type of analysis as an extension of the two independent sample t test with equal population variances.

One-way ANOVA

One-way ANOVA Example

A food company wants to determine whether any of three new formulas for peanut butter is significantly tastier than its old formula. The company creates four random samples of ten people each, one for each type of peanut butter, and asks the people in each sample to taste the peanut butter for that sample. All forty people then fill out a questionnaire rating the tastiness of the peanut butter they tried.

Based on the data in Figure 48, determine whether there is a significant difference between the four types of peanut butter.

Figure 48: Sample data

The null hypothesis for this example is that any difference between the four types of peanut butter is due to chance, i.e.

H₀: μ₁=μ₂=μ₃=μ₄

Basic Concepts

Before we proceed with the analysis for this example, we review a few basic concepts.

Suppose we have k samples, which we will call groups (or treatments); these are the columns in our analysis (corresponding to the four types of peanut butter in the example). We will use the index j for these. Each group consists of a sample of size n_j. The sample elements are the rows in the analysis. We will use the index i for these.

Suppose the jth group sample is $\{x_{1j}, \ldots, x_{n_j j}\}$, and so the total sample consists of all the elements $\{x_{ij} : 1 \le i \le n_j,\ 1 \le j \le k\}$. We will use the abbreviation $\bar{x}_j$ for the mean of the jth group sample (called the group mean) and $\bar{x}$ for the mean of the total sample (called the total or grand mean).

Let the sum of squares for the jth group be $SS_j = \sum_i (x_{ij} - \bar{x}_j)^2$. We now define the following terms:

$SS_T = \sum_j \sum_i (x_{ij} - \bar{x})^2$

$SS_B = \sum_j n_j (\bar{x}_j - \bar{x})^2$

$SS_W = \sum_j SS_j = \sum_j \sum_i (x_{ij} - \bar{x}_j)^2$

SS_T is the sum of squares for the total sample, i.e. the sum of the squared deviations from the grand mean. SS_W is the sum of squares within the groups, i.e. the sum over all groups of the squared deviations of each observation from its group mean. SS_B is the sum of squares between the group sample means, i.e. the weighted sum of the squared deviations of the group means from the grand mean.

We also define the following degrees of freedom, where $n = \sum_{j=1}^{k} n_j$:

$df_T = n - 1$, $df_B = k - 1$, $df_W = \sum_{j=1}^{k} (n_j - 1) = n - k$

Finally we define the mean square as $MS = SS/df$, and so

$MS_T = SS_T / df_T$, $MS_B = SS_B / df_B$, $MS_W = SS_W / df_W$

We summarize these terms in the following table.

Table 11: Summary of ANOVA terms

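	df	SS	MS
T	$n - 1$	$\sum_j \sum_i (x_{ij} - \bar{x})^2$	$SS_T / df_T$
B	$k - 1$	$\sum_j n_j (\bar{x}_j - \bar{x})^2$	$SS_B / df_B$
W	$n - k$	$\sum_j \sum_i (x_{ij} - \bar{x}_j)^2$	$SS_W / df_W$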
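The quantities in Table 11 can be computed directly from the definitions. The following is a minimal Python sketch, not the Excel workflow used in this chapter; the function name `anova_table` and the ratings are hypothetical (they are not the data from Figure 48).

```python
from statistics import fmean

def anova_table(groups):
    """Return df, SS and MS for the total (T), between (B) and within (W) rows."""
    k = len(groups)                                # number of groups
    n = sum(len(g) for g in groups)                # total sample size
    grand_mean = fmean([x for g in groups for x in g])
    group_means = [fmean(g) for g in groups]

    # Squared deviations of every observation from the grand mean.
    ss_t = sum((x - grand_mean) ** 2 for g in groups for x in g)
    # Squared deviations of the group means from the grand mean, weighted by n_j.
    ss_b = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means))
    # Squared deviations of every observation from its own group mean.
    ss_w = sum((x - m) ** 2 for g, m in zip(groups, group_means) for x in g)

    rows = {"T": (n - 1, ss_t), "B": (k - 1, ss_b), "W": (n - k, ss_w)}
    return {name: {"df": df, "SS": ss, "MS": ss / df} for name, (df, ss) in rows.items()}

# Hypothetical ratings for three groups of unequal size (illustration only).
groups = [[4, 5, 6], [7, 8, 9], [1, 2, 3, 4]]
table = anova_table(groups)
for name, row in table.items():
    print(name, row["df"], round(row["SS"], 4), round(row["MS"], 4))
```

Each printed line corresponds to one row of the table: the row label, then df, SS and MS.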

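Clearly MS_T is the variance for the total sample. MS_W is the weighted average of the group sample variances, using the group degrees of freedom n_j – 1 as weights. MS_B measures the variation between the groups: it is the weighted sum of the squared deviations of the group means from the grand mean, divided by k – 1. When all the groups have the same size m, MS_B is m times the sample variance of the group means.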

It is also not hard to show that

SS_T=SS_W+SS_B

df_T=df_W+df_B
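One way to see both identities is to split each deviation from the grand mean into a within-group part and a between-group part. The derivation below is a sketch using only the definitions given earlier in this section.

```latex
\begin{align*}
SS_T &= \sum_j \sum_i (x_{ij} - \bar{x})^2
      = \sum_j \sum_i \bigl[(x_{ij} - \bar{x}_j) + (\bar{x}_j - \bar{x})\bigr]^2 \\
     &= \sum_j \sum_i (x_{ij} - \bar{x}_j)^2
      + 2 \sum_j (\bar{x}_j - \bar{x}) \sum_i (x_{ij} - \bar{x}_j)
      + \sum_j n_j (\bar{x}_j - \bar{x})^2 \\
     &= SS_W + 0 + SS_B, \\
df_W + df_B &= (n - k) + (k - 1) = n - 1 = df_T .
\end{align*}
```

The cross term vanishes because the deviations within each group sum to zero, i.e. $\sum_i (x_{ij} - \bar{x}_j) = 0$ for every j.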

It turns out that if the null hypothesis is true, then MS_W and MS_B are both measures of the same error. Thus the null hypothesis becomes equivalent to the hypothesis that the population versions of these statistics are equal, i.e.

σ_B= σ_W

We can therefore use the F-test described in chapter 8 to determine whether or not to reject the null hypothesis. This means that if the $x_{ij}$ are independently and normally distributed, all the $\mu_j$ are equal (null hypothesis) and all the $\sigma_j^2$ are equal (homogeneity of variances), then the test statistic

$F = MS_B / MS_W$

has an F distribution with $df_B$, $df_W$ degrees of freedom.

Analysis

To carry out the analysis for the example, we will use the Anova: Single Factor data analysis tool. To access this tool, as usual, press Data > Analysis|Data Analysis and fill in the dialog box that appears as in Figure 49.

Figure 49: Dialog box for Anova: Single Factor data analysis tool

The output is as shown in Figure 50.

Figure 50: Anova: Single Factor data analysis

All the fields in Figure 50 are calculated as described previously. We see that the test statistic is F = 3.206928. Since p-value = F.DIST.RT(3.206928, 3, 36) = 0.034463 < .05 = α, we reject the null hypothesis and conclude that there is a significant difference between the evaluations of the four types of peanut butter.
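The p-value is the right-tail probability of the F distribution, and it can be cross-checked outside Excel. The sketch below uses only the Python standard library; the function name `f_right_tail` is hypothetical, and the numerical method (Simpson's rule applied to the incomplete beta integral) is a convenience chosen for this illustration, not necessarily how Excel computes the value.

```python
import math

def f_right_tail(f_stat, df1, df2, steps=2000):
    """Approximate P(F > f_stat) for an F(df1, df2) distribution.

    Uses the identity P(F > x) = I_z(df2/2, df1/2) with z = df2 / (df2 + df1*x),
    where I_z is the regularized incomplete beta function. The integral is
    evaluated with composite Simpson's rule.
    Assumptions: f_stat > 0, df2 >= 2, and steps is even.
    """
    a, b = df2 / 2.0, df1 / 2.0
    z = df2 / (df2 + df1 * f_stat)

    def integrand(t):
        return t ** (a - 1.0) * (1.0 - t) ** (b - 1.0)

    h = z / steps
    total = integrand(0.0) + integrand(z)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    integral = total * h / 3.0

    # Normalize by the complete beta function B(a, b).
    log_beta = math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)
    return integral / math.exp(log_beta)

# F statistic and degrees of freedom taken from the example in the text.
p_value = f_right_tail(3.206928, 3, 36)
print(round(p_value, 4))
```

For the example values this should agree with the reported p-value of 0.034463 to about four decimal places.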

Follow-up Analysis

Although we now know that there is a significant difference between the four types of peanut butter, we still don’t know where the differences lie.

It appears from Figure 50 that New 2 has a higher rating than the other types of peanut butter. We can use a t test to determine whether there is a significant difference between the rating for New 2 and the next highest rated peanut butter, New 1. The result is shown in Figure 51.

t-Test: Two-Sample Assuming Unequal Variances

	New 1	New 2
Mean	13.1	16.6
Variance	21.21111	7.155556
Observations	10	10
Hypothesized Mean Difference	0
df	14
t Stat	-2.07809
P(T<=t) one-tail	0.028289
t Critical one-tail	1.76131
P(T<=t) two-tail	0.056577
t Critical two-tail	2.144787

Figure 51: Follow-up analysis: New 1 vs. New 2

From Figure 51 we see that there is no significant difference between the two types of peanut butter based on a two-tailed test (0.056577 > .05). If we instead compare New 2 with the Old formula, we get the results shown in Figure 52.

t-Test: Two-Sample Assuming Unequal Variances

	Old	New 2
Mean	11.1	16.6
Variance	18.76667	7.155556
Observations	10	10
Hypothesized Mean Difference	0
df	15
t Stat	-3.41607
P(T<=t) one-tail	0.001915
t Critical one-tail	1.75305
P(T<=t) two-tail	0.003829
t Critical two-tail	2.13145

Figure 52: Follow-up analysis: Old vs. New 2

This time we see that there is a significant difference between the two types of peanut butter, using a two-tailed t-test (.003829 < .05).
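The t Stat and df rows in the two tables can be reproduced from the Mean, Variance and Observations rows. The following sketch applies the standard unequal-variance (Welch) formulas; the function name `welch_from_summary` is hypothetical, and the rounding of df to the nearest integer is an assumption about how the tool displays it, inferred from the values 14 and 15 shown in the tables.

```python
import math

def welch_from_summary(mean1, var1, n1, mean2, var2, n2):
    """Return (t statistic, Welch-Satterthwaite df) from summary statistics."""
    se1, se2 = var1 / n1, var2 / n2      # squared standard errors of each mean
    t_stat = (mean1 - mean2) / math.sqrt(se1 + se2)
    df = (se1 + se2) ** 2 / (se1 ** 2 / (n1 - 1) + se2 ** 2 / (n2 - 1))
    return t_stat, df

# Summary values copied from the New 1 vs. New 2 table.
t_new1, df_new1 = welch_from_summary(13.1, 21.21111, 10, 16.6, 7.155556, 10)
# Summary values copied from the Old vs. New 2 table.
t_old, df_old = welch_from_summary(11.1, 18.76667, 10, 16.6, 7.155556, 10)

print(round(t_new1, 4), round(df_new1))
print(round(t_old, 4), round(df_old))
```

The unrounded df values are not whole numbers, which is why the two comparisons end up with different degrees of freedom even though every sample has ten observations.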

The problem with this approach is that doing multiple tests incurs higher amounts of what is called experimentwise error. Remember that when we use a significance level of α = .05, we accept that 5% of the time we will get a type I error. If we perform three such tests, and the tests are independent, the overall type I error rate rises to 1 – (1 – .05)³ = .14. This means that 14% of the time we will have at least one type I error, which is higher than we would like.

The general approach for addressing this issue is to either reduce α using Bonferroni’s correction (e.g., for three tests we use α/3 = .05/3 = .0167) or to use a different type of test (e.g., Tukey’s HSD or REGWQ), which is beyond the scope of this book.
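The arithmetic behind the experimentwise error rate and Bonferroni's correction can be sketched as follows. The function names are hypothetical, and the error-rate formula assumes the tests are independent.

```python
def experimentwise_error(alpha, num_tests):
    """Probability of at least one type I error across independent tests."""
    return 1 - (1 - alpha) ** num_tests

def bonferroni_alpha(alpha, num_tests):
    """Per-test significance level under Bonferroni's correction."""
    return alpha / num_tests

alpha, num_tests = 0.05, 3
corrected = bonferroni_alpha(alpha, num_tests)

print(round(experimentwise_error(alpha, num_tests), 4))      # 0.1426
print(round(corrected, 4))                                   # 0.0167
print(round(experimentwise_error(corrected, num_tests), 4))  # 0.0492
```

The last line shows that with the corrected per-test level the overall type I error rate stays just under the original .05.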

Levene’s Test

As mentioned previously, the ANOVA test requires that the group variances be equal. There is a lot of leeway here: even when the variance of one group is four times that of another, the analysis will be pretty good. A formal way to test homogeneity of group variances is Levene’s test.

For Levene’s test, the residuals e_ij of the observations from their group means are calculated as follows:

$e_{ij} = x_{ij} - \bar{x}_j$

An ANOVA is then conducted on the absolute value of the residuals. If the group variances are equal, then the average size of the residual should be the same across all groups.

There are three versions of the test, which differ in the center used to compute the residuals: the mean (as described), the median, or the trimmed mean.

Example: Use Levene’s test (residuals from the median) to determine whether the 4 groups in the ANOVA example have significantly different population variances.

We begin by calculating the medians for each group (range B14:E14). E.g., cell B14 contains the formula =MEDIAN(B4:B13).

We next create the table (in range B19:E29) with the absolute residuals from the median. We do this by entering the formula =ABS(B4-B$14) in cell B20, highlighting the range B20:E29 and then pressing Ctrl-R followed by Ctrl-D.

Finally we use the Anova: Single Factor data analysis tool as described earlier in this chapter on the Input Range B19:E29. The output is shown on the right side of Figure 53.
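The same three steps (group medians, absolute residuals, one-way ANOVA on the residuals) can be sketched in Python. The function names and the data below are hypothetical; the data are not the values from the worksheet, and only the F statistic is computed here, not the p-value.

```python
from statistics import fmean, median

def one_way_f(groups):
    """F statistic (MS_B / MS_W) for a one-way ANOVA on a list of groups."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = fmean([x for g in groups for x in g])
    ss_b = sum(len(g) * (fmean(g) - grand_mean) ** 2 for g in groups)
    ss_w = sum((x - fmean(g)) ** 2 for g in groups for x in g)
    return (ss_b / (k - 1)) / (ss_w / (n - k))

def levene_f(groups, center=median):
    """Levene's test statistic: one-way ANOVA on absolute residuals.

    `center` selects the version of the test, e.g. median or fmean.
    """
    abs_residuals = [[abs(x - center(g)) for x in g] for g in groups]
    return one_way_f(abs_residuals)

# Hypothetical data for three groups (illustration only).
groups = [[4, 5, 6], [7, 8, 9], [1, 2, 3, 4]]
f_levene = levene_f(groups)           # residuals from the median
f_levene_mean = levene_f(groups, fmean)  # residuals from the mean
print(round(f_levene, 4), round(f_levene_mean, 4))
```

Passing a different `center` function switches between the versions of the test; the resulting F statistic is then compared with an F distribution with k – 1 and n – k degrees of freedom, as in the ANOVA itself.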

Figure 53: Levene’s test

Since p-value = .25752 > .05 = α, we cannot reject the null hypothesis, and so we conclude there is no significant difference between the 4 group variances. Thus the ANOVA test conducted previously satisfies the homogeneity of variances assumption.

Factorial ANOVA

Example

A new fertilizer has been developed to increase the yield on crops, and the makers of the fertilizer want to better understand which of the three formulations (blends) of this fertilizer is most effective for wheat, corn, soybeans and rice (crops). They test each of the three blends on five samples of each of the four types of crops. The crop yields for the five samples of each of the 12 combinations are as shown in Figure 54.

Determine whether there is a difference between crop yields based on these three types of fertilizer.

Figure 54: Sample data for two factor ANOVA

Before we proceed with the analysis for this example, we review a few basic concepts.