10.5 - Example: SAT-Math Scores by Award Preference

In this example, we are comparing the SAT scores of students who said that they would prefer to win an Academy Award, a Nobel Prize, or an Olympic gold medal.

The example uses the StudentSurvey dataset provided by the Lock5 textbook.

Let's apply the five-step hypothesis testing process to this example.

1. Check assumptions and write hypotheses

The assumptions for a one-way between-groups ANOVA are:

Assumption: Samples are independent: Each student selected either Olympic, Academy, or Nobel. Each student is in only one group and those groups are in no way matched or paired. This assumption is met.

 Assumption: The response variable is approximately normally distributed for each group or all group sample sizes are at least 30: To check this we can construct a histogram with groups in Minitab. These plots show that the distributions are all approximately normal. 

Histograms of SAT grouped by award

 Assumption: The population variances are equal across responses for the group levels (if the largest sample standard deviation divided by the smallest sample standard deviation is not greater than two, then assume that the population variances are equal). Again we can use Minitab to look at the standard deviations across the groups. The largest standard deviation is 151.3 and the smallest is 114.1 for a ratio of 1.33 which is less than 2. So this assumption is met.

Statistics

            

Award

N

N*

Mean

SE Mean

StDev

Minimum

Q1

Median

Q3

Maximum

Academy

31

0

1191.0

27.2

151.3

820.0

1070.0

1200.0

1300.0

1530.0

Nobel

149

0

1239.1

9.38

114.5

920.0

1160.0

1230.0

1310.0

1550.0

Olympic

182

0

1176.7

8.46

114.1

800.0

1100.0

1190.0

1250.0

1470.0

Use Minitab to run a one-way ANOVA with the Minitab file: StudentSurvey.mpx

The following will describe the output within the context of the five-step hypothesis testing process.

Given that you are comparing \(k\) independent groups, the null and alternative hypotheses are:

Method

Null hypothesis

All means are equal

Alternative hypothesis

At least one mean is different

Significance level

\(\alpha=0.05\)

Equal variances were assumed for the analysis

2. Calculate the test statistic

You should get the following output:

Factor Information

Factor

Levels

Values

Award

3

Academy, Nobel, Olympic

The factor information tells us that our factor is the Award.

Analysis of Variance

Source

DF

Adj SS

Adj MS

F-Value

P-Value

Award

2

323269

162134

11.67

0.000

Error

359

4986078

13889

  

Total

361

5310347

   

The analysis of variance table is also known as our ANOVA source table. The source of the Award is our between-groups variation so the DF K -1 or 3-1 = 2. Error is our within-groups variation so the degrees of freedom are n - K or 362-3 = 359. The 'Adj' stands for adjusted. For a one-way ANOVA, nothing is being adjusted. The F-value is our F-test statistic, which in this case is 11.67 with a p-value of 0.000. The F value could be written as F (2, 359) = 11.67.

Means

Factor

N

Mean

StDev

95% CI

Academy

31

1191.0

151.3

(1149.3, 1232.6)

Nobel

149

1239.11

114.54

(1220.13, 1258.10)

Olympic

182

1176.73

114.13

(1159.55, 1193.91)

Pooled StDev = 117.851

The Means table is just a table of descriptive statistics. The sample size, mean, and the standard deviation is presented for each group. The last column is an unadjusted 95% confidence interval. You should not refer to these confidence intervals since these do not take into account that there are three different groups and three different confidence intervals being computed at the same time. The confidence intervals that we're interested in come with our Tukey pairwise comparison.

 need to know two of these degrees of freedom: \(df_1 = df_{between}\) and \(df_2=df_{within}\).

3. Determine the p-value

The p-value is 0.000 from the Minitab output.

4. Make a decision

\(p \leq \alpha\) so reject the null hypothesis.

5. State a "real world" conclusion.

We can conclude that the group means are not all equal.

Remember that ANOVA is just an omnibus test. It only tells us there is a difference somewhere. To determine where the differences are you would have to look at the Tukey pairwise comparison output.

Grouping Information Using the Tukey Method and 95% Confidence

Award

N

Mean

Grouping

Nobel

149

1239.11

A

 

Academy

31

1190.97

A

B

Olympic

182

1176.73

 

B

Means that do not share a letter are significantly different.

In the grouping table, Nobel and Olympic do not share a grouping letter. The means of these two groups are significantly different. Both the Nobel and Academy belong to Group A and both Academy and Olympic belong to Group B. These pairs are not significantly different.

Tukey Simultaneous Tests for Differences of Means
      

Difference of Levels

Difference
of Means

SE of
Difference

95% CI

T-Value

Adjusted
P-Value

Nobel - Academy

48.1

23.3

(-6.3, 102.6)

2.07

0.096

Olympic - Academy

-14.2

22.9

(-67.8, 39.4)

-0.62

0.808

Olympic - Nobel

-62.4

13.0

(-92.9, -31.9)

-4.79

0.000

Individual confidence level = 98.87%

The Tukey Simultaneous Tests table provides the adjusted 95% confidence intervals. These are the confidence intervals and p-values you should be looking at when you're making pairwise comparisons. These are adjusted to take into account that there are three different pairwise tests being performed simultaneously. To determine which groups are statistically significant we can look at the adjusted p-value. In this case, the only p-value less than 0.05 is the Olympic - Nobel pairwise comparison. The means of the Olympic and Nobel group are significantly different.

ANOVA Output - Example 3

In the Tukey Simultaneous 95% CI graph any interval not containing zero indicates a statistically significant difference. In this case, we see that the Olympic - Nobel group does not contain zero so this pairing has a statistically significant difference.