Identify: The parameter we want to estimate is $\mu_A - \mu_B$, where $\mu_A$ is the true average score under Version A and $\mu_B$ is the true average score under Version B. We will estimate this parameter at the 95% confidence level.
Choose: Because we are comparing two means, we will use a 2-sample $t$-interval.
Check: The data were collected from a randomized experiment with two treatments: Version A and Version B of the test. The 10% condition does not need to be checked here since we are not sampling from a population. There were 30 students in each group, so the condition that both group sizes are at least 30 is met.
Calculate: We will calculate the confidence interval as follows.
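The point estimate is the difference of sample means: $\bar{x}_A - \bar{x}_B$.
The $SE$ of a difference of sample means is: $SE = \sqrt{\frac{s_A^2}{n_A} + \frac{s_B^2}{n_B}}$, where $s_A$ and $s_B$ are the sample standard deviations and $n_A = n_B = 30$ are the group sizes.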
In order to find the critical value $t^*$, we must first find the degrees of freedom. Using a calculator, we find the value of $df$. We round down to 50, and using a $t$-table at row $df = 50$ and confidence level 95%, we get $t^* = 2.009$.
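The standard error and degrees of freedom steps can be sketched in a few lines of Python. This is a minimal illustration, assuming the calculator uses the Welch-Satterthwaite approximation for the degrees of freedom (the usual choice for a 2-sample $t$-interval, though the text does not name it). The function names `standard_error` and `welch_df` are hypothetical, and the standard deviations 10 and 15 are made-up values for illustration only, not the exam data.

```python
import math


def standard_error(s_a, n_a, s_b, n_b):
    """Standard error of a difference of two independent sample means."""
    return math.sqrt(s_a**2 / n_a + s_b**2 / n_b)


def welch_df(s_a, n_a, s_b, n_b):
    """Welch-Satterthwaite approximation to the degrees of freedom."""
    v_a = s_a**2 / n_a  # estimated variance of the first sample mean
    v_b = s_b**2 / n_b  # estimated variance of the second sample mean
    return (v_a + v_b) ** 2 / (v_a**2 / (n_a - 1) + v_b**2 / (n_b - 1))


# Hypothetical standard deviations, chosen only for illustration.
# They are NOT the values from the exam experiment.
s_a, s_b = 10.0, 15.0
n_a = n_b = 30  # group sizes stated in the Check step

se = standard_error(s_a, n_a, s_b, n_b)
df = welch_df(s_a, n_a, s_b, n_b)

# Round down to the nearest row of a typical t-table before looking up t*.
print(f"SE = {se:.3f}, df = {df:.1f}, table row = {math.floor(df / 10) * 10}")
```

Rounding the degrees of freedom down rather than up gives a slightly larger critical value, so the resulting interval is, if anything, a little wider than necessary.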
The 95% confidence interval is given by: $(\bar{x}_A - \bar{x}_B) \pm t^* \times SE$, which evaluates to $(-2.5, 13.1)$.
Conclude: We are 95% confident that the true difference in average score between Version A and Version B is between -2.5 and 13.1 points. Because the interval contains both positive and negative values, the data do not convincingly show that one exam version is more difficult than the other, and the teacher should not be convinced that she should add points to the Version B exam scores.
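As a quick check on the Calculate step, the reported interval can be taken apart into its pieces. The sketch below assumes the interval has the form point estimate $\pm\ t^* \times SE$ with $t^* = 2.009$ (the table value for $df = 50$ at 95% confidence). The helper names are hypothetical, and because the endpoints are rounded to one decimal place, the recovered values are approximate.

```python
def parts_from_interval(lower, upper, t_star):
    """Recover point estimate, margin of error, and implied SE from an interval."""
    point_estimate = (lower + upper) / 2  # a t-interval is centered on the estimate
    margin = (upper - lower) / 2          # half-width equals t* times SE
    return point_estimate, margin, margin / t_star


def interval_from_parts(point_estimate, t_star, se):
    """Build the interval: point estimate plus or minus t* times SE."""
    margin = t_star * se
    return point_estimate - margin, point_estimate + margin


lower, upper = -2.5, 13.1  # endpoints reported in the Conclude step
t_star = 2.009             # t-table value for df = 50, 95% confidence

point_estimate, margin, implied_se = parts_from_interval(lower, upper, t_star)
rebuilt = interval_from_parts(point_estimate, t_star, implied_se)

print(f"point estimate = {point_estimate:.1f}")
print(f"margin of error = {margin:.1f}")
print(f"implied SE = {implied_se:.2f}")
print(f"rebuilt interval = ({rebuilt[0]:.1f}, {rebuilt[1]:.1f})")
```

The margin of error is larger than the point estimate, which is why the interval extends below zero and the conclusion cannot rule out a true difference of zero.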