Skip to main content

Section 22.2 Example 4.2: Speed Limit Changes

Try these questions yourself before you use the solutions following to check your answers.
In 1995, the National Highway System Designation Act abolished the federal mandate of 55 miles per hour maximum speed limit and allowed states to establish their own limits. Of the 50 states (plus District of Columbia), 32 increased their speed limits in 1996. The data in TrafficFatalities.txt shows the percentage change in interstate highway traffic fatalities from 1995 to 1996 and whether or not the state increased their speed limit. (Data from the National Highway Traffic Safety Administration as reported in Ramsey and Schafer, 2002.)
Speed limit change study illustration

Checkpoint 22.2.1. Study Design.

Identify the observational units and response variable of interest. Is this a randomized experiment or an observational study?
Solution.
The observational units are the 50 states and the District of Columbia. The response variable of interest is the percentage change in traffic fatalities from 1995 to 1996 (quantitative). This is an observational study because the researchers did not randomly assign which states would increase their speed limits.

Checkpoint 22.2.2. Descriptive Analysis.

Produce numerical and graphical summaries of these data and describe how the two groups compare.

Aside: Descriptive Statistics Applet.

Solution.
The following graphical display is dotplots of the percentage change in traffic fatalities for each state (and D.C.) in the two groups on the same scale:
Dotplots comparing percentage change for states with and without speed limit increases
Because the distributions are reasonably symmetric, it makes sense to report the means and standard deviations as the numerical summaries:
Group Mean Standard Deviation
No increase \(\bar{x}_{no} = -8.53\%\) \(s_{no} = 31\%\)
Increase \(\bar{x}_{yes} = 13.69\%\) \(s_{yes} = 22\%\)
These results indicate that there is a tendency for the percentage change in traffic fatalities to be higher in those states that increase their speed limits. This tendency is also seen in stacked boxplots:
Stacked boxplots comparing the two groups
The boxplots also reveal an outlier, the District of Columbia, which did not change its speed limit and had an unusually high decrease in the percentage change of accidents.
These summaries also reveal that the two sample distributions are reasonably similar in shape and variability.

Checkpoint 22.2.3. Technical Conditions.

Are the technical conditions for a two-sample t-test met for this study? Explain.
Solution.
In considering the technical conditions, we see that the sample sizes (19 and 32) are reasonably large. Coupled with the normal shaped sample distributions, the normality/large sample size conditions appears to be satisfied for us to use the t-distribution.

Checkpoint 22.2.4. Two-Sample t-Test and Confidence Interval.

Carry out a two-sample t-test to determine whether the average percentage change in interstate highway traffic fatalities is significantly higher in states that increased their speed limit. If you find a significant difference, estimate its magnitude with a confidence interval.

Aside: Theory-Based Inference Applet.

Solution.
Let \(\mu_{no} - \mu_{yes}\) represent the true "effect" of increasing the speed limit on the traffic fatality rate (states that didn’t change speed limit – states that did change speed limit)
\(H_0: \mu_{no} - \mu_{yes} = 0\)
(there is no true effect from increasing the speed limit)
\(H_a: \mu_{no} - \mu_{yes} \lt 0\)
(increasing the speed limit leads to an increase in traffic fatalities,higher average percentage change with increase in speed limit)
We can apply a randomization test that would look at what would happen if these groups were mixed up with no difference between the "no" group and the "yes" group.
Simulation results assuming null hypothesis
Simulation results assuming alternative
We can also approximate this randomization distribution with the two-sample t-procedure. In this case, the (unpooled) standardized statistic will be \(t = \frac{-8.53 - 13.69}{\sqrt{\frac{31^2}{19} + \frac{22^2}{32}}} = -2.74\)
If we approximate the degrees of freedom by \(\min(19-1, 32-1) = 18\text{,}\) then we find the one-sided p-value to be:
> iscamtprob(-2.74, 18, "below")
probability: 0.006728
T distribution showing p-value calculation
These calculations are confirmed with technology (with different df approximations):
Theory-based inference applet
Theory-based inference applet results
Note: Our "by hand" method (using df = 18) is conservative in that the p-value found will be larger than the actual p-value as seen here.
Such a small p-value (0.005 \(\lt\) 0.01) reveals that we would observe such a large difference in group means by random assignment alone if there was no treatment effect only about 5 times in 1000, convincing us that the observed difference in the group means is larger than what we would expect just from random assignment. We have strong evidence that something other than "random chance" led to this difference. However, we cannot attribute the difference solely to the speed limit change because this was not actually a randomized experiment. As the states self-selected, there could be confounding variables that help to explain the larger increase in fatality rates in states that increased their speed limit.
Because we rejected the null hypothesis, we are also interested in examining a confidence interval to estimate the size of the treatment effect. We first approximate the \(t^*\) critical value for say 95% confidence, again using \(\min(19-1, 32-1) = 18\) as the degrees of freedom.
> iscaminvt(.95, 18, "between")
There is 0.95 probability between -2.101 and 2.101
T distribution showing critical values
Then the 95% confidence interval can be calculated:
\begin{gather*} (-8.53 - 13.69) \pm 2.101\sqrt{\frac{31^2}{19} + \frac{22^2}{32}}\\ = -22.2 \pm 16.85 \text{ or } (-39.05, -5.35) \end{gather*}
We are 95% confident that the true "treatment effect" is in this interval or that the mean percentage increase in traffic fatality rates is between 5.4 percentage points to 39.1 percentage points higher in states that increase their speed limit compared to states that do not increase their speed limit (continuing to be careful not to state this as a cause and effect relationship).
Before we complete this analysis, it is worthwhile to investigate the amount of influence that the outlier (the District of Columbia) has on the results, especially because D.C. does have different characteristics from the states in general. The updated R output (two-sided p-value) is below:
R output after removing D.C. outlier
As we might have guessed, the mean increase in fatalities for the "No" group has increased so that the difference in the group means is less extreme. This leads to a less extreme standardized statistic and a larger p-value (one-sided p-value = 0.01785/2 = 0.0089) so somewhat weaker evidence against the null hypothesis in favor of the one-sided alternative hypothesis.

Checkpoint 22.2.5. Interpret p-value.

Discuss what the p-value in the previous question measures.
Solution.
The other technical condition is that we have independent random samples or random assignment to groups. We do not have either in this study, because we are examining the population of all states (and D.C.), and the states self-selected whether they changed their speed limit. Thus, any p-value we calculate is in a sense hypothetical because we have all the states here, we might ask the question: would the two groups look this different if whether or not they increased their speed limit had been assigned at random?
So the above p-value measures how often we would see a difference in group means at least this large based on random assignment to the two groups if there were no true treatment effect. Although this p-value is hypothetical, we still have some sense that the difference observed between the groups is larger than we would expect to see "by chance" even in a situation like this where it is not feasible to carry out a true randomized experiment. This gives some information that can be used in policy decisions but we must be careful not to overstate the attribution to the speed limit change.
You have attempted of activities on this page.