Investigation 1.10: Heart Transplant Mortality (cont.)

Section 4.6 Investigation 1.10: Heart Transplant Mortality (cont.)

The Exact Binomial and Normal approximation (z) confidence intervals have advantages and disadvantages, so let’s explore some other options.

🔗

Exercises 4.6.1 More Confidence Intervals

Recall Investigation 1.4, where you learned that 8 of the 10 most recent heart transplantation operations at St. George’s Hospital resulted in a death.

🔗

1. Exact Binomial.

Use technology to find the "exact binomial" (Clopper-Pearson) 95% confidence interval for the probability of death in the heart transplantation process at St. George’s Hospital.

🔗

Report the interval, including the midpoint and width.

Interval: (, )

Midpoint:

Width:

Hint 1. R

Use the iscambinomtest() function with the following parameters:

🔗

observed = number of successes (8)
🔗

🔗
n = sample size (10)
🔗

🔗
alternative = "two.sided" (for confidence intervals)
🔗

🔗
conf.level = confidence level (0.95 or 95)
🔗

🔗

🔗

Hint 2. JMP

Use Journal File … confidence interval (Summary Stats) … binomial

🔗

Hint 3. Calculation hints

Midpoint = (upper endpoint + lower endpoint)/2

🔗

Width = (upper endpoint - lower endpoint)

🔗

Solution.

R.

R output showing exact binomial confidence interval

In JMP:

JMP output showing exact binomial confidence interval

The midpoint of this interval is (0.4439 + 0.9748)/2 = 0.709 which is lower than \(\hat{p}\) = 0.80. This makes sense as if we observed \(\hat{p}\) = 0.80, the largest \(\pi\) could be is 1.0 (0.20 away) but \(\pi\) could conceivably be as small as 0.0 (0.80 away), so we don’t want an interval that is symmetric around 0.80. The width is 0.9748 - 0.4439 = 0.531.

🔗

2. Calculate 95% z-Confidence Interval.

Use your calculator to calculate the 95% one proportion \(z\) confidence interval \(\hat{p} \pm z^* \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\text{.}\) Also report the midpoint and width.

🔗

Hint.

First calculate SE(\(\hat{p}\)) = \(\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\text{,}\) then use \(z^* = 1.96\) for 95% confidence.

🔗

Midpoint = (upper endpoint + lower endpoint)/2

🔗

Width = (upper endpoint - lower endpoint)

Interval: (, )

Midpoint:

Width:

Solution.

\(SE() = sqrt{0.8(0.2)/10} = 0.1265\)

🔗

\(0.8 ± 1.96 \sqrt{0.8(0.2)/10}\) \(\Rightarrow\) \((0.552, 1.05)\)

🔗

Midpoint is 0.80 and the width is 0.498.

🔗

3. Verify z-Interval with Technology.

Use technology to verify your calculation of the one proportion \(z\)-interval (also called the Wald interval). Does the technology output match your calculation from the previous question?

🔗

Hint 1. R/Sage instructions

Use the iscamonepropztest() function with the following parameters:

🔗

observed = number of successes (8)
🔗

🔗
n = sample size (10)
🔗

🔗
conf.level = confidence level (0.95 or 95)
🔗

🔗

🔗

Hint 2. JMP instructions

Use Journal File … confidence interval …

🔗

Verified
The technology output should match your calculation: (0.552, 1.05), though some software packages may round the upper value to 1.
Not Verified
Check your calculation again. The technology should produce the same interval: (0.552, 1.05), though some software pages may round the upper limit to 1.

Solution.

R instructions.

R output showing z-interval

🔗

JMP.

JMP output showing z-interval

🔗

The technology output matches the hand calculation: (0.552, 1.05), but may report (0.552, 1), recognizing that values above 1 are not legitimate.

🔗

4. Assess Validity of z-Interval.

Do you consider the one proportion \(z\)-interval valid for these data? Explain how you are deciding.

🔗

Solution.

No, we do not pass the validity conditions for this procedure and get a result that includes values for \(\pi\) above 1.

🔗

Key Idea.

The confidence level of an interval procedure is supposed to indicate the long-run percentage of intervals that capture the actual parameter value, if repeated random samples were taken from the identical process. As such, the confidence level presents a measure of the reliability of the method. A confidence interval procedure is considered valid when the achieved long-run coverage rate matches (or comes close to) the stated (aka nominal) confidence level.

🔗

Statistical confidence is a statement about the method, not a probability statement about an individual interval.

🔗

To consider whether the one sample \(z\)-interval is valid for this study, we can pretend we know the process probability, generate many random samples from that process, calculate confidence intervals from the simulated samples, and see what percentage of these confidence intervals capture the actual value of the process probability that we specified.

🔗

Using the Simulating Confidence Intervals applet.

🔗

Suppose the actual process probability of death is 0.15 and you plan to take a sample of 10 operations and apply the \(z\)-interval (aka Wald) method.

🔗

Set the value of \(\pi\) to 0.15 and the Sample size (\(n\)) to 10.
🔗

🔗
Press Sample.
🔗

🔗
Click on the interval to see the endpoint(s), midpoint, and width (verify).
🔗

🔗

🔗

Simulating Confidence Intervals applet showing single interval for pi=0.15, n=10

🔗

5. Simulating Confidence Intervals.

Did you obtain the same interval as in Question 3?

🔗

Yes
That would be very unlikely! Each simulation uses a different random sample from the process with \(\pi = 0.15\text{,}\) so you’ll get different intervals. Question 3 used the actual data with 8 out of 10 deaths.
No
Correct! Each simulation uses a different random sample from the process with \(\pi = 0.15\text{,}\) so you’ll get different intervals. Question 3 used the actual data with 8 out of 10 deaths.

Solution.

Most likely no - each simulation uses a different random sample from the process with \(\pi = 0.15\text{,}\) so you’ll get different intervals. Question 3 used the actual data with 8 out of 10 deaths.

🔗

6. Did Interval Capture π?

Did your interval succeed in capturing the value of \(\pi = 0.15\text{?}\) (Check if 0.15 falls between your interval endpoints, or if the interval is colored green in the applet.)

🔗

Solution.

Results will vary. Green intervals capture \(\pi = 0.15\text{,}\) red intervals do not. Most intervals should be green.

🔗

Change the Number of intervals to 100 and press Sample until you have at least 1,000 intervals in the Running total.

🔗

🔗

7. Evaluate z-Interval Coverage Rate.

How many of these random intervals succeed in capturing the actual value of the process probability (0.15)? Does this method appear to succeed in capturing the value of the parameter (which we assumed to be equal to 0.15) 95% "of the time"?

🔗

Solution.

The coverage rate is not close to 95%, but closer to 80%.

🔗

Because the \(z\)-interval procedure has the sample size conditions of the Central Limit Theorem, there are scenarios where we should not use this \(z\)-procedure to determine the confidence interval. In these cases, one option is to use the Exact Binomial distribution (Clopper-Pearson) method.

🔗

Use the Method pull-down menu to select Exact Binomial and continue to press Sample until you generate at least 1,000 confidence intervals from a process with \(\pi = 0.15\) and \(n = 10\text{.}\)

🔗

🔗

Simulating Confidence Intervals applet showing Exact Binomial method

🔗

8. Evaluate Exact Binomial Coverage Rate.

Does this appear to be a 95% confidence interval method?

🔗

Yes
Correct! The coverage rate is much closer to 95%.
No
Not quite. Check the running total - the coverage rate should be close to 95%.
Can’t tell
You can tell by looking at the running total percentage in the applet - it should be close to the stated 95% confidence level.

Solution.

Now the coverage rate is much closer to 95%.

🔗

9. Average Width of Exact Binomial Intervals.

What is the average width of these intervals? (See the Mean of the "(CI Widths)" graph.)

🔗

Average width:

🔗

Solution.

The average width should be about 0.46.

🔗

The Clopper-Pearson method will have a coverage rate of at least 95%, but tends to be "conservative," meaning the coverage rate is higher than 95% so the intervals are longer than they need to be. The method is also fairly complicated (as opposed to: "go two standard deviations on each side"). An alternative method that has been receiving much attention of late is often called the Plus Four procedure:

🔗

Definition: Plus Four 95% Confidence Interval for \(\pi\).

Determine the number of successes (\(X\)) and sample size (\(n\)) in the study
🔗

🔗
Increase the number of successes by two and the sample size by four, and calculate a new proportion: \(\tilde{p} = (X + 2)/(n + 4)\text{.}\) Make this value the midpoint of the interval.
🔗

🔗
Use the \(z\)-interval procedure as above for the augmented sample size of \((n + 4)\text{:}\) \(\tilde{p} \pm z^* \sqrt{\frac{\tilde{p}(1-\tilde{p})}{n+4}}\)
🔗

🔗

🔗

In short, with 95% confidence, the idea is to pretend you have 2 more successes and 2 more failures than you really did. For confidence levels other than 95%, researchers have recommended using the Adjusted Wald method: \(\tilde{p} = (X + 0.5(z^*)^2)/(n + (z^*)^2)\) and \(\tilde{n} = n + (z^*)^2\) so the interval becomes \(\tilde{p} \pm z^* \sqrt{\frac{\tilde{p}(1-\tilde{p})}{\tilde{n}}}\text{.}\)

🔗

Investigate the reliability of the Plus Four procedure using the applet:

🔗

Use the Method pull-down menu to change from Exact Binomial to Plus Four (95%).
🔗

🔗
Generate at least 1,000 confidence intervals from a process with \(\pi = 0.15\) and \(n = 10\text{.}\)
🔗

🔗

🔗

Simulating Confidence Intervals applet showing Plus Four method

🔗

10. Evaluate Plus Four Coverage Rate.

Is this coverage rate close to 95%? Is this an improvement over the (Wald) \(z\)-interval?

🔗

Solution.

The coverage rate is again close to 95%, much better than the Wald procedure.

🔗

11. Average Width of Plus Four Intervals.

What is the average width of the Plus Four intervals?

🔗

Solution.

The average width is about 0.44.

🔗

12. Compare Plus Four to Exact Binomial Width.

Do these intervals tend to be narrower than intervals calculated using the Exact Binomial interval procedure?

🔗

Solution.

Yes, they are narrower than the Exact Binomial.

🔗

Use the Method pull-down menu to toggle between the Wald and Plus Four intervals. [Hint: You can Sort the intervals first.]

🔗

🔗

13. Compare Interval Midpoints.

What do you notice about how the midpoints of the intervals compare? (Also think how \(\tilde{p}\) and \(\hat{p}\) differ, these are the values shown in the "Sample statistics (CI Midpoints)" graph.) Why is the change in the midpoints of the intervals useful in this scenario?

🔗

Solution.

The midpoints shift towards 0.5, this moves the midpoints off of zero and gives all intervals a positive length.

🔗

In the applet, change the Number of intervals to 100.

🔗
Press Reset.

🔗
Use the pull-down menu to select the Blaker method (the Exact Binomial using the tail probabilities), and wait a bit. (If it’s just 100 at a time, it’s not too bad to hit Sample a few more times.)

🔗
Report the percentage of intervals that capture the parameter AND the average width of these 100 intervals.

🔗

🔗

14. Evaluate Blaker Method.

Report the percentage of intervals that capture the parameter AND the average width of these 100 intervals.

🔗

Running Total:

🔗

Average width:

🔗

Solution.

The running total should be very close to 95% and the average width close to 0.45.

🔗

15. Assess Blaker Method.

Does this appear to be a 95% confidence interval procedure?

🔗

Yes
Correct! Especially if you look at a few hundred, about 95% of the intervals appear to capture the parameter.
No
Not quite. Look at the running total - it should be very close to 95%.

Solution.

Yes, especially if look at a few hundred, about 95% of the intervals appear to capture the parameter.

🔗

16. Shortest Intervals.

In your simulations, which method(s) appear to have the shortest intervals?

🔗

Binomial
Not quite. The binomial intervals tend to be wider on average.
Plus Four
Correct! The Plus Four and Blaker methods tend to have shorter intervals than the binomial method.
Blaker
Correct! The Plus Four and Blaker methods tend to have shorter intervals than the binomial method.

Solution.

Opinions may vary but the binomial intervals tend to be wider on average.

🔗

Key Idea.

The Plus Four method is more likely to be a 95% confidence interval method than the one-sample \(z\)-interval method. It is never a bad idea to use the Plus Four method.

🔗

"Fudging our data" may seem like cheating, but there is actually some nice theory behind this method. The plus four adjustment is not that different in principle from the continuity correction we suggested with p-values. Many statisticians now recommend using this procedure over the one proportion \(z\)-interval procedure because it achieves the stated confidence level for any sample size and over the Clopper-Pearson binomial interval procedure because the intervals tend to be shorter (narrower). In fact, the more general case (something other than 95%) is becoming the default in many software packages. (The Blaker method in others.)

🔗

17. Plus Four Interval for St. George’s Hospital.

Determine and then interpret a Plus Four confidence interval for the probability of a death during a heart transplant operation at St. George’s hospital.

🔗

Hint.

You can do the calculation either by hand, first finding \(\tilde{p}\) and \(z^*\text{,}\) or with software (e.g., the Theory-Based Inference applet) by telling the technology there were 4 more operations consisting of two more deaths than in the actual sample.

🔗

Note: You can’t use the Simulating Confidence Intervals applet to answer this question.

🔗

Plus Four interval: (, )

🔗

Interpretation:

🔗

Solution.

\(10/14 = 0.714; 0.714 ± 1.96 \sqrt{0.714(1 - 0.714)/14} = 0.714 ± 0.2367\)\(\Rightarrow (0.477, 0.951)\)

🔗

I am 95% confident that the underlying mortality rate at St. George’s Hospital is between 0.477 and 0.951.

🔗

18. Compare Intervals for Larger Study.

Repeat for the larger study at St. George’s that revealed 71 deaths out of 361 operations. Round to 3 decimal places.

🔗

Wald interval: (, )

🔗

Plus Four interval: (, )

🔗

Compare:

🔗

Solution.

With the larger sample size, the intervals are more similar.

🔗

Wald: (0.156, 0.238); Plus four: (0.160, 0.241)

🔗

Subsection 4.6.2 Practice Problem 1.10A

Return to your exploration using the Simulating Confidence Intervals applet. Assume that \(\pi = 0.667\) is the probability that a kissing couple leans to the right.

🔗

Checkpoint 4.6.1. Evaluate Score Interval Method.

From the Method pull-down menu select the Score interval method. Evaluate the performance of this interval method, clearly explaining your steps. [Hint: Does the performance depend on the value of \(n\) that you use? What if you change the value of \(\pi\) to something closer to 0 or 1? Average widths?]

🔗

Solution.

The Score interval method performs well across different sample sizes and parameter values. The coverage rate is close to the nominal 95% confidence level even for small sample sizes and extreme values of \(\pi\text{.}\) As sample size increases, the average width of the intervals decreases while maintaining the appropriate coverage rate.

🔗

Checkpoint 4.6.2. Derive Score Interval Formula.

Starting with \(z = \frac{\hat{p} - \pi}{\sqrt{\pi(1-\pi)/n}}\text{,}\) use the quadratic formula to solve for \(\pi\) and verify the first formula for the Score interval.

🔗

Solution.

Starting with \(z = \frac{\hat{p} - \pi}{\sqrt{\pi(1-\pi)/n}}\text{,}\) square both sides and rearrange to get a quadratic equation in \(\pi\text{.}\) Using the quadratic formula yields the Score interval formula:

🔗

\(\frac{\hat{p} + \frac{z^2}{2n} \pm z\sqrt{\frac{\hat{p}(1-\hat{p})}{n} + \frac{z^2}{4n^2}}}{1 + \frac{z^2}{n}}\)

🔗

Checkpoint 4.6.3. Compare Score and Adjusted Wald Midpoints.

Show that the midpoint of the Score formula found in the previous question simplifies to the midpoint of the Adjusted Wald procedure.

🔗

Solution.

The midpoint of the Score interval is \(\frac{\hat{p} + \frac{z^2}{2n}}{1 + \frac{z^2}{n}}\text{.}\) With \(z^* = 1.96 \approx 2\text{,}\) this becomes approximately \(\frac{\hat{p} + \frac{4}{2n}}{1 + \frac{4}{n}} = \frac{\hat{p} + \frac{2}{n}}{1 + \frac{4}{n}}\text{,}\) which matches the midpoint of the Adjusted Wald (Plus Four) procedure.

🔗

Checkpoint 4.6.4. Relate Adjusted Wald to Plus Four.

Show that with 95% confidence the midpoint of the Adjusted Wald procedure can be approximated with \(\frac{X + 2}{n + 4}\text{,}\) the Plus Four method.

🔗

Solution.

With \(z^* = 1.96 \approx 2\) for 95% confidence, the Adjusted Wald midpoint \(\frac{\hat{p} + \frac{z^{*2}}{2n}}{1 + \frac{z^{*2}}{n}}\) becomes \(\frac{\hat{p} + \frac{4}{2n}}{1 + \frac{4}{n}} = \frac{\hat{p} + \frac{2}{n}}{1 + \frac{4}{n}}\text{.}\) Since \(\hat{p} = \frac{X}{n}\text{,}\) this equals \(\frac{\frac{X}{n} + \frac{2}{n}}{1 + \frac{4}{n}} = \frac{\frac{X+2}{n}}{\frac{n+4}{n}} = \frac{X+2}{n+4}\text{,}\) which is the Plus Four method.

🔗

Subsection 4.6.3 Practice Problem 1.10B

Checkpoint 4.6.5. Probability of Selecting Capturing Interval.

Suppose that you take 1000 samples from a random process and generate a 95% confidence interval for \(\pi\) with each sample. If you randomly select one of these intervals, what is the probability that you will select an interval that captures \(\pi\text{?}\)

🔗

Probability:

🔗

Solution.

The probability is approximately 0.95 (or 95%). Since these are 95% confidence intervals, in the long run about 95% of the intervals will capture the true parameter value \(\pi\text{.}\) Therefore, randomly selecting one interval gives you about a 95% chance of selecting one that captures \(\pi\text{.}\)

🔗

Checkpoint 4.6.6. Probability for Specific Interval.

Suppose you take one sample from a random process and generate a 95% confidence interval to be (0.15, 0.25). What is the probability that this interval captures \(\pi\text{?}\)

🔗

Probability:

🔗

Solution.

The probability is either 0 or 1 - we just don’t know which! Once a specific interval is constructed, the parameter \(\pi\) either is or isn’t in that interval. The "95% confidence" refers to the method, not to any particular interval. If we were to use this method repeatedly, about 95% of the intervals constructed would contain \(\pi\text{.}\)

🔗

Checkpoint 4.6.7. Effect of Confidence Level.

In general, how does increasing the confidence level impact the width and coverage rate of a confidence interval procedure?

🔗

Width decreases, coverage rate decreases
Incorrect. To be more confident, we need a wider interval and higher coverage rate.
Width decreases, coverage rate increases
Incorrect. Higher confidence level requires a wider interval.
Width increases, coverage rate increases
Correct! Increasing confidence level requires a larger critical value, which increases the margin of error and makes the interval wider. It also increases the coverage rate - a higher percentage of intervals will capture the true parameter.
Width increases, coverage rate decreases
Incorrect. Higher confidence level means we capture the parameter more often, not less.

🔗

Checkpoint 4.6.8. Effect of Sample Size.

In general, how does increasing the sample size impact the width and coverage rate of a confidence interval procedure?

🔗

Width decreases, coverage rate stays the same
Correct! Larger sample sizes reduce the standard error, which decreases the margin of error and makes intervals narrower. For a valid confidence interval procedure, the coverage rate should remain at the stated confidence level (e.g., 95%) regardless of sample size.
Width decreases, coverage rate increases
Incorrect. While larger samples produce narrower intervals, the coverage rate should match the confidence level regardless of sample size for a valid procedure.
Width stays the same, coverage rate stays the same
Incorrect. Sample size affects the width through the standard error.
Width increases, coverage rate stays the same
Incorrect. Larger samples give more precise estimates with narrower intervals, not wider.

🔗

You have attempted of activities on this page.

🔗

Prev Top Next

Section 4.6 Investigation 1.10: Heart Transplant Mortality (cont.)

Exercises 4.6.1 More Confidence Intervals

1. Exact Binomial.

R.

2. Calculate 95% z-Confidence Interval.

3. Verify z-Interval with Technology.

R instructions.

JMP.

4. Assess Validity of z-Interval.

Key Idea.

5. Simulating Confidence Intervals.

6. Did Interval Capture π?

7. Evaluate z-Interval Coverage Rate.

8. Evaluate Exact Binomial Coverage Rate.

9. Average Width of Exact Binomial Intervals.

Definition: Plus Four 95% Confidence Interval for \(\pi\).

10. Evaluate Plus Four Coverage Rate.

11. Average Width of Plus Four Intervals.

12. Compare Plus Four to Exact Binomial Width.

13. Compare Interval Midpoints.

14. Evaluate Blaker Method.

15. Assess Blaker Method.

16. Shortest Intervals.

Key Idea.

17. Plus Four Interval for St. George’s Hospital.

18. Compare Intervals for Larger Study.

R (Wald):.

R (Plus Four):.

JMP (Wald):.

JMP (Plus Four):.

Subsection 4.6.2 Practice Problem 1.10A

Checkpoint 4.6.1. Evaluate Score Interval Method.

Checkpoint 4.6.2. Derive Score Interval Formula.

Checkpoint 4.6.3. Compare Score and Adjusted Wald Midpoints.

Checkpoint 4.6.4. Relate Adjusted Wald to Plus Four.

Subsection 4.6.3 Practice Problem 1.10B

Checkpoint 4.6.5. Probability of Selecting Capturing Interval.

Checkpoint 4.6.6. Probability for Specific Interval.

Checkpoint 4.6.7. Effect of Confidence Level.

Checkpoint 4.6.8. Effect of Sample Size.