Skip to main content

Section 4.9 Investigation 1.18: Women Senators

Suppose that an alien lands on Earth, notices different sexes among humans, and sets out to estimate the proportion of humans that identify as women. Fortunately, the alien had a good statistics course on its home planet, so it knows to take a sample of human beings and produce a confidence interval. Suppose that the alien happened upon the members of the 2025 U.S. Senate as its sample of human beings, so the alien finds 26 women in its sample.

Checkpoint 4.9.1. Calculate confidence interval.

Use this sample information to form a 95% confidence interval for the actual proportion of all humans who are female.
Solution.
Sample: \(n = 100\text{,}\) \(\hat{p} = 26/100 = 0.26\)
\(SE(\hat{p}) = \sqrt{\frac{0.26(0.74)}{100}} = \sqrt{0.001924} \approx 0.0439\)
95% CI: \(0.26 \pm 1.96(0.0439) = 0.26 \pm 0.086 = (0.174, 0.346)\)
Or (17.4%, 34.6%)

Checkpoint 4.9.2. Evaluate reasonableness.

Is this confidence interval a reasonable estimate of the actual proportion of all humans who are female?
Solution.
No, this confidence interval is not a reasonable estimate. We know that approximately 50% of humans are female, which is well outside this interval.

Checkpoint 4.9.3. Explain why procedure fails.

Explain why the confidence interval procedure fails to produce an accurate estimate of the population parameter in this situation.
Solution.
The confidence interval procedure assumes the sample is randomly selected from the population. The U.S. Senate is not a random sample of all humans - it’s a highly biased sample. Women are underrepresented in the Senate compared to the general population. Statistical procedures cannot correct for biased sampling.

Checkpoint 4.9.4. Interval for U.S. Senate.

It clearly does not make sense to use the confidence interval in checkpoint 1 to estimate the proportion of women on Earth or even the U.S., but does the interval make sense for estimating the proportion of women in the 2025 U.S. Senate? Explain your answer.
Solution.
No, it doesn’t make sense to construct a confidence interval for the 2025 U.S. Senate because we have data on the entire population (all 100 senators). We know exactly that 26% of the 2025 U.S. Senate are women - there’s no uncertainty and no need for an interval estimate. Confidence intervals are only useful when estimating an unknown parameter from a sample.

Discussion.

  • First, statistical tests and confidence intervals do not compensate for the problems of a biased sampling procedure. If the sample is collected from the population in a biased manner, the ensuing confidence interval will be a biased, and potentially misleading, estimate of the population parameter of interest.
  • A second important point to remember is that confidence intervals and significance tests use sample statistics to estimate population or process parameters. When the data at hand constitute the entire population of interest (i.e., you have a census), then constructing a confidence interval from these data is meaningless. In this case, you know precisely that the proportion of women in the population of the 2025 U.S. Senators is 0.26 (exactly! no margin-of-error!), so it is senseless to construct a confidence interval from these data.
You have attempted of activities on this page.