Example 3.1: Wording of Questions

Section 16.1 Example 3.1: Wording of Questions

Try these questions yourself before you use the solutions following to check your answers.

Researchers have conjectured that the use of the words "forbid" and "allow" can affect people’s responses to survey questions. Students in an introductory statistics class were randomly assigned to answer one of the following questions:

🔗

Should your college allow speeches on campus that might incite violence?

🔗
Should your college forbid speeches on campus that might incite violence?

🔗

🔗

Of the 11 students who received the first question, 8 responded yes. Of the 14 students who received the second question, 12 said no.

🔗

ISCAM data files and applets page

🔗

Checkpoint 16.1.1. Identify Study Components.

Identify the observational units and the explanatory and response variables.

🔗

Solution.

The observational units in this study are the statistics students. The explanatory variable is the word choice in the question they responded to (categorical) and the response variable is whether their response was in favor of the speeches ("yes" with the allow question and "no" with the forbid question, categorical).

🔗

Checkpoint 16.1.2. Study Type and Design.

Is this an observational study or an experiment? If an observational study, suggest a potential confounding variable. If an experiment, explain the roles of randomization and blinding in this study.

🔗

Solution.

This was an experiment because the word choice in the question was randomly assigned to the students. Presumably the instructor mixed up the order of the questionnaires prior to passing them out to the students. This is important to equalize other variables between these two groups such as political inclinations and gender. The students did not know that there were two different forms of the questions, so the study was blind. If they had realized that the instructor was focusing on how they responded to the two words, they probably would have responded differently eliminating any subconscious effect of the word choice.

🔗

Checkpoint 16.1.3. Construct Two-Way Table.

Construct a two-way table to summarize these results.

🔗

Solution.

Two-way table, with the explanatory variable, word choice, as the column variable, and defining a "success" to mean that the student is in favor of such speeches:

🔗

Response	Allow	Forbid	Total
Success	8	12	20
Failures	3	2	5
Total	11	14	25

🔗

Checkpoint 16.1.4. Construct Segmented Bar Graph.

Construct a segmented bar graph to display these results and comment on the relationship revealed by this graph.

🔗

Solution.

We see that most of these students were in favor of the speeches (80%). There was a slight tendency for those responding to the forbid question to appear more in favor (more likely to say no, do not forbid the speeches), 0.857 versus 0.727. However, the distribution within the bars look fairly similar and the association does not appear to be strong.

🔗

Segmented bar graph showing response distributions by question wording

🔗

Checkpoint 16.1.5. Test of Significance.

Based on earlier studies, researchers expected people to be less likely to agree to "forbid" the speeches, leading to more no responses (and thus appearing to be in favor of having the speeches), whereas they expected people to be comparatively less likely to agree to "allow" the speeches. Do these data provide strong evidence that these students responded more positively toward having such speeches if their question was phrased in terms of "forbid" rather than "allow"? Carry out a test of significance and explain the decision you would make based on the p-value. Write a paragraph summarizing your conclusions including whether a cause-and-effect conclusion can be drawn and the population you are willing to generalize these results to.

🔗

Solution.

The null hypothesis will be that there is no effect due to wording of the question. So if \(\pi_{allow}\) is the probability someone says "yes" to the allow question and if \(\pi_{forbid}\) is the probability someone says no to the forbid question:

🔗

\(H_0: \pi_{allow}\) - \(\pi_{forbid}\) = 0

🔗

and based on the prior research:

🔗

\(H_a: \pi_{allow}\) - \(\pi_{forbid}\) < 0

🔗

Fisher’s Exact Test indicates how often we expect to see as few as 8 or fewer successes in the allow group (equivalently, at least as many as 12 successes in the forbid group). So if we define X to be the number in the allow group in favor of the speeches, X follows a hypergeometric distribution with N = 25, M = 20, and n = 11. The p-value will be P(X ≤ 8).

🔗

P(X ≤ 8) = P(X = 0) + P(X = 1) + ... + P(X = 8)

🔗

= [C(20,0)×C(5,11) + C(20,1)×C(5,10) + ... + C(20,8)×C(5,3)] / C(25,11)

🔗

= 0.3783

🔗

Simulation distribution showing p-value for Fisher’s Exact Test

Note, it would NOT be valid to use the normal approximation here because we do not have at least 5 failures in both groups (normal approx. p-value equals 0.2149, with continuity correction equals 0.384).

🔗

Thus, if the word choice in the question had no effect, we would get experimental results at least this extreme in about 38% of random assignments. This indicates that our experimental data is not surprising and does not provide evidence that the wording of the question had an effect on these students. Thus, we will not say the word choice in the question made a difference (though if the p-value had been small, because this was a randomized experiment, a cause-and-effect conclusion would have been valid). Furthermore, we should be hesitant in generalizing these results beyond introductory statistics students at this school. We do not know how these students were selected from this school nor whether these students might be representative of other college students. Perhaps this is a private school or the students tend to be more liberal than at other schools.

🔗

R Output:

🔗

JMP Output:

🔗

Checkpoint 16.1.6. Compare with Larger Study.

In a 1976 study, one group of subjects was asked, "Do you think the United States should forbid public speeches in favor of communism?", whereas another group was asked, "Do you think the United States should allow public speeches in favor of communism?" Of the 409 subjects randomly asked the "forbid" version, 161 favored the forbidding of communist speeches. Of the 432 subjects asked the "allow" version, 189 favored allowing the speeches. Construct a segmented bar graph for these data and comment on whether you believe the p-value for this table will be larger or smaller than that in Question 5. Explain your reasoning.

🔗

Solution.

In this larger study, individuals were less likely to be in favor of the speeches in general (52%). More importantly, the difference between the two groups is a bit larger (0.606 - 0.438 = 0.169 in this study, compared to a difference of 0.130 before), providing stronger evidence that such a difference would not happen due to chance alone and thus a smaller p-value.

🔗

Segmented bar graph for the larger 1976 study

We must also take into account the fact that the group sizes are much, much larger in this study. With such large sample sizes, even small differences between the groups may appear statistically significant. Thus, with the larger samples and the slightly larger difference in success proportions, the p-value from this study will be much smaller than that from the original study.

🔗

Watch video walkthrough of this example.

🔗

You have attempted of activities on this page.

🔗

Prev Top Next