Skip to main content

Section 3.2 Investigation 1.2: Can Wolves Understand Human Cues?

In this investigation, you will explore using a mathematical model to evaluate the results from a random process.

Exercises 3.2.1 The Study

Previous research has demonstrated that domesticated dogs can be trained to understand human cues, such as looking or pointing at an object. But it was not clear how the animals would perform with "behavioral cues" showing intent such as reaching for, but not obtaining, or trying to open an object.
Lampe et al. (2017) gave captive wolves, pack dogs living in identical conditions to the wolves, and pet dogs living with families a series of "object-choice tasks." A table was placed outside a fenced compartment, and a container was placed at each end of the table, one containing food and one empty. The experimenter would give the cue as to which container had the food and then the animal would touch one of the two targets next to the containers. A 6-year-old female timber wolf, Yukon, chose the intended container in 6 of the 8 trials with behavioral cues.

1. Identify Sample and Process.

Identify the sample/random process for this study.
Hint.
What is the sample size?
Solution.
The random process is the repeated trials given to Yukon (\(n = 8\)).

2. Identify the Variable.

Identify the variable of interest for this process.
Hint.
The variable is the measurements recorded for each observation.
Solution.
The variable is whether or not Yukon picks the cued container.

3. Classify Variable Type.

Is this variable quantitative or categorical?
Hint.
Think about whether the data is a number or a category.
  • Categorical
  • Correct! The variable records which category (cued or not cued) Yukon chose, making it a categorical variable.
  • Quantitative
  • Not quite. A quantitative variable would be a numerical measurement. Here we’re recording which container was chosen, which is a category.
Solution.
The variable is categorical.

4. Expected Success Rate.

If Yukon was purely guessing, how often would you expect her to identify the correct container?
  • about 50% of the time
  • Correct! With two equally likely options, we would expect Yukon to choose correctly about half the time by chance.
  • about 33% of the time
  • This would be the probability if there were three equally likely options.
  • about 75% of the time
  • This would mean Yukon has some understanding of the cues, not just guessing.
  • about 25% of the time
  • This would be the probability if there were four equally likely options.
  • never
  • Even when guessing, there’s a chance of being correct.
We will think of the sample of 8 attempts as identical observations from a random process, and we are choosing between two possibilities:
  1. Yukon can understand behavioral cues,
  2. Yukon does not consistently understand behavioral cues and is guessing randomly.

Definition: Hypotheses.

In drawing conclusions beyond our sample data to the underlying random process, we will often be choosing between two competing claims about the underlying process:
  • The null hypothesis, which is the "by chance alone" explanation;
  • The alternative hypothesis, which is usually what the researchers are hoping to show.
In Investigation 1.1, the null hypothesis was that infants (in general) choose equally among the two toys in the long run. The alternative hypothesis was that infants have a genuine preference for the helper toy.

Aside: How to use matching questions.

5. State Hypotheses.

Solution.
Null: Yukon does not consistently understand behavioral cues and is guessing randomly between the 2 containers.
Alternative: Yukon can understand behavioral cues.

6. Plan the Simulation.

So our simulation model is going to assume the null hypothesis is true and we will see how unusual it is to find 6 successes and 2 failures in 8 attempts. Can we use "coin tossing" again? How many times will we toss the coin?
Solution.
Yes, we can use coin tosses again with heads = Yukon correct and tails = Yukon incorrect. We want to use 8 tosses for each repetition and count how many heads we have.

Carry out the simulation.

Use the One Proportion Inference applet to carry out 1,000 repetitions of the simulation. Check the Summary Statistics box and report the mean and standard deviation of your simulated distribution of "could have been" outcomes.

7. Summarize your results.

Mean: Standard deviation:
Solution.
Example results:
Possible applet results shows dot plot of Number of heads with mean 3.988 and standard deviation 1.464
The mean number of heads is about 4 (half of 8) and the standard deviation of the number of heads is about 1.5.

8. Evaluate Surprise Level.

Based on your simulation results, which assumes Yukon is guessing, should we be very surprised to see 6 correct guesses? Explain/support your reasoning.
Hint.
Is the observation near the middle/bulk of the distribution or out in the edges/tails of the distribution?
Solution.
An outcome of 6 heads does not appear to be in the tail of the distribution and so does not appear to be an unusual outcome when the null hypothesis is true.

Measuring "Rareness".

In Investigation 1.1, the observed result (14) was pretty far in the tail of the distribution and you probably agreed that we could consider it unusual. This result (6 out of 8) is not as inconsistent with the null hypothesis; so where do we "draw the line"? First, we need agree on a measure of "rareness." We could just calculate the theoretical probability of 6 heads in 8 tosses (0.1094), but keep in mind that if we have a larger sample size (e.g., 100 tosses), then any individual outcome (e.g., 62 heads) will have a small probability. So we want to judge how extreme our observation is relative to the other observations in the simulated distribution. One way to do this is to count how many outcomes are as or even more extreme (even further from the expected value) than the observed outcome. For example, if we tell you that only 1% of rattlesnakes are longer than 2.5 meters, then you know to be very surprised to see a 3-meter rattlesnake and you may even begin to think that what you are looking at is not a rattlesnake at all!
  • In the applet, enter 6 in the Count Samples box and keep the "as extreme as" button to greater than or equal to, \(\geq\text{.}\)
9. Calculate p-value.
Report the proportion of the repetitions with 6 or more heads (correct choices).
In 1,000 repetitions, assuming the probability of choosing correctly is 0.50, we found % of repetitions of 8 attempts to result in 6 or more successes.
Definitions: Null Distribution and p-value.
Because the simulation assumes the null hypothesis to be true, we can refer to the distribution you simulated as the null distribution. When you determine the proportion of the results in the null distribution that are at least as extreme as the observed result, you are estimating a probability. We will refer to this probability as the p-value. We use the p-value to evaluate the strength of evidence against the null hypothesis.
The smaller the p-value, the stronger the evidence against the null hypothesis. There are no hard-and-fast cut-off values for gauging the smallness of a p-value, but generally speaking:
  • A p-value above 0.10 constitutes little or no evidence against the null hypothesis.
  • A p-value below 0.10 but above 0.05 constitutes moderate evidence against the null hypothesis.
  • A p-value below 0.05 but above 0.01 constitutes strong evidence against the null hypothesis.
  • A p-value below 0.01 constitutes very strong evidence against the null hypothesis.
10. Compare p-values.
Did everyone in your class obtain the same p-value? (If not, are you surprised?)
  • Yes
  • Actually, because we’re using simulation, each person’s results will vary slightly.
  • No
  • Correct! The p-value will vary slightly across simulations because of the random nature of the process.
Solution.
We expect the p-value to vary slightly across the simulations because of the random nature of the process
Because we are assuming a random process, we can use probability rules to calculate an "exact" value for the p-value. In fact, we have been assuming a very special random process, a binomial process.

Definition: Binomial Random Variable.

A Binomial random variable counts the number of successes in a random process with the following properties:
  • Each trial results in either "success" or "failure" (we have a binary categorical variable).
  • The trials are independent: The outcome of one trial does not change the probability of success on the next trial.
  • The probability of success, \(\pi\text{,}\) is constant across the trials.
  • There are a fixed number of trials, \(n\text{.}\)
If the number of successes, \(X\text{,}\) is a binomial random variable, then we can say \(X \sim \text{Binomial}(n, \pi)\text{.}\)
Because we are assuming the null hypothesis to be true, we are treating each attempt as an identical repetition of a random process with a probability of success of 0.50 for each attempt (always two containers to choose from), and one attempt outcome does not impact the probability of success on the next attempt (e.g., a screen was lowered between attempts and the location of the food randomized each time) so the attempts are independent. We are counting \(X\text{,}\) the observed number of correct choices in the \(n = 8\) trials. So \(X \sim\) (is distributed as) \(\text{Binomial}(8, 0.50)\) and we want to determine the probability of observing 6 or more successes in 8 attempts by chance alone, \(P(X \geq 6)\text{.}\) We will illustrate this calculation next, and then turn to technology to find binomial probabilities.

Calculating Binomial Probabilities.

11. Calculate Single Outcome Probability.
Let S represent Success and F represent Failure. Consider one particular outcome for the 8 trials with 6 successes: SSFSSSFS. Because we are assuming the trials are independent, we can find the probability of this event by multiplying together the probabilities of the individual outcomes in the event.
\begin{align*} P(\text{SSFSSSFS}) \amp = P(S) \times P(S) \times P(F) \times P(S)\\ \amp \quad \times P(S) \times P(S) \times P(F) \times P(S) \end{align*}
What is the calculated product?
Solution.
\((0.5)^8 = 0.00390625\)
12. Count Arrangements.
But there are other possible outcomes that would also result in 6 successes and 2 failures (for example, SFSSSSSF). How many ways are there to arrange the 6 successes among the 8 slots?
This is where the binomial coefficient comes in handy. This binomial coefficient, denoted by \(C(n, k)\) or \(\binom{n}{k} = \frac{n!}{k!(n-k)!}\text{,}\) counts the number of ways there are to select \(k\) items from a set of \(n\) items. In this case, we want to find the number of ways to select 6 spots from the 8 trials for the successes:
\begin{equation*} C(8, 6) = \frac{8!}{6!2!} \end{equation*}
What is the calculated result?
Solution.
13. Calculate Probability for 6 Successes.
Because the 28 outcomes corresponding to 6 successes are mutually exclusive (can’t occur simultaneously), we can find the probability of at least one of them happening by adding all of the probabilities together:
\begin{equation*} P(\text{SSFSSSFS}) + P(\text{SFSSSSSF}) + \ldots = 28 \times (0.5^8) \end{equation*}
What is the calculated product?
In general, the binomial probability of obtaining \(k\) successes in a sequence of \(n\) independent trials with success probability \(\pi\) on each trial is:
\begin{equation*} P(X = k) = \binom{n}{k} \pi^k (1-\pi)^{n-k} \end{equation*}
where \(\binom{n}{k} = \frac{n!}{k!(n-k)!}\)
Solution.
14. Calculate Complete p-value.
Is the number you found in Question 13 our p-value for this study? If not, how would we find the p-value?
Solution.
No, so far we have found P(X = 6), but the p-value is defined as P(X > 6).
15. Confirm with technology.
Report the binomial (or "exact") p-value from the applet.
Exact p-value =
Solution.
0.1445. This should be similar to the simulation results and everyone would obtain the same calculated value.
16. Interpret p-value.
Write a one-sentence interpretation of this p-value in this context. Key components to include: what random process is being repeated, what is the null hypothesis that we are assuming to be true, what is the observed result we are considering, which outcomes are we considering to be more extreme?
Solution.
If we were to repeatedly examine samples of 8 attempts from wolves who do not understand the communication cues, we would expect 6 or more wolves to pick the correct container for about 14% of those samples by chance alone.

17. State Conclusion.

Based on this analysis, do these data provide convincing evidence that wolves can understand behavioral cues?
Solution.
No, a p-value of 0.14 is not strong evidence against the null hypothesis. It is plausible that Yukon would do as well in 8 trials even if she didn’t understand the cue and was simply guessing.

Study Conclusions.

If we assume Yukon is guessing between the two containers, regardless of the cue, then finding 6 or more successes in 8 trials is actually not very surprising (binomial p-value = 0.1445); about 14% of samples from a 50/50 process would show 6 or more successes in 8 trials. This does not provide convincing evidence that Yukon can perform "above chance," but there is not information about whether other wolves or other cues would show better performance.

Technology Detour – Calculating Binomial Probabilities.

18. Calculate Binomial Probabilities with Technology.
Use technology (R or JMP) to calculate the exact binomial probability for the Friend or Foe study (14 successes in 16 trials with probability 0.5). Choose one set of instructions below by clicking on a hint. Then adapt the instructions for the Wolf study.
Enter the probability you find.
Hint 1. Calculating Binomial Probabilities in R
Make sure you have the ISCAM Workspace loaded.
The iscambinomprob function takes the following inputs:
  • k = the observed value you want to calculate the probability about
  • n = the sample size of the binomial distribution
  • prob = the probability of success in the binomial distribution
  • lower.tail = TRUE if you want the probability less than or equal to the observed value, FALSE if you want the probability greater than or equal to the observed value
For the Friend or Foe study, run the following code:
You should see both the probability appear in the output and a shaded binomial distribution graph.
R Reminder: Yes, "FALSE" needs to be capitalized. The lower.tail=FALSE parameter gives you \(P(X \geq k)\text{,}\) while lower.tail=TRUE gives you \(P(X \leq k)\text{.}\)
Hint 2. JMP Instructions
Using the Distribution Calculator in the ISCAM Journal File (Download this file to your computer and then open the .jrn file to launch JMP.)
JMP Distribution Calculator input screen for binomial probabilities
JMP Distribution Calculator output screen showing binomial distribution
JMP Reminder In JMP’s Distribution Calculator, you need to subtract 1 from your observed value when using the \(\geq\) option because JMP calculates \(P(X > Q_a)\) as \(P(X \geq Q_a + 1)\text{.}\) So to get \(P(X \geq 14)\text{,}\) you enter 13 as \(Q_a\text{.}\)
Solution.
For the wolf study with 6 successes out of 8 trials, the binomial p-value should be 0.1445 as found in the applet.

Subsection 3.2.2 Practice Problem 1.2A

When Yukon was given "communicative cues" (looking at the container, pointing to the container) instead of behavioral cues, she was correct in 7 out of 8 attempts. Does this outcome provide convincing evidence that wolves understand communicative cues?

Checkpoint 3.2.11. State Hypotheses.

Checkpoint 3.2.12. Calculate and Interpret p-value.

Report both a simulation-based and the exact p-value. Include a one-sentence interpretation of the p-value in this context.
Simulation-based p-value:
Exact p-value:
Interpretation: The probability of assuming .

Checkpoint 3.2.13. Summarize Analysis and Conclusions.

Summarize your analysis and the conclusions you would draw from this study.

Checkpoint 3.2.14. Evaluate Pooled Data Analysis.

Across the 12 wolves in the study, in the 64 trials with communicative cues, the wolves understood the cue 41 times. What is the new p-value for these results? Explain why this would not be an appropriate analysis.

Subsection 3.2.3 Practice Problem 1.2B

A study in Psychonomic Bulletin and Review (Lea, Thomas, Lamkin, & Bell, 2007) presented evidence that "people use facial prototypes when they encounter different names." Similar to one of the experiments they conducted, you will be asked to match photos of two faces to the names Tim and Bob. The researchers wrote that their participants "overwhelmingly agreed" on which face belonged to Tim. You will conduct a similar study in class to see whether your class also agrees with which face is Tim’s more than you would expect from random chance (here "random chance" = there is no facial prototyping and people pick a name for the face on the left at random).

Checkpoint 3.2.15. Identify Process and Variable.

What is the sample/random process and variable in this study? What assumptions are we making about this process?

Checkpoint 3.2.16. State Hypotheses.

Checkpoint 3.2.17. Report Class Results.

Report the count, proportion, and percentage "correctly" identifying Tim’s face in your class.

Aside: One Proportion Inference applet.

Checkpoint 3.2.18. Calculate p-value and Conclude.

Use the applet or the software to obtain a binomial p-value. What do you conclude about the null hypothesis?

Checkpoint 3.2.19. Verify Binomial Assumptions.

Do you think the properties of the binomial random variable are met here? What assumptions need to be made?
You have attempted of activities on this page.