Section19.2Investigation 4.3: Left-Handedness and Life Expectancy
In this investigation you will explore which factors impact the size of the p-values/our assessment of statistical significance when comparing two sample means.
Psychologist Stanley Coren has conducted several studies investigating the life expectancy of left-handers compared to right-handers, believing that the stress of being left-handed in a right-handed world leads to earlier deaths among the left-handers.
In one study Coren and Halpern (1991) sent surveys to thousands of next-of-kin of recently deceased southern Californians and asked whether the person had been right-handed or left-handed. They were very careful in how they collected their data. First, they consulted a bereavement counselor who suggested that they not contact anyone unless at least 9 months had passed since the death. The counselor also suggested that they make the contact as gentle as possible and not follow up or press people for responses. The researchers also decided that they would not contact next of kin if the death had been a result of murder or suicide or if the deceased was a child age 6 or younger. They received 987 replies and found that the average age of right-handed people at death was 75 years and for left-handed people it was 66 years.
This is really a single random sample (among those willing to participate). Participants were then asked about their handedness (cross-classified) but because there is no connection between the left and right handers in this study, it is reasonable to consider them two independent random samples.
This is really a single random sample (among those willing to participate). Participants were then asked about their handedness (cross-classified) but because there is no connection between the left and right handers in this study, it is reasonable to consider them two independent random samples.
Even though the researchers did not take a random sample of left-handers and a separate random sample of right-handers, we are still willing to consider these samples as independent because the results for one group should have no effect on the results for the other group. In a situation with two independent random samples, we can apply the two-sample \(t\)-test to make inferences about the difference in the population means.
The summary of the study reported the sample means; what additional information do we need to assess the statistical significance of their difference? Why is this information important?
Let \(\mu_L\) represent the mean lifetime for the population of left-handers and let \(\mu_R\) represent the mean lifetime for the population of right-handers. State the null and alternative hypotheses for Coren and Halpernβs study (based on the research conjecture).
The table below lists some guesses for the sample size breakdown (based on estimates of the proportion of the population that are left-handed) and the sample standard deviations.
6.Comparing Scenarios: Effect of Standard Deviation.
Scenarios 2 and 3 have the same sample means and sample sizes but different standard deviations (SD = 15 vs. SD = 25). Which scenario will produce stronger evidence against the null hypothesis (lower p-value)? Match each scenario to the strength of evidence.
Scenarios 3 and 4 have the same sample means and standard deviations but different sample sizes for left-handers (\(n_L\) = 50 vs. \(n_L\) = 10). Which scenario will produce stronger evidence against the null hypothesis (lower p-value)? Match each scenario to the strength of evidence.
Scenario 3 (\(n_L\) = 50) provides stronger evidence because a larger sample size in the smaller group reduces the standard error, making it easier to detect a difference.
See technology instructions in Investigation 4.2 to carry out the two-sample \(t\)-tests for these hypotheses, using the sample sizes, sample means, and sample standard deviations given in the table above. For each of these five scenarios, report the resulting standardized statistic, p-value, and whether or not the difference is statistically significant with a significance level of \(\alpha = 0.01\) in the table below.
Summarize what your analysis reveals about the effects of the sample size breakdown and the sample standard deviations on the values of the \(t\)-statistic and p-value.
When the sample size for the left handers is larger, we have more evidence against the null hypothesis (larger |t-statistics|, smaller p-values). When the sample standard deviations are larger, we have less evidence against the null hypothesis.
Considering the five scenarios of sample sizes and standard deviations used in the table, which scenario do you think is most realistic for this study of left- and right-handersβ lifetimes? Explain, based on the context for these data.
Probably scenario 1 or 2 as they have more a more realistic percentage of left-handers and the sample standard deviations (15) are more reasonable (the others are too large if we are expecting about 32% of data values to fall more than one standard deviation above or below the mean. We probably arenβt expecting a normal distribution, but these standard deviations still feel too large).
Based on your analyses and how these data were collected, are you convinced that being left-handed causes individuals to die sooner on average than being right-handed? How should you phrase your conclusion?
For even the remotely realistic scenarios, the p-values were quite small indicating statistical significance. However, we canβt draw any "cause-and-effect" conclusions as this was an observational study.
Part of the motivation for this research was an earlier study by Porac and Coren (1981) that surveyed 5147 men and women of all ages in North America. They found that 15% of ten-year-olds were left-handed, compared to only 5% of fifty-year-olds and less than 1% of eighty-year-olds. At the age of 85, right-handers outnumbered left-handers by a margin of 200 to 1. Suggest another explanation for these puzzling findings that actually provides a counter-argument to the conclusion from the 1991 Coren and Halpern study that left-handers tend to live shorter lives than right-handers.
For those who would be in their eighties in 1981, many of them would have been encouraged to not be left-handed when they were younger. This would explain why there were fewer left-handers in the older age groups.
The difference in the sample mean lifetimes does appear to be statistically significant for all reasonable choices of the sample sizes and the sample standard deviations, so we have strong evidence that left-handers do tend to have shorter life spans than right-handers. However, there are numerous cautions to heed when drawing conclusions from such a study. This was a retrospective study with voluntary response, and in fact the researchers reported that they tended to hear from the left-handed relatives more often than right-handers, and they only heard from fewer than half of the families contacted. There is also no information given about the proportion of left-handers in the two southern California counties studied or the average ages of their residents now. In particular, one explanation for the lower percentage of left-handers among the elderly is not that they have died younger but that it used to be quite common practice to strongly encourage left-handed children to switch to being right-handed. Nowadays, that is less common (in fact many athletes love having this advantage!), and so now there is a higher percentage of left-handers among younger age groups. This helps to explain why the left-handers who had died would tend to be younger. In other words, maybe the average age difference between living left-handers and right-handers is also nine years.
These studies have actually become hot topics for debate as some other studies have not been able to replicate Coren and Halprenβs results. Other prospective longitudinal studies in the United States (Marks and Williams, 1991; Wolf, DβAgostino, and Cobb, 1991) have not found a significant difference in age at death. Still others have found connections between handedness and accident rates, lower birth rates, cancer, alcohol misuse, smoking, and schizophrenia. Alas, we donβt see any randomized, comparative experiments being conducted to answer these questions in the near future!
You should have found that larger variability in each sample (so a larger SE) produces a larger p-value and therefore less convincing evidence that the population means differ. Or to put this in a more positive light: Reducing variability within groups makes it easier to distinguish between the groups. You should also have found that a bigger discrepancy in sample sizes between the two groups produces a larger p-value and therefore less convincing evidence that the population means differ.
Recall the study on childrenβs television viewing habits for two schools in San Jose, CA from Practice Problem 2.6B (Robinson, 1999). The researcher wanted to study whether a new classroom curriculum could reduce childrenβs television viewing habits, which might in turn help to prevent obesity. One of the schools, chosen at random, incorporated an 18-lesson, 6-month classroom curriculum designed to reduce watching television and playing video games, whereas the other school made no changes to its curriculum. Both before the curriculum intervention, all children were asked to report how many hours per week they spent on these activities.
The researchers are happy about the large p-value at baseline because it indicates there is no significant difference between the control and intervention groups at the start of the study, which is what they want.
Without performing any calculations, how should the p-value for these data compare (larger or smaller) to the p-value from the actual baseline results? Explain.
Checkpoint19.2.6.Subsetting by Socio-Economic Class.
Suppose the researchers decide to look at a subset of children in this study that belong to the same social-economic class (with the expectation that their television watching habits will be more similar to each other). Discuss one advantage and one disadvantage to this approach in terms of detecting a difference between the control group and the intervention group at the conclusion of the study. [Hints: Relate to your answer in the previous question?]
Advantage: By reducing the variability within groups (smaller SDs), it would be easier to detect a difference between groups at the conclusion. Disadvantage: By subsetting, the sample sizes would be smaller, making it harder to detect a difference.