Week 9: Inference for Many Means (ANOVA)
Welcome!
In this week’s coursework we are transitioning into our last topic of the quarter—comparing multiple means. These comparisons have a specific name, ANalysis of VAriance (ANOVA).
We are going to use all of the simulation-based methods we learned previously for this new context. An ANOVA relies on a new statistic, the \(F\)-statistic, to summarize how different 3 or more means are from each other.
The primary focus of an ANOVA is to detect if the means of 3 or more groups are different. Because of this we focus on hypothesis tests for a difference in the group means and do not use confidence intervals. We will generate permutation distributions of \(F\)-statistics that we could have observed if the null hypothesis was true (there is no difference in the group means).
0.1 Learning Outcomes
By the end of this coursework you should be able to:
- describe what an ANOVA tests for
- use a visualization to outline how the following are calculated:
- total sum of squares
- group sum of squares
- residual sum of squares
- describe why the mean squares of groups is called the “between group variability”
- describe why the mean square error is called the “within group variability”
- outline how an F-statistic is calculated
- explain what a “large” or a “small” F-statistic indicates
- describe the conditions for performing an ANOVA procedure
- outline when it is appropriate to use an \(F\)-distribution in an ANOVA
- explain the similarities and differences between parametric (\(F\)-based) methods and non-parametric (simulation-based) methods
- use R to:
- generate a permutation distribution for F-statistics
- visualize the permutation distribution
- calculate the observed F-statistic statistic
- calculate a p-value for a hypothesis test
1 Textbook Reading
First, we are going to learn about one-way ANOVA!
Required Reading: Inference for Comparing Many Means
Next, we’re going to do a refresher on experimental design.
Required Reading: Principles of Experimental Design
There is no reading guide for this chapter, just a concept quiz that is due on Wednesday!
2 Concept Quizzes
We have two concept quizzes this week, one that is due by Tuesday (on one-way ANOVA) and one that is due by Wednesday (on experimental design).
2.1 Due Tuesday by the start of class
- Which of the following are true about the mean squares between groups? (Select all that apply)
- it is a standardized measure of the variability in responses between groups
- it compares the mean of each group to the overall mean across all groups
- it compares the observations within each group to the mean of that group
- it is used as the numerator in an F-statistic
- it is used as the denominator in an F-statistic
- it is found by dividing the sum of squares between groups by the number of groups minus 1 (\(k\) - 1)
- it is found by dividing the sum of squares between groups by the sample size minus the number of groups (\(n - k\))
- Which of the following are true about the mean square errors? (Select all that apply)
- it is a standardized measure of the variability in responses within each group
- it compares the mean of each group to the overall mean across all groups
- it compares the observations within each group to the mean of that group
- it is used as the numerator in an F-statistic
- it is used as the denominator in an F-statistic
- it is found by dividing the sum of square errors by the number of groups minus 1 (\(k\) - 1)
- it is found by dividing the sum of square errors by the sample size minus the number of groups (\(n - k\))
- An F-statistic uses which formula?
\(\frac{MSG}{MSE}\)
\(\frac{SSG}{SSE}\)
\(\frac{MSE}{MSG}\)
\(\frac{SSE}{SSG}\)
- Ideally, in an ANOVA we’d like to see… (select all that apply)
- large differences in the means of the groups
- small differences in the means of the groups
- large variability in the observations within each group
- small variability in the observations within each group
- If the null hypothesis that the means of four groups are all the same is rejected using ANOVA at a 5% significance level, then… (Select all that apply)
- we can then conclude that all the means are different from one another.
- the variability between groups is higher than the variability within groups.
- we can then conclude that at least one mean is different from the others.
- the pairwise analysis will identify at least one pair of means that are significantly different.
True or false, as the total sample size increases, the degrees of freedom for the residuals increases.
True or false, the constant variance condition is very important when the sample sizes of each group are different.
True or false, the independence assumption can be relaxed when the total sample size is large.
True or false, the normality condition is very important when the sample sizes of each group are small.
2.2 Due Wednesday by the start of class
1. Which of the following statements are true about observational studies and experiments? (Select all that apply)
- Experiments randomly assign the explanatory variable
- Observational studies randomly assign the explanatory variable
- Observational studies can make causal statements about the relationship between the explanatory and response variables
- Experiments can make causal statements about the relationship between the explanatory and response variables
2. Suppose we wanted to compare cattle weight gain on four different dietary supplements to determine which is the most effective. We conducted an experiment in which groups of eight cows are given a particular supplement for one month. A fifth group serves as the control group—they do not receive any supplements. At the end of the experiment, we measure how much weight each cow has gained over the month.
Experimental Units
Treatments
Control Group
Factor Levels
Eight Cows
Cows
No Supplement
Four Supplement Types & No Supplement
Dietary Supplements
3. Below are two brief descriptions of experimental designs, one represents replication, and the other represents pseudoreplication. Match each description to the correct design.
Pseudoreplication
Replication
A researcher applies a fertilizer treatment to five separate pots, each containing a different plant, and compare their growth to five control pots without fertilizer.
A researcher applies a fertilizer treatment to one large pot containing five plants and compares their growth to five plants in a single control pot without fertilizer.
4. Randomization means that every experimental unit has equal probability of being assigned to a treatment.
- True
- False
5. A good variable to include as a blocking variable would have [small / large] variation within a block, and [small / large] variation between blocks.
6. The table below summarizes the number of channel types (unittype) in each section of the HJ Andrews Experimental Forest. Based on this table, it would be reasonable to block on the section of the forest (CC, OG).
unittype | CC | OG |
---|---|---|
C | 10199 | 8917 |
P | 3339 | 4076 |
R | 97 | 395 |
S | 11 | 1 |
SC | 2461 | 1925 |
I | 0 | 23 |
IP | 0 | 155 |
7. The use of a placebo pill in an experiment does which of the following? (Select all that apply)
- Acts as a procedural control
- Acts as a blocking variable
- Allows researchers to measure the changes in patients resulting from simply being treated.
- Fools the participants so they don’t know what treatment they received.
- Gives researchers a baseline “no effect” comparison.