library(openintro)
Lab 3 Revisions are due today (May 1)
Second round of revisions for Lab 2 are due today (May 1)
Statistical Critique revisions are due next Wednesday (May 8)
Lab 4 Revisions are due next Wednesday (May 8)
Making a copy of your group’s Lab 4
If you were the recorder (typer) for your group, you need to make your project public. If you were not the recorder, you need to make a copy of your group’s project.
The first draft of your Midterm Project is due on Sunday at midnight.
Deadline Extension
A deadline extension is permitted for the first draft. Deadline extensions are not permitted for the final version (due next week).
The description of your data goes in your Introduction.
The description of your variables goes at the beginning of your Methods, in the Variables subsection!
Be cautious in how you are using the resources I provided—do not copy these descriptions.
Inserting a verbatim copy of the descriptions seen in the data resources is plagiarism.
In text citation
If you wish to borrow elements of these descriptions, you need to quote them and provide a reference to the resource. e.g., “This data set has been of interest to medical researchers who are studying the relation between habits and practices of expectant mothers and the birth of their children” (United States Department of Health and Human Services, 2014).
For the and_vertebrates
data, you should include species
as an explanatory variable. If you don’t you are assuming the same relationship applies to trout AND salamanders.
For the hbr_maples
data, you cannot use year
as a numerical variable. There are only two years of data!
For the pie_crabs
data:
site
and latitude
measure the same thingLocate what package your data live in (found in the directions for the midterm project proposal)
Load in the package you need!
Get started!
moderndive Package
We will be using the moderndive package to get our regression tables, so do not remove this package from your project!
Two Numerical Variables
Three total visualizations
color
gradientOne Categorical & One Numerical Variable
Two total visualizations
geom_smooth(method = "lm")
geom_parallel_slopes()
Two Numerical Variables
If there appears to be a relationship with the colors – include both variables!
If the colors are equally dispersed throughout the plot – choose the one variable that has the stronger relationship (larger slope)!
One Categorical & One Numerical Variable
Look at the plot where the lines are allowed to be different! Does it look like they are?
If the lines look different – you should use the different slopes (interaction) model!
If the lines look similar – you should use the parallel slopes (additive) model!
lm()
Two Numerical Variables
One Categorical & One Numerical Variable
*
to separate the variables!+
to separate the variables!get_regression_table()
Now interpret!
Comments from Project Proposals