Today we will…
Q1: Who collected these data? When? Why? How?
Q4: Column titles of 2008 and 2018 are not descriptive!
names_prefix = argument to pivot_wider() can help you make better column names!Q4: Unless you specify .groups = "drop" within summarize() your table still is grouped!
group_by() + summarize() only drops the first group.group_by(), then the data will still be grouped by the second variable!Q7: The data description contains important information!
mc_toddler – Aggregated weekly, full-time median price charged for Center-based Care for toddlers.mhi_2018 – Median household income expressed in 2018 dollars.PE-4: I can use modern tools when carrying out my analysis.
recode() from the dplyr package has been superseded for the case_match() function.
case_match() is the SQL cousin of case_when()factor()!I strongly recommend against nested functions, as they are difficult for people to understand what your code is doing. Having two lines is not less efficient and is more readable.
I strongly recommend against nested functions, as they are difficult for people to understand what your code is doing. Having two lines is not less efficient and is more readable.
DVS-2: I use plot modifications to make my visualizations clearer to the reader
DVS-3: I show creativity in my visualizations
DVS-3: I show creativity in my visualizations
theme_bw(), but you might like others!Due to the time it is taking me to give feedback for the lab and challenge assignments, I need to find a new plan for revisions.
New Plan:
A process of tracking changes to a file or set of files over time so that you can recall specific versions later.


GitHub makes it easy to have multiple people working on the same files at the same time.
You can host a URL of fun things (like the class text, these slides, the course website, etc.) with GitHub pages.
You were asked to complete the following steps before coming to class today:
Using git and GitHub in RStudio
I’m going to guide you through how to interact with git and GitHub through RStudio. This is not the only way to do this! If you are comfortable with version control, feel free to use the tool that you like best.
Git is based on repositories.

Make your own copy of a project.

Navigate the the Final Portfolio repository linked on Canvas.
Create your own fork of that repository.
Download the project onto your computer.

Navigate to your fork of the Final Project Portfolio repository – the one you own!
Follow the instructions posted on Canvas to clone that repository into RStudio.
Record any changes you’ve made.

Dr. Theobold will live code this process!
Send your commit to the remote repository so others can see and access them.

Dr. Theobold will live code this process!
Download the latest changes from the remote repository into your local copy.
Only relevant if you are working on two computers!
If this is you, keeping both local versions up to date will be important!
