Lab 6 Grading Guide

Model Selection Order

Note

Based on my code, the model selection process should go:

cls_credits \(\rightarrow\) bty_avg \(\rightarrow\) gender

In Step 4 they are given two variables (cls_perc_eval, eval_completion) which both increase adjusted \(R^2\) by 0.018, not the 0.02 which is required to add another variable. So, everyone should stop at Step 4.

Question 1

To earn a Success:

fits other 7 models (rank, pic_outfit, pic_color, large_class, eval_completion, cls_level)
finds other 7 adjusted \(R^2\) values
states top model includes cls_credits variable

If they do not state the top model or what variable was chosen:

Careful! Technically, you told me the name of the object that contains the model you decided was best. I want you to tell me the name of the variable(s) included in that model!

Question 2

To earn a Success:

fits 13 models that include the variable chosen in #1 as an explanatory variable
finds 13 adjusted \(R^2\) values
states top model includes cls_credits & bty_avg

Indicating Multiple Variables

The response needs to indicate in some way that there are two variables included in the model. This can be by saying bty_avg was “added” to the previous model or by stating both variables included in the model.

If they do not indicate that there are two variables in the model (e.g., “beauty average was chosen” or “beauty score was included”):

Careful! Your model contains more variables than just bty_avg. Be specific about naming every variable included in your top model!

Question 3

To earn a Success:

modifies code to fit models that include the variables chosen in #1 and #2 as an explanatory variables
states top model includes all the variables they’ve chosen (cls_credits, bty_avg, gender)

If they don’t include a variable from #1 or #2 as an explanatory variable

Careful! Thus far, you’ve chosen two variables to be included in your model. So, your code (the map() function and select() function) need to have both these variables included, not just one!

Indicating Multiple Variables

If they do not indicate that there are three variables in the model (e.g., “gender was included in the model” or “gender was chosen”):

Careful! You were specifically asked to name EVERY variable included in your top model.

Question 4

To earn a Success:

modifies code to fit models that include the variables chosen in #1, #2, and #3 as an explanatory variables
state the top model is the model from Step 3 (cls_credits, bty_avg, gender)

If they don’t include a variable from #1, #2, or #3 as an explanatory variable

Careful! Thus far, you’ve chosen three variables to be included in your model. So, your code (the map() function and select() function) need to have both these variables included, not just one!

If they choose to add either cls_perc_eval or eval_completion:

Careful! Look back at the adjusted R-squareds for these variables. Did they increase 2% (0.02) from your last adjusted R-squared?

Question 6

To earn a Success:

states that the step() function did not choose the same model
state that this difference was caused by different criteria (or the use of AIC)

If they don’t talk about the step() function using a different model comparison criteria:

Look back at the output from the step() function, specifically the first line of the output. Based on this line, do you think the step() function is also using adjusted R-squared to decide what model is best? If not, what criteria do you think the step() function is using?