Predictive Modeling with Pipelines in tidymodels and scikit-learn
In this workshop, we will use predictive modeling to analyze data about NHL hockey players between 2006 and 2018. The goal is to learn and practice the standard workflow used for machine learning analysis:
- Cleaning the data
- Visualizing relationships
- Choosing predictor (input) variables
- Choosing between competing models
- Interpreting model conclusions
- Ethics considerations

Accessing Workshop Materials
You can either work locally with the workshop materials (on your own laptop) or remotely (online). To work locally, you will need to have either R or Python installed on your computer, and a variety of packages. If you do not have either of these installed, you can still work online!