Project Checkpoint 3
Now that you have explored your original dataset and produced data summaries addressing your primary research questions, it is time to integrate new country-level data.
By the end of the week, your updated report should contain:
two new country-level data sources
Stat 431: One of these must be acquired using APIs or webscraping
Stat 541: One of these must be acquired using APIs, the other using webscraping.
descriptions of each of the new datasets
a “meta” dataset containing all the acquired data (joined together)
at least two data summaries OR visualizations incorporating the additional country-level data
Stat 541 Only
- your webscraped data must come from a multi-paged source, so that iteration is used to gather all the data
Warning! Most APIs require you to make an account to access them to prevent too much automated data collection. Some APIs also charge a fee to use. You should not use any paid APIs for this class!
Helpful Links
Here are some sources for country-level data that you may use if you so choose.
These are just to get you started. You are not obligated to use these, and we cannot guarantee that they will all be functional, free, or useful.
Tabular and/or Scrapable Data
APIs
Much of the data we have relied on for years - such as NOAA and BEA data listed above - are no longer being collected and made available, due to US Government cuts to these public statistics services. Other data sources that result from independent academic research at US institutions are rapidly dwindling as well, also due to cuts to this funding.
Although these data sources may eventually be restored, or replaced with data from alternative agencies, we will never be able to go back and fill the gap for these lost years.