This assignment lets you practice data exploration from scratch using Tableau.
To guide you in this assignment you can follow this workflow:
Download one of the datasets from the link above that interests you. Alternatively, you can use a different dataset. In that case, make sure you include the link or dataset in this iPython Notebook.
As you have seen in the tutorial workbooks of the previous assignment, the visualizations start with a question such as: 'Which artists sell the most?'. When you pose a question, think of searching for outliers, trends or clusters.
Ask 3 questions about your dataset and include them here. Then start the exploration workflow and include the resulting dashboard in this notebook.
I used the following dataset: Cat vs Dog Popularity in the US
While you can create beautiful visualizations that are extremely useful in the data expoloration phase with Tableau, if you want to use it as part of reproducible research you have to be able to get to the same visualization using Python.
This assignment is an optional challenge. Recreate your dashboard above using iPython. This means you must:
You do not have to reproduce exactly the style and colors of the Tableau plots, but they should be similar enough to give the same insights. Good luck!
In [ ]:
# your iPython version of the above visualization