(In order to load the stylesheet of this notebook, execute the last code cell in this notebook)

Clustering food recipes from around the world

In this assignment, you are expected to hierarchically cluster food recipes. We will be using YY Ahn's dataset "Recipe datasets with cuisines", which for each recipe provides a list of ingredients and the country of origin, scraped from popular recipe websites. He also provides an ingredient and compounds dataset, which you could potentially also make use of, if you find it interesting.

You will be graded as follow:

  • Data parsing and representation (20 pts)
  • Clustering (30 pts)
  • Visualization (30 pts)
  • Short (datailed) analysis of specific cluster (20 pts)

In [ ]:


  1. Ahn, Yong-Yeol, et al. "Flavor network and the principles of food pairing." Scientific reports 1 (2011).

In [21]:
# Code for setting the style of the notebook
from IPython.core.display import HTML
def css_styling():
    styles = open("../../theme/custom.css", "r").read()
    return HTML(styles)