A difficulty with feed-forward networks is that they have to compute everything at once. This is not how most programs work, most programs involve loops and control flow. Likewise, this is hardly how vision works. While humans have some ability to recognize an image without moving their gaze, they are much better at it when they can use saccades to scan the image, jumping from clue to clue.
An attention model lets the network sequentially focus on a subset of the input, process it, and then change its focus to some other part of the input. This makes is easier to reason sequentially about the data, even if the data isn't sequential in nature.
The objective of this assigment is to understand one the simplest attention models: DRAW. To this end, the specific points for this assignment are:
Deliverable: Notebook.
Submission date: 21/6/2017
Scoring criteria: the most important feature for getting a good scoring of this assignment will be the pedagogical value of your notebook.