Assignment 3

Attention

A difficulty with feed-forward networks is that they have to compute everything at once. This is not how most programs work, most programs involve loops and control flow. Likewise, this is hardly how vision works. While humans have some ability to recognize an image without moving their gaze, they are much better at it when they can use saccades to scan the image, jumping from clue to clue.

An attention model lets the network sequentially focus on a subset of the input, process it, and then change its focus to some other part of the input. This makes is easier to reason sequentially about the data, even if the data isn't sequential in nature.

The objective of this assigment is to understand one the simplest attention models: DRAW. To this end, the specific points for this assignment are:

Read (carefully) this paper: DRAW: A Recurrent Neural Network For Image Generation
Look at this implementation in tf: http://blog.evjang.com/2016/06/understanding-and-implementing.html
Re-code the previous implementation (in tf or keras) in order to have a clear implementation that can be used for pedagogical purposes. For example, use more data visualization, simplify the code, etc.
Write a tutorial notebook (include latex formulas, figures, etc.) with your code about the DRAW paper.

Deliverable: Notebook.

Submission date: 21/6/2017

Scoring criteria: the most important feature for getting a good scoring of this assignment will be the pedagogical value of your notebook.