Big picture

Stopping condition


Taylor series

Convexity

Gradient descent

Choice of learning rate

Adaptive learning rate

Stochastic gradient descent (SGD)

Feature scaling

Limitations

Newton's method

Batch learning vs. online learning

