For a deeper treatment, you can read D2L Online Book until end of 5.2. Implementation of Multilayer Perceptrons.
6 Parts + Bonus (complete sequentially).
Understand computational trade-offs between analytical and iterative solutions. Implement both methods, measure runtime, accuracy, memory usage and conditioning effects. See the template function compare_methods
in the starter notebook.
Reproduce the two-hole landscape, perform a systematic hyper-parameter study and design your own complex loss landscape.
Compare SGD, Momentum, Adam, AdaGrad and RMSProp on challenging optimisation problems including the Rosenbrock function and your two-hole landscape.
Implement pure Hebbian learning, Oja's rule and analyse their limitations using a pattern association task.
Build and train a minimal neural network that solves the XOR problem to solidify your understanding of non-linear activation functions.
Conduct an empirical study of the bias-variance trade-off across model complexities using bootstrap sampling.