Step through the forward pass and backward pass of a 2-layer neural network — watch how gradients flow from loss back through each weight using the chain rule.
Chain Rule of Calculus
Backprop computes how much each weight contributed to the loss by applying the chain rule backwards through the computation graph.