Question 1

What is the algorithm pattern for Backpropagation?

Accepted Answer

Chain Rule of Calculus: Backprop computes how much each weight contributed to the loss by applying the chain rule backwards through the computation graph.

Question 2

How do you solve Backpropagation step by step?

Accepted Answer

Run the forward pass: compute z1, a1 (hidden), z2, a2 (output). Compute the loss (MSE: 0.5 * (a2 - y)²). Compute dL/da2 = a2 - y (output error). Multiply by the sigmoid derivative da2/dz2 = a2*(1-a2) to get dL/dz2. Compute dW2 = dL/dz2 * a1 (gradient for output weights). Propagate error back: dL/da1 = W2.T @ dL/dz2. Multiply by sigmoid derivative to get dL/dz1. Compute dW1 = dL/dz1 * x.T (gradient for hidden weights).

Question 3

What are common mistakes when solving Backpropagation?

Accepted Answer

The sigmoid derivative σ'(z) = σ(z)*(1-σ(z)) — reuse the already-computed activations. Gradient dimensions must match weight matrix dimensions exactly. Vanishing gradients: sigmoid saturates near 0 and 1, making gradients tiny in deep nets.

Backpropagation — Step-by-Step Visualization

Algorithm Pattern

Key Idea

Step-by-Step Approach

Common Gotchas

Related Problems