Step through 2D convolution — watch a 2×2 filter slide across a 4×4 input, computing each cell of the 3×3 feature map via dot products.
Sliding Window Dot Product
A convolutional layer learns spatial features by sliding a small filter (kernel) over the input and computing a dot product at each position.