Step through beam search decoding — watch the top-k candidate sequences expand step by step, keeping only the most probable beams.
Width-Limited Tree Search
Beam search keeps the top-k (beam width) partial sequences at each step, avoiding exponential cost while outperforming greedy decoding.