Q: How do you solve Beam Search step by step?

Start with beam = [( , score=1.0)]. Expand each beam by scoring all vocab words as the next token. Keep only the top-k sequences by cumulative score. Repeat until all beams reach or max length. Longer sequences get lower scores — apply length normalization.

Question 1

What is the algorithm pattern for Beam Search?

Accepted Answer

Width-Limited Tree Search: Beam search keeps the top-k (beam width) partial sequences at each step, avoiding exponential cost while outperforming greedy decoding.

Question 2

How do you solve Beam Search step by step?

Accepted Answer

Start with beam = [(, score=1.0)]. Expand each beam by scoring all vocab words as the next token. Keep only the top-k sequences by cumulative score. Repeat until all beams reach or max length. Longer sequences get lower scores — apply length normalization.

Question 3

What are common mistakes when solving Beam Search?

Accepted Answer

Greedy search = beam width 1 — fast but suboptimal. Beam search can repeat phrases; n-gram blocking is a common fix. Top-p sampling is now preferred over beam search for open-ended generation.

Beam Search — Step-by-Step Visualization

Algorithm Pattern

Key Idea

Step-by-Step Approach

Common Gotchas

Related Problems