Step through Byte Pair Encoding — watch the most frequent character pairs iteratively merge into subword tokens, building a vocabulary from scratch.
Iterative Pair Merging
BPE starts with character-level tokens and repeatedly merges the most frequent adjacent pair. This creates subword vocabulary — rare words decompose into known subwords, fixing the unknown-word problem.