Step through temperature sampling and watch how dividing the logits by a temperature T sharpens or flattens the probability distribution used for text generation.
Logit Rescaling Before Softmax
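Temperature T controls how random sampling is. The logits are divided by T before softmax, so T<1 widens the gaps between logits and yields a peaked distribution that favors the top tokens, while T>1 shrinks the gaps and yields a flatter, more exploratory distribution. T=1 leaves the logits unchanged and gives the standard softmax; as T approaches 0, sampling approaches greedy (argmax) selection.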
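
The rescaling described above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a reference implementation: the function names `softmax_with_temperature` and `sample_token` and the example logits are hypothetical, chosen only for this sketch.

```python
import math
import random


def softmax_with_temperature(logits, temperature=1.0):
    """Divide each logit by the temperature, then apply softmax."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    # Subtract the maximum before exponentiating for numerical stability;
    # this does not change the resulting probabilities.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature=1.0, rng=random):
    """Draw one token index from the temperature-scaled distribution."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Hypothetical logits for a three-token vocabulary.
example_logits = [2.0, 1.0, 0.1]

for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(example_logits, t)
    print(f"T={t}: {[round(p, 3) for p in probs]}")
```

Running the loop shows the probability of the top token rising as T falls below 1 and falling as T rises above 1, while the ranking of the tokens stays the same at every temperature.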