<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Transformers on vnykmshr</title><link>https://blog.vnykmshr.com/writing/tags/transformers/</link><description>Recent content in Transformers on vnykmshr</description><generator>Hugo</generator><language>en</language><lastBuildDate>Wed, 18 Feb 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://blog.vnykmshr.com/writing/tags/transformers/index.xml" rel="self" type="application/rss+xml"/><item><title>Repeat yourself</title><link>https://blog.vnykmshr.com/writing/repeat-yourself/</link><pubDate>Wed, 18 Feb 2026 00:00:00 +0000</pubDate><guid>https://blog.vnykmshr.com/writing/repeat-yourself/</guid><description>&lt;p&gt;If you repeat your prompt, the model gives you a better answer. Not a smarter model, not a bigger context window, not chain of thought &amp;ndash; you say the same thing twice and it works better. &lt;a href="https://arxiv.org/abs/2512.14982"&gt;Google researchers tested this&lt;/a&gt; across Gemini, GPT, Claude, DeepSeek &amp;ndash; 47 wins out of 70 benchmarks, zero losses.&lt;/p&gt;
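&lt;p&gt;A toy sketch of why this helps (my illustration, not from the paper): count how many positions each token can attend to under a causal mask, with and without repeating a 6-token prompt. The &lt;code&gt;causal_mask&lt;/code&gt; helper is assumed, not anything from the study:&lt;/p&gt;

```python
import numpy as np

def causal_mask(n):
    # Lower-triangular boolean mask: token i may attend
    # only to positions 0..i (itself and everything earlier).
    return np.tril(np.ones((n, n), dtype=bool))

n = 6  # a single copy of a 6-token prompt
single = causal_mask(n)
# Row sums = how many tokens each position can see.
# Token 0 sees only itself; token 5 sees all 6.
print(single.sum(axis=1))  # [1 2 3 4 5 6]

# Repeat the prompt: 12 tokens total. The second copy's
# first token (position 6) now attends to the entire
# first copy plus itself.
double = causal_mask(2 * n)
print(double[n].sum())  # 7
```

&lt;p&gt;The asymmetry is the whole point: in the single copy, early tokens are context-starved; in the doubled prompt, every token of the second copy sees at least one full copy of the question.&lt;/p&gt;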
&lt;p&gt;In a transformer, token 1 can&amp;rsquo;t see token 50. Causal masking &amp;ndash; each token attends only to itself and the tokens before it. The first words of your prompt are always processed with the least context. They&amp;rsquo;re flying blind. When you repeat the prompt, the second copy&amp;rsquo;s early tokens can attend to the entire first copy. You&amp;rsquo;re giving the beginning of your question the context it never had.&lt;/p&gt;</description></item></channel></rss>