research

Reasoning vs memorization: what's the difference?

June 1, 2026 · 5 min read

REASONING VS MEMORIZATIONBoth ace the familiar question.Only one survives the unfamiliar one.MEMORIZATIONKnown questionrecalls the stored answerReworded questionno stored answer to findREASONINGKnown questionworks out the stepsReworded questionre-runs the same stepsThe crux: what happens when the question changes?

Definition

Memorization is when an AI recalls an answer it saw in training; reasoning is when it works out a fresh answer step by step, even on problems it has never seen.

At a glance

How it works

Picture two job candidates. One memorized last year’s exam answers; the other understands the math. They tie on the old test, but only the second solves a new problem. AI behaves the same way — memorization recalls training patterns, reasoning chains steps for something genuinely new.[4] Both look confident and correct on familiar questions, so a polished demo cannot tell them apart.

What to do

Don’t buy on benchmarks or a clean demo. Test the AI on your own messy cases and variations of them — reword them, add an irrelevant detail, change the numbers.[3] If it holds up, you have reasoning you can trust. If it collapses, it was matching memorized patterns and will misfire when customers ask something off-script.

Bottom line

The difference is invisible on familiar questions and decisive on unfamiliar ones — change the question and watch what survives.

Connects to Computer SciencePhilosophy

References

  1. On Memorization of Large Language Models in Logical Reasoning — Chulin Xie, Yangsibo Huang. arXiv arxiv.org
  2. None of the Others, distinguishing reasoning from memorization in multiple-choice benchmarks — Eva Sanchez Salido. arXiv arxiv.org
  3. GSM-Plus, a benchmark for the robustness of LLMs as math problem solvers — Qintong Li. arXiv arxiv.org
  4. Generalization vs Memorization, tracing capabilities back to pretraining data — Antonis Antoniades. arXiv arxiv.org
  5. Beyond Memorization, reasoning-driven synthesis against benchmark contamination. arXiv arxiv.org