Sapiens
Research

What is model collapse?

Published June 2, 2026 · 4 min read

MODEL COLLAPSEA photocopy of a photocopy.Train AI on AI output and the detail fades to gray mush.real datacopy of copygray musheach training round copies the lastEvery round is a copy of a copy — errors compound until the model forgets the real world.

Definition

Model collapse is the progressive quality loss that occurs when AI systems are trained on data generated by earlier AI systems, causing outputs to grow blander, less accurate, and less diverse with each generation.[1]

At a glance

  • Caused by AI learning from AI-made content instead of real human data.[1]
  • Rare and unusual cases vanish first, so outputs converge on generic averages.[2]
  • Even small amounts of synthetic data in the mix can start the decay.[3]
  • Matters as the web fills with AI-generated text, images, and reviews.

Why it happens

AI models naturally lean toward common patterns and skip rare details. When their output becomes the next model’s training data, those rare details get dropped repeatedly.[2] Across generations the unusual edges shrink, errors compound, and the model forgets how varied the real world actually is.[3]

Why a business should care

If your tools, vendors, or marketing rely on AI trained on polluted web data, you risk repetitive, generic, or subtly wrong output.[1] Keeping original human-created data, knowing your data sources, and not blindly recycling AI output protects quality and competitive edge over time.

Bottom line

Model collapse is the slow rot AI suffers when it feeds on its own output, making clean, human-sourced data an increasingly valuable asset.

References

  1. AI models collapse when trained on recursively generated data — Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Nicolas Papernot, Ross Anderson, Yarin Gal. Nature www.nature.com
  2. What Is Model Collapse? IBM www.ibm.com
  3. Model collapse explained: How synthetic training data breaks AI. TechTarget www.techtarget.com

Comments

Questions, corrections, and links welcome. Be specific and civil.

  • Loading comments…