Definition

An embedding is a list of numbers that turns content into a point on a map of meaning, where similar things sit close together and unrelated things sit far apart.

At a glance

Computers match by meaning, not keywords: ‘cancel my plan’ finds an article titled ‘ending your subscription.’^[1]
Closeness equals similarity. Every item is a point; the system answers by finding the nearest ones.^[2]
They power semantic search, recommendations, and ‘chat with your documents’ AI (RAG).
You buy embeddings, not build them: call a model, store results in a vector database.

How it works

An embedding model gives each piece of content coordinates on a map of meaning. Because meaning becomes distance, the computer answers ‘what is this most like?’ by finding the nearest points. The classic proof: the math ‘king minus man plus woman’ lands near ‘queen.‘^[3]

What it powers

Semantic search finds results by intent, tolerating typos and slang. Recommendations surface items nearest to what someone liked. RAG lets a chatbot answer from your own files: documents and the question both become embeddings, the closest passages are pulled, then the AI writes a grounded answer.^[4]

Before you trust it

Embeddings are cheap; the real risk is fit. A model strong on web text can be weak on your contracts or catalog, and public leaderboards are self-reported.^[5] Ask vendors which model they use, whether your data leaves your environment, and to show retrieval accuracy on a sample of your real content.

Bottom line

Embeddings turn meaning into distance so software finds what is similar, not just matching words; the decision that matters is how well a model retrieves answers on your own data.

What are embeddings?