Sapiens
Technicals

What are embeddings?

Published June 1, 2026 · 4 min read

MAP OF MEANING Closeness is meaning. The nearest points are the answer. “my card got declined” unrelated topics, far off Every phrase is a point; close points share meaning. To answer, just find the nearest ones.

Definition

An embedding is a list of numbers that turns content into a point on a map of meaning, where similar things sit close together and unrelated things sit far apart.

At a glance

  • Computers match by meaning, not keywords: ‘cancel my plan’ finds an article titled ‘ending your subscription.’[1]
  • Closeness equals similarity. Every item is a point; the system answers by finding the nearest ones.[2]
  • They power semantic search, recommendations, and ‘chat with your documents’ AI (RAG).
  • You buy embeddings, not build them: call a model, store results in a vector database.

How it works

An embedding model gives each piece of content coordinates on a map of meaning. Because meaning becomes distance, the computer answers ‘what is this most like?’ by finding the nearest points. The classic proof: the math ‘king minus man plus woman’ lands near ‘queen.‘[3]

What it powers

Semantic search finds results by intent, tolerating typos and slang. Recommendations surface items nearest to what someone liked. RAG lets a chatbot answer from your own files: documents and the question both become embeddings, the closest passages are pulled, then the AI writes a grounded answer.[4]

Before you trust it

Embeddings are cheap; the real risk is fit. A model strong on web text can be weak on your contracts or catalog, and public leaderboards are self-reported.[5] Ask vendors which model they use, whether your data leaves your environment, and to show retrieval accuracy on a sample of your real content.

Bottom line

Embeddings turn meaning into distance so software finds what is similar, not just matching words; the decision that matters is how well a model retrieves answers on your own data.

References

  1. What is Embedding? - Embeddings in Machine Learning Explained. Amazon Web Services aws.amazon.com
  2. What are Vector Embeddings. Pinecone www.pinecone.io
  3. King - man + woman = queen: the hidden algebraic structure of words. University of Edinburgh, School of Informatics informatics.ed.ac.uk
  4. Embeddings Explained: Vector Databases, Semantic Search and RAG for LLM Apps. Medium (QuarkAndCode) medium.com
  5. Embedding Model Specs 2026: Dimensions, Price per 1M Tokens, and MTEB Table. PE Collective pecollective.com

Comments

Questions, corrections, and links welcome. Be specific and civil.

  • Loading comments…