LLM08 · OWASP LLM Top 10 (2025)

Vector and Embedding Weaknesses

RAG put vector databases everywhere. Most teams trust their similarity-search results without realising they are a wide-open attack surface — for retrieval hijack, embedding inversion, and tenant boundary failures.

01What it is

Risks arising from embedding generation, storage, retrieval, and re-use. Includes embedding-inversion attacks (reconstruct text from embeddings), retrieval hijack (craft documents that always rank highly), cross-tenant leakage, and embedding-space adversarial perturbations.

02Why it matters

A poisoned chunk in the vector store is worse than a poisoned email. Every relevant query retrieves it; the LLM treats it as ground truth. Embedding inversion has progressed from theoretical to demonstrable — full text recovery from embeddings of sensitive documents is now feasible. Most production RAG systems have no defences.

03Attack vectors

  • Retrieval hijack — craft a document with embedding close to common queries; it dominates retrieval.
  • Embedding inversion — given an embedding, recover the source text (e.g., via vec2text).
  • Cross-tenant leak — filtering tenant_id AFTER similarity search instead of BEFORE.
  • Index poisoning — flood the store with adversarial documents to push legitimate ones out.
  • Adversarial perturbation — small text changes that drastically shift embeddings.
  • Embedding model swap — re-embed with a different model; existing vectors become meaningless but get returned anyway.

04Defence patterns

  • Apply tenant + access filters BEFORE the ANN search, via metadata pre-filtering.
  • Treat embeddings as sensitive data — encrypt at rest, restrict access, audit retrieval.
  • Hybrid retrieval (BM25 + dense) makes pure embedding-space attacks harder.
  • Reranker as a sanity check — does the top result genuinely answer the question?
  • Quarantine new chunks — never ingest user uploads directly into the live index.
  • Pin embedding model versions; re-embed deliberately on upgrade.

05Detection

Signals to watch

Audit retrieval — sample queries and verify returned chunks belong to the right tenant. Log the top-k chunks for every query; alert on retrieval that contradicts known-good answers. Watch for adversarial uploads (high embedding similarity to common queries).

06India context

DPDP · RBI · CERT-In

DPDP processor obligations apply to embeddings of personal data — they are processed personal data, not just numbers. Cross-tenant leakage in a B2B AI tool serving multiple Indian businesses can trigger separate breach notifications per tenant.

08Related modules on RingSafe

09Further reading