VYPR
Unrated severityNVD Advisory· Published Jun 20, 2026

vLLM - Denial of Service via Unvalidated Multimodal Embeddings

CVE-2026-56340

Description

vLLM versions >= 0.10.2 and < 0.13.0 are missing sparse tensor validation in multimodal embeddings processing. Because PyTorch disables sparse tensor invariant checks by default, an attacker can submit crafted embedding requests with malformed (negative or out-of-bounds) tensor indices, when the prompt-embeds feature is enabled, to trigger crashes or resource exhaustion (denial of service), with potential for out-of-bounds/write-what-where memory corruption. This continues CVE-2025-62164, whose prior fix only disabled the feature by default rather than addressing the root cause.

AI Insight

LLM-synthesized narrative grounded in this CVE's description and references.

Affected products

2

Patches

Vulnerability mechanics

Root cause

"Missing sparse tensor validation in multimodal embeddings processing allows malformed (negative or out-of-bounds) tensor indices to be accepted, leading to crashes, resource exhaustion, or memory corruption."

Attack vector

An attacker can submit crafted embedding requests with malformed sparse tensor indices (negative or out-of-bounds values) when the prompt-embeds feature is enabled. Because PyTorch disables sparse tensor invariant checks by default for performance, these malformed tensors are accepted without validation [ref_id=1]. This can trigger crashes, resource exhaustion (denial of service), and potentially out-of-bounds/write-what-where memory corruption. The attack requires the prompt-embeds feature to be explicitly enabled by the server operator.

Affected code

The vulnerability resides in the multimodal embeddings processing path of vLLM (versions >= 0.10.2 and < 0.13.0). The root cause is missing sparse tensor validation when handling prompt-embed requests, specifically the lack of checks for malformed (negative or out-of-bounds) tensor indices. The prior fix for CVE-2025-62164 only disabled the prompt-embeds feature by default rather than adding proper validation [ref_id=1].

What the fix does

The patch adds sparse tensor validation to ensure indices are valid, non-negative, and within bounds [ref_id=1]. This addresses the root cause rather than merely disabling the feature. By validating tensor invariants before processing, the fix prevents malformed embedding requests from causing crashes, resource exhaustion, or memory corruption. The advisory notes that PyTorch disables these checks by default, making explicit validation necessary.

Preconditions

  • configThe prompt-embeds feature must be explicitly enabled by the server operator (it is disabled by default after the prior fix).
  • networkAttacker must be able to send crafted embedding requests to the vLLM server.

Generated on Jun 21, 2026. Inputs: CWE entries + fix-commit diffs from this CVE's patches. Citations validated against bundle.

References

2

News mentions

0

No linked articles in our index yet.