High severityNVD Advisory· Published Aug 21, 2025· Updated Aug 21, 2025
vLLM API endpoints vulnerable to Denial of Service Attacks
CVE-2025-48956
Description
vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.10.1.1, a Denial of Service (DoS) vulnerability can be triggered by sending a single HTTP GET request with an extremely large header to an HTTP endpoint. This results in server memory exhaustion, potentially leading to a crash or unresponsiveness. The attack does not require authentication, making it exploitable by any remote user. This vulnerability is fixed in 0.10.1.1.
AI Insight
LLM-synthesized narrative grounded in this CVE's description and references.
Affected packages
Versions sourced from the GitHub Security Advisory.
| Package | Affected versions | Patched versions |
|---|---|---|
vllmPyPI | >= 0.1.0, < 0.10.1.1 | 0.10.1.1 |
Affected products
5- osv-coords4 versionspkg:apk/chainguard/tritonserver-backend-vllm-24.04pkg:apk/chainguard/tritonserver-backend-vllm-cuda-12.9pkg:apk/chainguard/tritonserver-backend-vllm-meta-cuda-12.9pkg:pypi/vllm
< 24.04-r15+ 3 more
- (no CPE)range: < 24.04-r15
- (no CPE)range: < 25.7.1_git20250821-r1
- (no CPE)range: < 25.7.1_git20250821-r1
- (no CPE)range: >= 0.1.0, < 0.10.1.1
Patches
Vulnerability mechanics
References
5- github.com/advisories/GHSA-rxc4-3w6r-4v47ghsaADVISORY
- nvd.nist.gov/vuln/detail/CVE-2025-48956ghsaADVISORY
- github.com/vllm-project/vllm/commit/d8b736f913a59117803d6701521d2e4861701944ghsax_refsource_MISCWEB
- github.com/vllm-project/vllm/pull/23267ghsax_refsource_MISCWEB
- github.com/vllm-project/vllm/security/advisories/GHSA-rxc4-3w6r-4v47ghsax_refsource_CONFIRMWEB
News mentions
0No linked articles in our index yet.