Vllm

pypi: vllm

Source repositories

https://github.com/vllm-project/vllm

CVEs (53)

CVE	Vendor / Product	Sev	Risk	CVSS	EPSS	Published	Description
CVE-2026-4944	Vllm Vllm	Hig	0.57	8.8	0.01	May 28, 2026	vllm-project/vllm version 0.14.1 contains a vulnerability where the `trust_remote_code=True` parameter is hardcoded in two model implementation files (`vllm/model_executor/models/nemotron_vl.py` and `vllm/model_executor/models/kimi_k25.py`). This bypasses the user's explicit…
CVE-2026-48746	Vllm Vllm	cri	0.52	—	0.01	Jun 16, 2026	### Summary A vulnerability in ASGI web servers and starlette's trust on those web servers enables an authentication bypass of the OpenAI API `AuthenticationMiddleware`, which was discovered during @x41sec's source code audit. It allows to use the API without providing the…
CVE-2026-5497	Vllm Vllm	Hig	0.42	7.5	0.01	Jun 11, 2026	vLLM versions 0.8.0 and later are vulnerable to an Out-of-Memory (OOM) Denial of Service (DoS) attack due to unbounded frame count processing in the `VideoMediaIO.load_base64()` method. When processing `video/jpeg` data URLs, the method splits the base64 data string on commas to…
CVE-2024-8768	Vllm Vllm	Hig	0.42	7.5	0.01	Sep 17, 2024	A flaw was found in the vLLM library. A completions API request with an empty prompt will crash the vLLM API server, resulting in a denial of service.
CVE-2024-8939	Vllm Vllm	Med	0.40	6.2	0.00	Sep 17, 2024	A vulnerability was found in the ilab model serve component, where improper handling of the best_of parameter in the vllm JSON web API can lead to a Denial of Service (DoS). The API used for LLM-based sentence or chat completion accepts a best_of parameter to return the best…
CVE-2026-41523	Vllm Vllm	hig	0.39	—	0.00	Jun 16, 2026	### Summary An `assert`-based security check in vLLM's activation function loading allows any unauthenticated attacker to achieve arbitrary code execution on the server by publishing a malicious HuggingFace model, when vLLM runs in Python optimized mode (`python -O` or…
CVE-2025-6242	Vllm Vllm	Hig	0.39	7.1	0.00	Oct 7, 2025	A Server-Side Request Forgery (SSRF) vulnerability exists in the MediaConnector class within the vLLM project's multimodal feature set. The load_from_url and load_from_url_async methods fetch and process media from user-provided URLs without adequate restrictions on the target…
CVE-2025-9141	Vllm Vllm	hig	0.39	—	0.04	Aug 21, 2025	### Summary An unsafe deserialization vulnerability allows any authenticated user to execute arbitrary code on the server if they are able to get the model to pass the code as an argument to a tool call. ### Details vLLM's [Qwen3 Coder tool…
CVE-2026-44223	Vllm Vllm	Med	0.35	6.5	0.00	May 12, 2026	vLLM is an inference and serving engine for large language models (LLMs). From to before 0.20.0, the extract_hidden_states speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the…
CVE-2026-44222	Vllm Vllm	Med	0.35	6.5	0.00	May 12, 2026	vLLM is an inference and serving engine for large language models (LLMs). From 0.6.1 to before 0.20.0, there is a a Token Injection vulnerability in vLLM’s multimodal processing. Unauthenticated, text-only prompts that spell special tokens are interpreted as control. Image and…
CVE-2026-34756	Vllm Vllm	Med	0.35	6.5	0.00	Apr 6, 2026	vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.19.0, a Denial of Service vulnerability exists in the vLLM OpenAI-compatible API server. Due to the lack of an upper bound validation on the n parameter in the ChatCompletionRequest…
CVE-2026-34755	Vllm Vllm	Med	0.35	6.5	0.00	Apr 6, 2026	vLLM is an inference and serving engine for large language models (LLMs). From 0.7.0 to before 0.19.0, the VideoMediaIO.load_base64() method at vllm/multimodal/media/video.py splits video/jpeg data URLs by comma to extract individual JPEG frames, but does not enforce a frame…
CVE-2026-9540	Vllm Vllm	Med	0.34	5.3	0.00	May 26, 2026	A vulnerability was identified in vllm-project vllm 0.19.0. This issue affects some unknown processing of the component OpenAI-compatible Serving Path. Such manipulation leads to denial of service. It is possible to launch the attack remotely. The exploit is publicly available…
CVE-2026-34760	Vllm Vllm	Med	0.31	5.9	0.00	Apr 2, 2026	vLLM is an inference and serving engine for large language models (LLMs). From version 0.5.5 to before version 0.18.0, Librosa defaults to using numpy.mean for mono downmixing (to_mono), while the international standard ITU-R BS.775-4 specifies a weighted downmixing algorithm.…
CVE-2026-7141	Vllm Vllm	Med	0.29	5.6	0.00	Apr 27, 2026	A vulnerability was found in vllm up to 0.19.0. The affected element is the function has_mamba_layers of the file vllm/v1/kv_cache_interface.py of the component KV Block Handler. Performing a manipulation results in uninitialized resource. It is possible to initiate the attack…
CVE-2026-34753	Vllm Vllm	Med	0.28	5.4	0.00	Apr 6, 2026	vLLM is an inference and serving engine for large language models (LLMs). From 0.16.0 to before 0.19.0, a server-side request forgery (SSRF) vulnerability in download_bytes_from_url allows any actor who can control batch input JSON to make the vLLM batch runner issue arbitrary…
CVE-2026-12491	Vllm Vllm	mod	0.24	4.8	0.00	Jun 10, 2026	vllm: vllm: image EXIF Rotation & PNG tRNS Transparency Not Normalized, Causing Mismatch Between Model Input and Expectations
CVE-2025-61620	Vllm Vllm	med	0.19	—	0.00	Oct 7, 2025	### Summary A resource-exhaustion (denial-of-service) vulnerability exists in multiple endpoints of the OpenAI-Compatible Server due to the ability to specify Jinja templates via the `chat_template` and `chat_template_kwargs` parameters. If an attacker can supply these…
CVE-2026-54232	Vllm Vllm		0.00	—	0.00	Jun 22, 2026	vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.1, the vLLM Dockerfile is vulnerable to a dependency confusion attack through the flashinfer-jit-cache package. The package is installed from a custom index (flashinfer.ai/whl/) using…
CVE-2026-56340	Vllm Vllm		0.00	—	0.00	Jun 20, 2026	vLLM versions >= 0.10.2 and < 0.13.0 are missing sparse tensor validation in multimodal embeddings processing. Because PyTorch disables sparse tensor invariant checks by default, an attacker can submit crafted embedding requests with malformed (negative or out-of-bounds) tensor…

CVE-2026-4944HigMay 28, 2026
Vllm
Vllm
risk 0.57cvss 8.8epss 0.01
vllm-project/vllm version 0.14.1 contains a vulnerability where the `trust_remote_code=True` parameter is hardcoded in two model implementation files (`vllm/model_executor/models/nemotron_vl.py` and `vllm/model_executor/models/kimi_k25.py`). This bypasses the user's explicit…
CVE-2026-48746criJun 16, 2026
Vllm
Vllm
risk 0.52cvss —epss 0.01
### Summary A vulnerability in ASGI web servers and starlette's trust on those web servers enables an authentication bypass of the OpenAI API `AuthenticationMiddleware`, which was discovered during @x41sec's source code audit. It allows to use the API without providing the…
CVE-2026-5497HigJun 11, 2026
Vllm
Vllm
risk 0.42cvss 7.5epss 0.01
vLLM versions 0.8.0 and later are vulnerable to an Out-of-Memory (OOM) Denial of Service (DoS) attack due to unbounded frame count processing in the `VideoMediaIO.load_base64()` method. When processing `video/jpeg` data URLs, the method splits the base64 data string on commas to…
CVE-2024-8768HigSep 17, 2024
Vllm
Vllm
risk 0.42cvss 7.5epss 0.01
A flaw was found in the vLLM library. A completions API request with an empty prompt will crash the vLLM API server, resulting in a denial of service.
CVE-2024-8939MedSep 17, 2024
Vllm
Vllm
risk 0.40cvss 6.2epss 0.00
A vulnerability was found in the ilab model serve component, where improper handling of the best_of parameter in the vllm JSON web API can lead to a Denial of Service (DoS). The API used for LLM-based sentence or chat completion accepts a best_of parameter to return the best…
CVE-2026-41523higJun 16, 2026
Vllm
Vllm
risk 0.39cvss —epss 0.00
### Summary An `assert`-based security check in vLLM's activation function loading allows any unauthenticated attacker to achieve arbitrary code execution on the server by publishing a malicious HuggingFace model, when vLLM runs in Python optimized mode (`python -O` or…
CVE-2025-6242HigOct 7, 2025
Vllm
Vllm
risk 0.39cvss 7.1epss 0.00
A Server-Side Request Forgery (SSRF) vulnerability exists in the MediaConnector class within the vLLM project's multimodal feature set. The load_from_url and load_from_url_async methods fetch and process media from user-provided URLs without adequate restrictions on the target…
CVE-2025-9141higAug 21, 2025
Vllm
Vllm
risk 0.39cvss —epss 0.04
### Summary An unsafe deserialization vulnerability allows any authenticated user to execute arbitrary code on the server if they are able to get the model to pass the code as an argument to a tool call. ### Details vLLM's [Qwen3 Coder tool…
CVE-2026-44223MedMay 12, 2026
Vllm
Vllm
risk 0.35cvss 6.5epss 0.00
vLLM is an inference and serving engine for large language models (LLMs). From to before 0.20.0, the extract_hidden_states speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the…
CVE-2026-44222MedMay 12, 2026
Vllm
Vllm
risk 0.35cvss 6.5epss 0.00
vLLM is an inference and serving engine for large language models (LLMs). From 0.6.1 to before 0.20.0, there is a a Token Injection vulnerability in vLLM’s multimodal processing. Unauthenticated, text-only prompts that spell special tokens are interpreted as control. Image and…
CVE-2026-34756MedApr 6, 2026
Vllm
Vllm
risk 0.35cvss 6.5epss 0.00
vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.19.0, a Denial of Service vulnerability exists in the vLLM OpenAI-compatible API server. Due to the lack of an upper bound validation on the n parameter in the ChatCompletionRequest…
CVE-2026-34755MedApr 6, 2026
Vllm
Vllm
risk 0.35cvss 6.5epss 0.00
vLLM is an inference and serving engine for large language models (LLMs). From 0.7.0 to before 0.19.0, the VideoMediaIO.load_base64() method at vllm/multimodal/media/video.py splits video/jpeg data URLs by comma to extract individual JPEG frames, but does not enforce a frame…
CVE-2026-9540MedMay 26, 2026
Vllm
Vllm
risk 0.34cvss 5.3epss 0.00
A vulnerability was identified in vllm-project vllm 0.19.0. This issue affects some unknown processing of the component OpenAI-compatible Serving Path. Such manipulation leads to denial of service. It is possible to launch the attack remotely. The exploit is publicly available…
CVE-2026-34760MedApr 2, 2026
Vllm
Vllm
risk 0.31cvss 5.9epss 0.00
vLLM is an inference and serving engine for large language models (LLMs). From version 0.5.5 to before version 0.18.0, Librosa defaults to using numpy.mean for mono downmixing (to_mono), while the international standard ITU-R BS.775-4 specifies a weighted downmixing algorithm.…
CVE-2026-7141MedApr 27, 2026
Vllm
Vllm
risk 0.29cvss 5.6epss 0.00
A vulnerability was found in vllm up to 0.19.0. The affected element is the function has_mamba_layers of the file vllm/v1/kv_cache_interface.py of the component KV Block Handler. Performing a manipulation results in uninitialized resource. It is possible to initiate the attack…
CVE-2026-34753MedApr 6, 2026
Vllm
Vllm
risk 0.28cvss 5.4epss 0.00
vLLM is an inference and serving engine for large language models (LLMs). From 0.16.0 to before 0.19.0, a server-side request forgery (SSRF) vulnerability in download_bytes_from_url allows any actor who can control batch input JSON to make the vLLM batch runner issue arbitrary…
CVE-2026-12491modJun 10, 2026
Vllm
Vllm
risk 0.24cvss 4.8epss 0.00
vllm: vllm: image EXIF Rotation & PNG tRNS Transparency Not Normalized, Causing Mismatch Between Model Input and Expectations
CVE-2025-61620medOct 7, 2025
Vllm
Vllm
risk 0.19cvss —epss 0.00
### Summary A resource-exhaustion (denial-of-service) vulnerability exists in multiple endpoints of the OpenAI-Compatible Server due to the ability to specify Jinja templates via the `chat_template` and `chat_template_kwargs` parameters. If an attacker can supply these…
CVE-2026-54232Jun 22, 2026
Vllm
Vllm
risk 0.00cvss —epss 0.00
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.1, the vLLM Dockerfile is vulnerable to a dependency confusion attack through the flashinfer-jit-cache package. The package is installed from a custom index (flashinfer.ai/whl/) using…
CVE-2026-56340Jun 20, 2026
Vllm
Vllm
risk 0.00cvss —epss 0.00
vLLM versions >= 0.10.2 and < 0.13.0 are missing sparse tensor validation in multimodal embeddings processing. Because PyTorch disables sparse tensor invariant checks by default, an attacker can submit crafted embedding requests with malformed (negative or out-of-bounds) tensor…

Page 1 of 3