vllm
PyPI33 known CVEs affecting this package
Aggregated from OSV, GitHub Security Advisories, NVD, and vendor advisories. Each CVE links to its full detail page with vendor advisories, patches, fixed versions, and remediation guidance.
CVEs affecting vllmpage 1 of 1
- CVE-2024-11041CRITICALCVSS 9.8EG 9.82025-03-20
vulnerable: 0.0.1 ... 0.6.2 (40 versions)
vllm-project vllm version v0.6.2 contains a vulnerability in the MessageQueue.dequeue() API function. The function uses pickle.loads to parse received sockets directly, leading to a remote code execution vulnerability. An attacker can expl…
- CVE-2024-8768HIGHCVSS 7.5EG 7.5✓ Fixed in 0.5.52024-09-17
vulnerable: 0.0.1 ... 0.5.4 (34 versions)
A flaw was found in the vLLM library. A completions API request with an empty prompt will crash the vLLM API server, resulting in a denial of service.
- CVE-2024-8939MEDIUMCVSS 6.2EG 6.22024-09-17
vulnerable: 0.0.1 ... 0.5.0.post1 (29 versions)
A vulnerability was found in the ilab model serve component, where improper handling of the best_of parameter in the vllm JSON web API can lead to a Denial of Service (DoS). The API used for LLM-based sentence or chat completion accepts a …
- CVE-2025-24357HIGHCVSS 7.5EG 7.5✓ Fixed in 0.7.02025-01-27
vulnerable: 0.0.1 ... 0.6.6.post1 (47 versions)
vLLM is a library for LLM inference and serving. vllm/model_executor/weight_utils.py implements hf_model_weights_iterator to load the model checkpoint, which is downloaded from huggingface. It uses the torch.load function and the weights_o…
- CVE-2025-25183LOWCVSS 2.6EG 2.6✓ Fixed in 0.7.22025-02-07
vulnerable: 0.0.1 ... 0.7.1 (49 versions)
vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Maliciously constructed statements can lead to hash collisions, resulting in cache reuse, which can interfere with subsequent responses and cause uninten…
- CVE-2025-29783CRITICALCVSS 9.0EG 9.0✓ Fixed in 0.8.02025-03-19
vulnerable: 0.6.5 ... 0.7.3 (7 versions)
vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. When vLLM is configured to use Mooncake, unsafe deserialization exposed directly over ZMQ/TCP on all network interfaces will allow attackers to execute r…
- CVE-2025-30165HIGHCVSS 8.0EG 8.0✓ Fixed in 0.10.02025-05-06
vulnerable: 0.5.2 ... 0.9.2 (32 versions)
vLLM is an inference and serving engine for large language models. In a multi-node vLLM deployment using the V0 engine, vLLM uses ZeroMQ for some multi-node communication purposes. The secondary vLLM hosts open a `SUB` ZeroMQ socket and co…
- CVE-2025-30202HIGHCVSS 7.5EG 7.5✓ Fixed in 0.8.52025-04-30
vulnerable: 0.5.2 ... 0.8.4 (26 versions)
vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.5.2 and prior to 0.8.5 are vulnerable to denial of service and data exposure via ZeroMQ on multi-node vLLM deployment. In a mult…
- CVE-2025-46560MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.8.52025-04-30
vulnerable: 0.8.0, 0.8.1, 0.8.2, 0.8.3, 0.8.4
vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.8.0 and prior to 0.8.5 are affected by a critical performance vulnerability in the input preprocessing logic of the multimodal t…
- CVE-2025-46570LOWCVSS 2.6EG 2.6✓ Fixed in 0.9.02025-05-29
vulnerable: 0.0.1 ... 0.8.5.post1 (58 versions)
vLLM is an inference and serving engine for large language models (LLMs). Prior to version 0.9.0, when a new prompt is processed, if the PageAttention mechanism finds a matching prefix chunk, the prefill process speeds up, which is reflect…
- CVE-2025-46722MEDIUMCVSS 4.2EG 4.2✓ Fixed in 0.9.02025-05-29
vulnerable: 0.7.0 ... 0.8.5.post1 (11 versions)
vLLM is an inference and serving engine for large language models (LLMs). In versions starting from 0.7.0 to before 0.9.0, in the file vllm/multimodal/hasher.py, the MultiModalHasher class has a security and data integrity issue in its ima…
- CVE-2025-47277CRITICALCVSS 9.8EG 9.8✓ Fixed in 0.8.52025-05-20
vulnerable: 0.6.5 ... 0.8.4 (12 versions)
vLLM, an inference and serving engine for large language models (LLMs), has an issue in versions 0.6.5 through 0.8.4 that ONLY impacts environments using the `PyNcclPipe` KV cache transfer integration with the V0 engine. No other configura…
- CVE-2025-48887MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.9.02025-05-30
vulnerable: 0.6.4 ... 0.8.5.post1 (16 versions)
vLLM, an inference and serving engine for large language models (LLMs), has a Regular Expression Denial of Service (ReDoS) vulnerability in the file `vllm/entrypoints/openai/tool_parsers/pythonic_tool_parser.py` of versions 0.6.4 up to but…
- CVE-2025-48942MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.9.02025-05-30
vulnerable: 0.8.0 ... 0.8.5.post1 (7 versions)
vLLM is an inference and serving engine for large language models (LLMs). In versions 0.8.0 up to but excluding 0.9.0, hitting the /v1/completions API with a invalid json_schema as a Guided Param kills the vllm server. This vulnerability …
- CVE-2025-48943MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.9.02025-05-30
vulnerable: 0.8.0 ... 0.8.5.post1 (7 versions)
vLLM is an inference and serving engine for large language models (LLMs). Version 0.8.0 up to but excluding 0.9.0 have a Denial of Service (ReDoS) that causes the vLLM server to crash if an invalid regex was provided while using structured…
- CVE-2025-48944MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.9.02025-05-30
vulnerable: 0.8.0 ... 0.8.5.post1 (7 versions)
vLLM is an inference and serving engine for large language models (LLMs). In version 0.8.0 up to but excluding 0.9.0, the vLLM backend used with the /v1/chat/completions OpenAPI endpoint fails to validate unexpected or malformed input in t…
- CVE-2025-48956HIGHCVSS 7.5EG 7.5✓ Fixed in 0.10.1.12025-08-21
vulnerable: 0.1.0 ... 0.9.2 (63 versions)
vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.10.1.1, a Denial of Service (DoS) vulnerability can be triggered by sending a single HTTP GET request with an extremely large header to an HTT…
- CVE-2025-59425HIGHCVSS 7.5EG 7.5✓ Fixed in 0.11.02025-10-07
vulnerable: 0.0.1 ... 0.9.2 (66 versions)
vLLM is an inference and serving engine for large language models (LLMs). Before version 0.11.0rc2, the API key support in vLLM performs validation using a method that was vulnerable to a timing attack. API key validation uses a string com…
- CVE-2025-62164HIGHCVSS 8.8EG 8.8✓ Fixed in 0.11.12025-11-21
vulnerable: 0.10.2, 0.11.0
vLLM is an inference and serving engine for large language models (LLMs). From versions 0.10.2 to before 0.11.1, a memory corruption vulnerability could lead to a crash (denial-of-service) and potentially remote code execution (RCE), exist…
- CVE-2025-62372MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.11.12025-11-21
vulnerable: 0.10.0 ... 0.9.2 (33 versions)
vLLM is an inference and serving engine for large language models (LLMs). From version 0.5.5 to before 0.11.1, users can crash the vLLM engine serving multimodal models by passing multimodal embedding inputs with correct ndim but incorrect…
- CVE-2025-6242HIGHCVSS 7.1EG 7.1✓ Fixed in 0.11.02025-10-07
vulnerable: 0.10.0 ... 0.9.2 (39 versions)
A Server-Side Request Forgery (SSRF) vulnerability exists in the MediaConnector class within the vLLM project's multimodal feature set. The load_from_url and load_from_url_async methods fetch and process media from user-provided URLs witho…
- CVE-2025-62426MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.11.12025-11-21
vulnerable: 0.10.0 ... 0.9.2 (33 versions)
vLLM is an inference and serving engine for large language models (LLMs). From version 0.5.5 to before 0.11.1, the /v1/chat/completions and /tokenize endpoints allow a chat_template_kwargs request parameter that is used in the code before …
- CVE-2025-66448HIGHCVSS 7.1EG 7.1✓ Fixed in 0.11.12025-12-01
vulnerable: 0.0.1 ... 0.9.2 (67 versions)
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.11.1, vllm has a critical remote code execution vector in a config class named Nemotron_Nano_VL_Config. When vllm loads a model config that contains an au…
- CVE-2026-22773MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.12.02026-01-10
vulnerable: 0.10.0 ... 0.9.2 (27 versions)
vLLM is an inference and serving engine for large language models (LLMs). In versions from 0.6.4 to before 0.12.0, users can crash the vLLM engine serving multimodal models that use the Idefics3 vision model implementation by sending a spe…
- CVE-2026-22778CRITICALCVSS 9.8EG 9.8✓ Fixed in 0.14.12026-02-02
vulnerable: 0.10.0 ... 0.9.2 (18 versions)
vLLM is an inference and serving engine for large language models (LLMs). From 0.8.3 to before 0.14.1, when an invalid image is sent to vLLM's multimodal endpoint, PIL throws an error. vLLM returns this error to the client, leaking a heap …
- CVE-2026-22807HIGHCVSS 8.8EG 8.8✓ Fixed in 0.14.02026-01-21
vulnerable: 0.10.1 ... 0.13.0 (8 versions)
vLLM is an inference and serving engine for large language models (LLMs). Starting in version 0.10.1 and prior to version 0.14.0, vLLM loads Hugging Face `auto_map` dynamic modules during model resolution without gating on `trust_remote_co…
- CVE-2026-24779HIGHCVSS 7.1EG 7.1✓ Fixed in 0.14.12026-01-27
vulnerable: 0.0.1 ... 0.9.2 (72 versions)
vLLM is an inference and serving engine for large language models (LLMs). Prior to version 0.14.1, a Server-Side Request Forgery (SSRF) vulnerability exists in the `MediaConnector` class within the vLLM project's multimodal feature set. Th…
- CVE-2026-34753MEDIUMCVSS 5.4EG 5.4✓ Fixed in 0.19.02026-04-06
vulnerable: 0.16.0, 0.17.0, 0.17.1, 0.18.0, 0.18.1
vLLM is an inference and serving engine for large language models (LLMs). From 0.16.0 to before 0.19.0, a server-side request forgery (SSRF) vulnerability in download_bytes_from_url allows any actor who can control batch input JSON to make…
- CVE-2026-34755MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.19.02026-04-06
vulnerable: 0.10.0 ... 0.9.2 (33 versions)
vLLM is an inference and serving engine for large language models (LLMs). From 0.7.0 to before 0.19.0, the VideoMediaIO.load_base64() method at vllm/multimodal/media/video.py splits video/jpeg data URLs by comma to extract individual JPEG …
- CVE-2026-34756MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.19.02026-04-06
vulnerable: 0.1.0 ... 0.9.2 (79 versions)
vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.19.0, a Denial of Service vulnerability exists in the vLLM OpenAI-compatible API server. Due to the lack of an upper bound validation on the n…
- CVE-2026-44222MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.20.02026-05-12
vulnerable: 0.10.0 ... 0.9.2 (46 versions)
vLLM is an inference and serving engine for large language models (LLMs). From 0.6.1 to before 0.20.0, there is a a Token Injection vulnerability in vLLM’s multimodal processing. Unauthenticated, text-only prompts that spell special toke…
- CVE-2026-44223MEDIUMCVSS 6.5EG 6.5✓ Fixed in 0.20.02026-05-12
vulnerable: 0.18.0, 0.18.1, 0.19.0, 0.19.1
vLLM is an inference and serving engine for large language models (LLMs). From to before 0.20.0, the extract_hidden_states speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing…
- CVE-2026-7141MEDIUMCVSS 5.6EG 5.6✓ Fixed in 0.19.12026-04-27
vulnerable: 0.0.1 ... 0.9.2 (81 versions)
A vulnerability was found in vllm up to 0.19.0. The affected element is the function has_mamba_layers of the file vllm/v1/kv_cache_interface.py of the component KV Block Handler. Performing a manipulation results in uninitialized resource.…
Check whether vllm is used in your infrastructure
EchelonGraph scans your cloud and SBOMs to map every package to your actual deployments. See blast radius for vllm CVEs against the assets you own.
Start Free Scan →