CVE-2025-46570 - Vulnerability Details

vLLM is an inference and serving engine for large language models (LLMs). In versions prior to 0.9.0, when a new prompt is processed and the PagedAttention mechanism finds a matching cached prefix chunk, prefill completes faster, and the speedup is visible in the TTFT (Time to First Token). The timing difference is large enough for a client to detect and exploit as a side channel. The issue is patched in version 0.9.0.
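The leak described above can be illustrated with a toy model. The sketch below is not vLLM code: the block size, the chained hashing scheme, the `ToyPrefixCache` class, and the `salt` parameter are all assumptions made for illustration, and the "cost" it returns is a deterministic stand-in for TTFT rather than a real timing measurement. It only shows why a shared prefix cache lets one client infer whether another client's prompt began with a guessed prefix, and why keying the cache per client removes that signal in this model.

```python
import hashlib
from typing import Dict, List, Set

BLOCK_SIZE = 4  # tokens per cached block (illustrative value, not vLLM's)


def block_hashes(tokens: List[int], salt: str = "") -> List[str]:
    """Hash each full block chained with everything before it."""
    hashes: List[str] = []
    parent = salt  # hypothetical per-client salt; empty means a shared cache
    full = len(tokens) - len(tokens) % BLOCK_SIZE
    for start in range(0, full, BLOCK_SIZE):
        block = tokens[start:start + BLOCK_SIZE]
        parent = hashlib.sha256(f"{parent}|{block}".encode()).hexdigest()
        hashes.append(parent)
    return hashes


class ToyPrefixCache:
    """Toy shared cache: prefill cost = tokens not covered by cached blocks."""

    def __init__(self) -> None:
        self.cached: Set[str] = set()

    def prefill_cost(self, tokens: List[int], salt: str = "") -> int:
        hashes = block_hashes(tokens, salt)
        hits = 0
        for h in hashes:
            if h not in self.cached:
                break  # only a contiguous leading run of blocks can be reused
            hits += 1
        self.cached.update(hashes)
        return len(tokens) - hits * BLOCK_SIZE  # stand-in for TTFT


def run_demo() -> Dict[str, int]:
    victim = list(range(1, 13))  # 12 tokens, i.e. 3 full blocks

    shared = ToyPrefixCache()
    shared.prefill_cost(victim)  # victim's request populates the cache
    matching = shared.prefill_cost(victim[:8] + [99, 98, 97, 96])
    wrong = shared.prefill_cost([7, 7, 7, 7] + victim[4:])

    salted = ToyPrefixCache()
    salted.prefill_cost(victim, salt="tenant-a")
    isolated = salted.prefill_cost(victim, salt="tenant-b")

    return {"matching_guess": matching, "wrong_guess": wrong,
            "salted_guess": isolated}


if __name__ == "__main__":
    print(run_demo())
```

In this model a guess that shares the victim's first two blocks costs 4 tokens of prefill while a wrong guess costs 12, which is the kind of gap the description says is observable in TTFT. With distinct salts the same prompt costs the full 12 tokens, so the attacker learns nothing from the cost.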

CVSS v3.1: 2.6 (CVSS:3.1/AV:N/AC:H/PR:L/UI:R/S:U/C:L/I:N/A:N)

Attack Vector: Network

Attack Complexity: High

Privileges Required: Low

Scope: Unchanged

Confidentiality Impact: Low

Integrity Impact: None

Availability Impact: None

User Interaction: Required

No CVSS v4.0, v3.0, or v2 score is listed.
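The 2.6 score recorded for this CVE follows from the vector CVSS:3.1/AV:N/AC:H/PR:L/UI:R/S:U/C:L/I:N/A:N. As a worked check, the sketch below applies the CVSS v3.1 base score equations and metric weights from the FIRST specification; the function and table names are illustrative choices, and temporal and environmental metrics are not handled.

```python
import math

# Metric weights from the CVSS v3.1 specification (base metrics only).
AV = {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2}
AC = {"L": 0.77, "H": 0.44}
UI = {"N": 0.85, "R": 0.62}
CIA = {"H": 0.56, "L": 0.22, "N": 0.0}
PR_UNCHANGED = {"N": 0.85, "L": 0.62, "H": 0.27}
PR_CHANGED = {"N": 0.85, "L": 0.68, "H": 0.5}


def roundup(value: float) -> float:
    """Round up to one decimal place as defined in the v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score(vector: str) -> float:
    """Compute the CVSS v3.1 base score from a vector string."""
    # Drop the "CVSS:3.1" prefix and map each metric name to its value.
    metrics = dict(part.split(":") for part in vector.split("/")[1:])
    changed = metrics["S"] == "C"
    pr = (PR_CHANGED if changed else PR_UNCHANGED)[metrics["PR"]]

    iss = 1 - ((1 - CIA[metrics["C"]])
               * (1 - CIA[metrics["I"]])
               * (1 - CIA[metrics["A"]]))
    if changed:
        impact = 7.52 * (iss - 0.029) - 3.25 * (iss - 0.02) ** 15
    else:
        impact = 6.42 * iss
    exploitability = (8.22 * AV[metrics["AV"]] * AC[metrics["AC"]]
                      * pr * UI[metrics["UI"]])

    if impact <= 0:
        return 0.0
    if changed:
        return roundup(min(1.08 * (impact + exploitability), 10))
    return roundup(min(impact + exploitability, 10))


if __name__ == "__main__":
    print(base_score("CVSS:3.1/AV:N/AC:H/PR:L/UI:R/S:U/C:L/I:N/A:N"))  # 2.6
```

For this vector the impact sub-score is 6.42 × 0.22 ≈ 1.41 and the exploitability sub-score is 8.22 × 0.85 × 0.44 × 0.62 × 0.62 ≈ 1.18; their sum, about 2.59, rounds up to 2.6.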

This CVE is not in the KEV list.

SSVC

Exploitation: none

Automatable: no

Technical Impact: partial

Vendors	Products
Vllm	Vllm
Vllm-project	Vllm

Configuration 1

cpe:2.3:a:vllm:vllm:*:*:*:*:*:*:*:*


References

Link	Providers
https://github.com/vllm-project/vllm/commit/77073c77bc2006eb80ea6d5128f076f5e6c6f54f
https://github.com/vllm-project/vllm/pull/17045
https://github.com/vllm-project/vllm/security/advisories/GHSA-4qjh-9fv9-r85r
https://nvd.nist.gov/vuln/detail/CVE-2025-46570
https://www.cve.org/CVERecord?id=CVE-2025-46570

History

Tue, 24 Jun 2025 19:00:00 +0000

Type	Values Removed	Values Added
First Time appeared		Vllm Vllm vllm
Weaknesses		CWE-203
CPEs		cpe:2.3:a:vllm:vllm:*:*:*:*:*:*:*:*
Vendors & Products		Vllm Vllm vllm

Fri, 30 May 2025 21:45:00 +0000

Type	Values Removed	Values Added
References		https://nvd.nist.gov/vuln/detail/CVE-2025-46570 https://www.cve.org/CVERecord?id=CVE-2025-46570
Metrics	threat_severity `None`	threat_severity `Low`

Thu, 29 May 2025 18:15:00 +0000

Type	Values Removed	Values Added
Metrics		ssvc `{'options': {'Automatable': 'no', 'Exploitation': 'none', 'Technical Impact': 'partial'}, 'version': '2.0.3'}`

Thu, 29 May 2025 16:45:00 +0000

Type	Values Removed	Values Added
Description		vLLM is an inference and serving engine for large language models (LLMs). Prior to version 0.9.0, when a new prompt is processed, if the PageAttention mechanism finds a matching prefix chunk, the prefill process speeds up, which is reflected in the TTFT (Time to First Token). These timing differences caused by matching chunks are significant enough to be recognized and exploited. This issue has been patched in version 0.9.0.
Title		vLLM’s Chunk-Based Prefix Caching Vulnerable to Potential Timing Side-Channel
Weaknesses		CWE-208
References		https://github.com/vllm-project/vllm/commit/77073c77bc2006eb80ea6d5128f076f5e6c6f54f https://github.com/vllm-project/vllm/pull/17045 https://github.com/vllm-project/vllm/security/advisories/GHSA-4qjh-9fv9-r85r
Metrics		cvssV3_1 `{'score': 2.6, 'vector': 'CVSS:3.1/AV:N/AC:H/PR:L/UI:R/S:U/C:L/I:N/A:N'}`

MITRE

Status: PUBLISHED

Assigner: GitHub_M

Published: 2025-05-29T16:32:42.794Z

Updated: 2025-05-29T18:05:10.768Z

Reserved: 2025-04-24T21:10:48.175Z

Link: CVE-2025-46570

Vulnrichment

Updated: 2025-05-29T18:05:04.545Z

NVD

Status: Analyzed

Published: 2025-05-29T17:15:21.327

Modified: 2025-06-24T18:25:31.883

Link: CVE-2025-46570

Redhat

Severity: Low

Public Date: 2025-05-29T16:32:42Z

Links: CVE-2025-46570 - Bugzilla

