CVE-2025-47277: Critical Vulnerability in vLLM’s `PyNcclPipe` KV Cache Transfer Integration

Overview

The vulnerability CVE-2025-47277 is a highly critical security flaw found in the vLLM, an inference and serving engine for large language models. Specifically, the vulnerability affects the `PyNcclPipe` KV cache transfer integration with the V0 engine in versions 0.6.5 through 0.8.4. The exploitation of this vulnerability could potentially lead to system compromise or data leakage, posing significant risks to any environment using the affected configurations. The severity of this vulnerability underscores the importance of immediate action to mitigate the risks.

Vulnerability Summary

CVE ID: CVE-2025-47277
Severity: Critical (CVSS: 9.8)
Attack Vector: Network
Privileges Required: None
User Interaction: None
Impact: System Compromise and Data Leakage

Affected Products

Escape the Surveillance Era

Most apps won’t tell you the truth.
They’re part of the problem.

Phone numbers. Emails. Profiles. Logs.
It’s all fuel for surveillance.

Ameeba Chat gives you a way out.

• No phone number
• No email
• No personal info
• Anonymous aliases
• End-to-end encrypted

Chat without a trace.

Download Ameeba Chat Learn More

Product | Affected Versions

vLLM | 0.6.5 to 0.8.4

How the Exploit Works

The vulnerability arises from vLLM’s implementation of the `PyNcclPipe` class to establish a P2P communication domain for data transmission between distributed nodes. The `TCPStore` interface, part of the PyTorch framework, was intended to listen only on the IP address specified by the `–kv-ip` CLI parameter, thus limiting exposure to a private, secured network. However, due to a security oversight, the `TCPStore` interface listens on all interfaces, disregarding the provided IP address. This behavior exposes the vLLM engine to potential unauthorized access and exploitation over the network.

Conceptual Example Code

Here’s a conceptual example of an HTTP request that could exploit this vulnerability:

POST /vLLM/execute HTTP/1.1
Host: vulnerable.example.com
Content-Type: application/json
{
"command": "dump_kvcache",
"kvstore": "PyNcclPipe"
}

In this conceptual example, an attacker sends a malicious `POST` request to the vLLM engine’s execution endpoint. The `”dump_kvcache”` command could force the vLLM engine to dump the contents of the KV cache, potentially leaking sensitive data over the network.

Mitigation Steps

This vulnerability has been patched in vLLM version 0.8.5, which now limits the `TCPStore` socket to the private interface as configured. All users are advised to update to this version or later to mitigate the risks posed by CVE-2025-47277.
In situations where immediate patching is not feasible, deploying a Web Application Firewall (WAF) or Intrusion Detection System (IDS) could serve as a temporary mitigation measure. These systems can monitor and control incoming network traffic based on predefined security policies, potentially preventing exploitation of this vulnerability.
Despite these measures, it is imperative to apply the vendor patch as soon as possible to fully address the vulnerability.

Ameeba Exploit Tracker

CVE-2025-47277: Critical Vulnerability in vLLM’s `PyNcclPipe` KV Cache Transfer Integration

Escape the Surveillance Era

More posts

CVE-2025-52802: Missing Authorization Vulnerability in Import YouTube Videos as WP Posts Plugin

CVE-2025-52715: PHP Remote File Inclusion Vulnerability in RadiusTheme Classified Listing

CVE-2025-52708: PHP Remote File Inclusion Vulnerability in RealMag777 HUSKY

CVE-2025-48705: Critical Vulnerability in COROS PACE 3 Leads to Forced Device Reboot