vllm.v1.kv_offload
Modules:
| Name | Description |
|---|---|
| abstract | OffloadingManager class for managing KV data offloading in vLLM v1 |
| arc_manager | |
| backend | |
| factory | |
| lru_manager | |
| mediums | |
| reuse_manager | Reuse-frequency gating for CPU KV-cache offload stores. |
| spec | |
| worker | |
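
The `reuse_manager` entry describes reuse-frequency gating only in one line, so the following is a minimal sketch of the general idea, not the vLLM implementation. It assumes that "gating" means a block is admitted to the CPU store only after its hash has been seen a minimum number of times. The class `ReuseGate`, its method `should_store`, and the parameters `threshold` and `max_tracked` are hypothetical names introduced for illustration and are not part of the vLLM API.

```python
from collections import OrderedDict


class ReuseGate:
    """Hypothetical gate: admit a block for offload once it has been seen
    at least `threshold` times. Not the vLLM implementation."""

    def __init__(self, threshold=2, max_tracked=1024):
        if threshold < 1 or max_tracked < 1:
            raise ValueError("threshold and max_tracked must be >= 1")
        self.threshold = threshold
        self.max_tracked = max_tracked
        # Maps block hash -> number of times seen, ordered from least
        # recently seen to most recently seen.
        self._seen = OrderedDict()

    def should_store(self, block_hash):
        """Record one sighting of `block_hash` and report whether the
        block has now been seen often enough to be worth storing."""
        count = self._seen.pop(block_hash, 0) + 1
        self._seen[block_hash] = count  # re-insert as most recently seen
        if len(self._seen) > self.max_tracked:
            # Bound the tracker's memory by forgetting the least
            # recently seen hash (and its count).
            self._seen.popitem(last=False)
        return count >= self.threshold


gate = ReuseGate(threshold=2, max_tracked=2)
first = gate.should_store(b"block-a")   # first sighting: below threshold
second = gate.should_store(b"block-a")  # second sighting: reaches threshold
```

In this sketch, blocks that appear only once never reach the store, which is the usual motivation for a gate of this kind: it avoids spending copy bandwidth and CPU memory on KV data that is unlikely to be requested again.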