vllm.v1.core.kv_cache_metrics
KV cache metrics tracking.
Classes:
- BlockMetricsState – Tracks lifecycle metrics for a single KV cache block.
- KVCacheMetricsCollector – Collects KV cache residency metrics with sampling.
BlockMetricsState
Tracks lifecycle metrics for a single KV cache block.
Source code in vllm/v1/core/kv_cache_metrics.py
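To make "lifecycle metrics for a single block" concrete, here is a minimal sketch of what per-block tracking could look like. It is an illustration only, not the vLLM implementation: the class name `BlockLifecycleSketch`, its attributes, and its methods are all hypothetical, and the real `BlockMetricsState` may record different fields.

```python
import time


class BlockLifecycleSketch:
    """Hypothetical stand-in; NOT the vLLM BlockMetricsState class."""

    def __init__(self, now_ns=None):
        # A monotonic clock avoids negative durations if wall time is adjusted.
        now = time.monotonic_ns() if now_ns is None else now_ns
        self.birth_ns = now          # when the block was allocated
        self.last_access_ns = now    # most recent access
        self.access_count = 0        # accesses since allocation

    def record_access(self, now_ns=None):
        now = time.monotonic_ns() if now_ns is None else now_ns
        self.last_access_ns = now
        self.access_count += 1

    def lifetime_seconds(self, now_ns=None):
        # Time elapsed since the block was allocated.
        now = time.monotonic_ns() if now_ns is None else now_ns
        return (now - self.birth_ns) / 1e9

    def idle_seconds(self, now_ns=None):
        # Time elapsed since the block was last accessed.
        now = time.monotonic_ns() if now_ns is None else now_ns
        return (now - self.last_access_ns) / 1e9
```

The optional `now_ns` arguments exist only so the sketch can be exercised with fixed timestamps; they are an assumption of this example.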
KVCacheMetricsCollector
Collects KV cache residency metrics with sampling.
Methods:
- reset – Clear all state on cache reset.
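The sketch below illustrates the general idea of collecting residency metrics with sampling: only a random fraction of allocated blocks is tracked, which keeps bookkeeping cheap, and a reset discards everything. All names here (`SampledResidencyCollectorSketch`, `sample_rate`, `on_allocated`, `on_evicted`) are hypothetical and chosen for illustration; only `reset` corresponds to a method documented above, and its body here is an assumption about what "clear all state" might mean.

```python
import random
import time


class SampledResidencyCollectorSketch:
    """Hypothetical stand-in; NOT the vLLM KVCacheMetricsCollector class."""

    def __init__(self, sample_rate=0.01, seed=None):
        if not 0.0 < sample_rate <= 1.0:
            raise ValueError("sample_rate must be in (0, 1]")
        self.sample_rate = sample_rate
        self._rng = random.Random(seed)
        self._birth_ns = {}            # block_id -> allocation timestamp
        self.residency_seconds = []    # residency of sampled, evicted blocks

    def on_allocated(self, block_id, now_ns=None):
        # Track only a random fraction of blocks to bound overhead.
        if self._rng.random() < self.sample_rate:
            now = time.monotonic_ns() if now_ns is None else now_ns
            self._birth_ns[block_id] = now

    def on_evicted(self, block_id, now_ns=None):
        # Blocks that were never sampled are ignored.
        birth = self._birth_ns.pop(block_id, None)
        if birth is not None:
            now = time.monotonic_ns() if now_ns is None else now_ns
            self.residency_seconds.append((now - birth) / 1e9)

    def reset(self):
        # Drop every tracked block and every recorded sample.
        self._birth_ns.clear()
        self.residency_seconds.clear()
```

With `sample_rate=1.0` every block is tracked, which is convenient for testing; a small rate such as the default used here trades completeness for lower per-allocation cost.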