vllm.v1
Modules:
- attention
- core
- cudagraph_dispatcher
- engine
- executor
- kv_cache_interface
- kv_cache_spec_registry – Registry for KVCacheSpec types and their associated managers.
- kv_offload
- metrics
- outputs
- pool
- request
- sample
- serial_utils
- simple_kv_offload
- spec_decode
- structured_output
- utils
- worker