vllm.v1.worker.gpu.model_states.interface ¶
Classes:
-
ModelSpecificAttnMetadata–Base class for model-specific attention metadata.
ModelSpecificAttnMetadata ¶
Base class for model-specific attention metadata.