vllm.distributed.kv_transfer.kv_connector.v1.offloading.common ¶
Classes:
-
OffloadingWorkerMetadata–Worker -> Scheduler metadata for completed transfer jobs.
-
TransferJob–A transfer job bundling request context with transfer spec.
OffloadingWorkerMetadata dataclass ¶
Bases: KVConnectorWorkerMetadata
Worker -> Scheduler metadata for completed transfer jobs.
Each worker reports {job_id: 1} for newly completed transfer jobs (load or store). aggregate() sums counts across workers within a step. The scheduler accumulates across steps and processes a transfer completion only when count reaches num_workers.
Methods:
-
mark_completed–Record a transfer job completion from this worker.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/offloading/common.py
mark_completed(job_id) ¶
TransferJob dataclass ¶
A transfer job bundling request context with transfer spec.
Used for both loads and stores, keyed by scheduler-assigned job ID. The worker reports the job ID back when the transfer finishes, and the scheduler processes the completion.