vllm.distributed.kv_transfer.kv_connector.v1.hf3fs.utils.common ¶
Classes:
-
AtomicCounter–Thread-safe atomic counter for round-robin operations.
-
HF3FSConnectorMetadata–Container for HF3FS connector metadata.
-
HF3FSRequestMetadata–Metadata for a single request in HF3FS connector.
-
LoadBlockInfo–Operation for loading blocks from external storage.
-
RequestSchedulingState–Unified request scheduling state management.
-
SaveBlockInfo–Operation for saving blocks to external storage.
AtomicCounter ¶
Thread-safe atomic counter for round-robin operations.
Methods:
-
next–Get next value in round-robin fashion.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
next() ¶
HF3FSConnectorMetadata ¶
Bases: KVConnectorMetadata
Container for HF3FS connector metadata.
Methods:
-
add_request–Add request to metadata.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
HF3FSRequestMetadata dataclass ¶
Metadata for a single request in HF3FS connector.
Methods:
-
from_scheduling_state–Create request metadata from scheduling state.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
from_scheduling_state(state, block_size, load_op=None, skip_leading_blocks=None) staticmethod ¶
Create request metadata from scheduling state.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
LoadBlockInfo dataclass ¶
Operation for loading blocks from external storage.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
RequestSchedulingState dataclass ¶
Unified request scheduling state management.
Methods:
-
is_ready_to_load–Check if request is ready for loading.
-
needs_loading–Check if request needs loading.
-
update_tokens_and_blocks–Update with new tokens and blocks.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
_normalize_block_ids(block_ids) ¶
Normalize block_ids to list format.
Source code in vllm/distributed/kv_transfer/kv_connector/v1/hf3fs/utils/common.py
is_ready_to_load() ¶
needs_loading() ¶
update_tokens_and_blocks(new_token_ids, new_block_ids) ¶
Update with new tokens and blocks.