vllm.v1.kv_offload.tiering.fs.io ¶
Functions:
-
load_block–Load callback: read one KV block from disk. Remove the file on failure.
-
store_block–Store callback: Writes to a temp file then atomically replaces the destination.
_ensure_dirs(path) ¶
_get_tmp_suffix() ¶
Generate a thread-local unique suffix for temporary files.
Source code in vllm/v1/kv_offload/tiering/fs/io.py
load_block(source_path, view, offset, block_size) ¶
Load callback: read one KV block from disk. Remove the file on failure.
Source code in vllm/v1/kv_offload/tiering/fs/io.py
store_block(dest_path, buffer, offset, block_size) ¶
Store callback: Writes to a temp file then atomically replaces the destination.