vllm.models.deepseek_v4.common.ops.save_partial_states
Functions:
- save_partial_states – Write packed [kv, score+ape] partial states into the compressor cache.
save_partial_states(kv, score, ape, positions, state_cache, slot_mapping, block_size, state_width, compress_ratio, pdl_kwargs=None)
Write packed [kv, score+ape] partial states into the compressor cache.
One program is launched per token; padding tokens (slot_id == -1) are skipped.
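The behavior described above can be illustrated with a pure-Python reference sketch. It is not the actual kernel. Everything beyond the parameter names is an assumption made for illustration: the function name `save_partial_states_reference` is hypothetical, the cache is assumed to be laid out as `[num_blocks][block_size][2 * state_width]`, a slot id is assumed to decompose into `(block, offset)` via `divmod(slot_id, block_size)`, and `ape` is assumed to be indexed by `position % compress_ratio`. The `pdl_kwargs` argument is omitted because its role is not documented here.

```python
def save_partial_states_reference(kv, score, ape, positions, state_cache,
                                  slot_mapping, block_size, state_width,
                                  compress_ratio):
    """Illustrative reference only; layouts and indexing are assumptions."""
    # One loop iteration stands in for one "program" per token.
    for token, slot_id in enumerate(slot_mapping):
        if slot_id == -1:
            continue  # padding token: nothing is written

        # ASSUMPTION: flat slot id -> (block index, offset inside the block).
        block, offset = divmod(slot_id, block_size)

        # ASSUMPTION: ape holds one row per position within a compression window.
        ape_row = ape[positions[token] % compress_ratio]
        biased = [s + a for s, a in zip(score[token], ape_row)]

        # Packed row: [kv, score + ape], each of width state_width.
        state_cache[block][offset] = (
            list(kv[token][:state_width]) + biased[:state_width]
        )


# Tiny example: 3 tokens, the middle one is padding.
kv = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
score = [[10.0, 20.0], [30.0, 40.0], [50.0, 60.0]]
ape = [[1.0, 2.0], [3.0, 4.0]]
positions = [0, 1, 2]
slot_mapping = [3, -1, 0]
state_cache = [[[0.0] * 4 for _ in range(2)] for _ in range(2)]

save_partial_states_reference(kv, score, ape, positions, state_cache,
                              slot_mapping, block_size=2, state_width=2,
                              compress_ratio=2)
```

In this example token 0 lands in block 1, offset 1, token 2 lands in block 0, offset 0, and the padding token leaves the cache untouched.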