vllm.v1.sample
Modules:
- logits_processor
- ops
- rejection_sampler
- sampler – A layer that samples the next tokens from the model's outputs.
- thinking_budget_state – Per-batch thinking token budget state; applied after penalties at sample time.
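
The ordering described above (penalties first, then the thinking budget, then sampling) can be sketched in plain Python. This is an illustrative toy, not vLLM's implementation: every function name here (`apply_repetition_penalty`, `apply_thinking_budget`, `sample_next_token`), the `end_think_id` parameter, and the choice to force an end-of-thinking token once the budget is spent are assumptions made for the example.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities; -inf logits get probability 0."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def apply_repetition_penalty(logits, seen_token_ids, penalty=1.2):
    """Hypothetical penalty step: make already-seen tokens less likely."""
    out = list(logits)
    for t in set(seen_token_ids):
        out[t] = out[t] / penalty if out[t] > 0 else out[t] * penalty
    return out


def apply_thinking_budget(logits, thinking_tokens_used, budget, end_think_id):
    """Hypothetical budget step, run after penalties.

    Assumption: once the budget is exhausted, every token except the
    end-of-thinking token is masked out so that it must be chosen next.
    """
    if thinking_tokens_used < budget:
        return list(logits)
    return [x if i == end_think_id else float("-inf")
            for i, x in enumerate(logits)]


def sample_next_token(logits, temperature=1.0, rng=random):
    """Greedy pick when temperature is 0, otherwise sample from softmax."""
    if temperature == 0.0:
        return max(range(len(logits)), key=lambda i: logits[i])
    probs = softmax(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


logits = [2.0, 1.0, 0.5, -1.0]
penalized = apply_repetition_penalty(logits, seen_token_ids=[0])
capped = apply_thinking_budget(penalized, thinking_tokens_used=5,
                               budget=5, end_think_id=3)
next_token = sample_next_token(capped, temperature=1.0)
```

In this sketch the budget mask is applied to the already-penalized logits, so a penalty can never resurrect a token that the budget step has ruled out.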