vllm.entrypoints.openai.engine.protocol ¶
Classes:
-
GenerationError–raised when finish_reason indicates internal server error (500)
Functions:
-
validate_structural_tag_response_format–Validate structural tags before they are sent to the engine.
GenerationError ¶
Bases: Exception
raised when finish_reason indicates internal server error (500)
Source code in vllm/entrypoints/openai/engine/protocol.py
validate_structural_tag_response_format(response_format) ¶
Validate structural tags before they are sent to the engine.
Engine-side validation reports malformed structural tags as generation failures. OpenAI request parsing should classify them as bad requests.