vllm.entrypoints.cli.launch ¶
Classes:
-
LaunchSubcommand–The
launchsubcommand for the vLLM CLI. -
LaunchSubcommandBase–The base class of subcommands for
vllm launch. -
RenderSubcommand–The
rendersubcommand forvllm launch.
Functions:
-
run_launch_fastapi–Run the online serving layer with FastAPI (no GPU inference).
LaunchSubcommand ¶
Bases: CLISubcommand
The launch subcommand for the vLLM CLI.
Uses nested sub-subcommands so each component can define its own arguments independently (e.g. vllm launch render).
Source code in vllm/entrypoints/cli/launch.py
LaunchSubcommandBase ¶
Bases: CLISubcommand
The base class of subcommands for vllm launch.
Methods:
-
add_cli_args–Add the CLI arguments to the parser.
Source code in vllm/entrypoints/cli/launch.py
add_cli_args(parser) classmethod ¶
Add the CLI arguments to the parser.
By default, adds the standard vLLM serving arguments. Subclasses can override to add component-specific arguments.
Source code in vllm/entrypoints/cli/launch.py
RenderSubcommand ¶
Bases: LaunchSubcommandBase
The render subcommand for vllm launch.
Source code in vllm/entrypoints/cli/launch.py
run_launch_fastapi(args) async ¶
Run the online serving layer with FastAPI (no GPU inference).