vllm.kernels.vllm_c

Attributes:

  • CUDA_ALIKE

    Most kernels in this file are supported on all CUDA-alike platforms.

  • IS_ROCM

    ROCm needs shape normalization before calling some vLLM C kernels.

  • rms_add_no_var_size

    vLLM kernel does not support the variance_size parameter and requires matching input/weight dtype.

  • rms_no_var_size

    vLLM kernel requires no variance_size override and matching input/weight dtype.

CUDA_ALIKE = current_platform.is_cuda_alike() module-attribute

Most kernels in this file are supported on all CUDA-alike platforms.

IS_ROCM = current_platform.is_rocm() module-attribute

ROCm needs shape normalization before calling some vLLM C kernels.

rms_add_no_var_size = lambda x, x_residual, weight, epsilon, variance_size=None: variance_size is None and (weight is None or weight.dtype == x.dtype) module-attribute

vLLM kernel does not support the variance_size parameter and requires matching input/weight dtype.

rms_no_var_size = lambda x, weight, epsilon, variance_size=None: variance_size is None and (weight is None or weight.dtype == x.dtype) module-attribute

vLLM kernel requires no variance_size override and matching input/weight dtype.
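To illustrate how the two predicates behave, here is a minimal sketch that copies the lambda definitions listed above and evaluates them on a few cases. The tensor stand-ins (`SimpleNamespace` objects with a string `dtype`) are an assumption made so the example runs without PyTorch or vLLM installed; the predicates only read the `.dtype` attribute, so real tensors should behave the same way. How vLLM actually uses these predicates to select a kernel is not shown here.

```python
from types import SimpleNamespace

# Predicates copied from the attribute definitions above.
rms_no_var_size = lambda x, weight, epsilon, variance_size=None: (
    variance_size is None and (weight is None or weight.dtype == x.dtype)
)
rms_add_no_var_size = lambda x, x_residual, weight, epsilon, variance_size=None: (
    variance_size is None and (weight is None or weight.dtype == x.dtype)
)

# Hypothetical stand-ins for tensors: the predicates only read `.dtype`.
x_fp16 = SimpleNamespace(dtype="float16")
w_fp16 = SimpleNamespace(dtype="float16")
w_fp32 = SimpleNamespace(dtype="float32")

results = {
    # Same dtype, no variance_size override: predicate holds.
    "matching dtype": rms_no_var_size(x_fp16, w_fp16, 1e-6),
    # A missing weight skips the dtype comparison.
    "no weight": rms_no_var_size(x_fp16, None, 1e-6),
    # Input and weight dtypes differ: predicate fails.
    "dtype mismatch": rms_no_var_size(x_fp16, w_fp32, 1e-6),
    # Any variance_size override fails the predicate.
    "variance_size set": rms_no_var_size(x_fp16, w_fp16, 1e-6, variance_size=128),
    # The fused-add variant takes an extra residual argument, same rule.
    "fused add, matching": rms_add_no_var_size(x_fp16, x_fp16, w_fp16, 1e-6),
}

for case, ok in results.items():
    print(f"{case}: {ok}")
```

Both predicates apply the same rule; they differ only in signature, since the fused-add variant also receives `x_residual`.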