vllm.model_executor.layers.fused_moe.oracle.mxfp8
Functions:
- select_mxfp8_moe_backend – Select the MXFP8 MoE backend and the best expert class.
_select_kernel_cls(backend, config)
Select the first supported expert class for the MXFP8 config.
Source code in vllm/model_executor/layers/fused_moe/oracle/mxfp8.py
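The "first supported" selection described above can be illustrated with a minimal, self-contained sketch. It does not reproduce the vLLM implementation: `ToyConfig`, `ToyExpertsA`, `ToyExpertsB`, the `is_supported` check, and `select_first_supported` are all hypothetical stand-ins, and the assumption that candidates are tried in a fixed priority order is mine rather than something this page states.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class ToyConfig:
    # Hypothetical stand-in for the MoE config object; the real config
    # type and its fields are not shown on this page.
    block_size: int


class ToyExpertsA:
    # Hypothetical expert class that only handles one block size.
    @staticmethod
    def is_supported(config: ToyConfig) -> bool:
        return config.block_size == 64


class ToyExpertsB:
    # Hypothetical expert class that handles a wider set of block sizes.
    @staticmethod
    def is_supported(config: ToyConfig) -> bool:
        return config.block_size in (32, 64)


def select_first_supported(candidates: Sequence[type], config: ToyConfig) -> type:
    """Return the first candidate class that reports support for config.

    Candidates are assumed to be listed in priority order (an assumption
    made for this sketch).
    """
    for cls in candidates:
        if cls.is_supported(config):
            return cls
    raise ValueError(f"No supported expert class for {config!r}")


if __name__ == "__main__":
    candidates = [ToyExpertsA, ToyExpertsB]
    print(select_first_supported(candidates, ToyConfig(block_size=64)).__name__)
    print(select_first_supported(candidates, ToyConfig(block_size=32)).__name__)
```

In this sketch, list order decides ties: both toy classes accept a block size of 64, but `ToyExpertsA` wins because it is listed first. Raising an error when nothing matches is one possible design; the behavior of the real helper in that case is not documented here.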
select_mxfp8_moe_backend(config)
Select the MXFP8 MoE backend and the best expert class.
Returns:
- tuple[Fp8MoeBackend, type[FusedMoEExperts]] – A tuple of (fp8_backend, experts_cls).