vllm.v1.attention.ops.flashmla
Functions:
- is_flashmla_dense_supported – Return: is_supported_flag, unsupported_reason (optional).
- is_flashmla_sparse_supported – Return: is_supported_flag, unsupported_reason (optional).
is_flashmla_dense_supported()
Returns a pair: is_supported_flag, then unsupported_reason (optional).
Source code in vllm/v1/attention/ops/flashmla.py
is_flashmla_sparse_supported()
Returns a pair: is_supported_flag, then unsupported_reason (optional).
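The two-value return described above could be consumed roughly as follows. This is a minimal sketch, not vLLM code: so that it runs without vLLM or a GPU, `fake_dense_check` and `fake_sparse_check` are hypothetical stand-ins that only mimic the documented return shape. The `SupportResult` alias, the `describe` helper, and the reason string are likewise assumptions made for illustration.

```python
from typing import Callable, Optional, Tuple

# Assumed return shape, read off the docstring:
# (is_supported_flag, unsupported_reason), where the reason is optional.
SupportResult = Tuple[bool, Optional[str]]


def fake_dense_check() -> SupportResult:
    """Hypothetical stand-in for is_flashmla_dense_supported()."""
    return True, None


def fake_sparse_check() -> SupportResult:
    """Hypothetical stand-in for is_flashmla_sparse_supported()."""
    return False, "example reason (made up for this sketch)"


def describe(name: str, check: Callable[[], SupportResult]) -> str:
    """Unpack the pair and turn it into a one-line status message."""
    is_supported, reason = check()
    if is_supported:
        return f"{name}: supported"
    # The reason is optional, so fall back to a generic message.
    return f"{name}: unsupported ({reason or 'no reason given'})"


print(describe("dense", fake_dense_check))
print(describe("sparse", fake_sparse_check))
```

With vLLM installed, the stand-ins would presumably be replaced by the real functions imported from `vllm.v1.attention.ops.flashmla`. Unpacking both values at the call site keeps the reason available for logging when the flag is false.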