vllm.model_executor.layers.quantization.utils.machete_utils ¶
Functions:
-
query_machete_supported_group_sizes–Queries the supported group sizes for Machete based on the activation type.
query_machete_supported_group_sizes(act_type) ¶
Queries the supported group sizes for Machete based on the activation type.
Parameters:
Returns:
-
list[int]–A list of supported group sizes. The group size must
-
list[int]–be divisible by
TileShapeK = 128 * 8 // num_bits(act_type). -
list[int]–-1 indicates per-channel quantization.