vllm.distributed.communication_op
Functions:
- tensor_model_parallel_all_gather – All-gather the input tensor across the model parallel group.
- tensor_model_parallel_all_reduce – All-reduce the input tensor across the model parallel group.
- tensor_model_parallel_gather – Gather the input tensor across the model parallel group.
- tensor_model_parallel_reduce_scatter – Reduce-scatter the input tensor across the model parallel group.
tensor_model_parallel_all_gather(input_, dim=-1)
All-gather the input tensor across the model parallel group.
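To make the all-gather semantics concrete, here is a minimal pure-Python sketch that simulates ranks as entries of a list. It does not call vLLM or torch.distributed; the helper name `simulate_all_gather` is hypothetical, and it assumes 1-D shards concatenated in rank order along the only dimension.

```python
def simulate_all_gather(shards):
    """Simulate an all-gather over 1-D shards.

    shards[r] is the list held by rank r (hypothetical stand-in for a tensor).
    Every rank ends up with the concatenation of all shards in rank order.
    """
    gathered = [x for shard in shards for x in shard]
    # Each rank receives its own copy of the full result.
    return [list(gathered) for _ in shards]


shards = [[1, 2], [3, 4], [5, 6]]  # three simulated ranks
result = simulate_all_gather(shards)
print(result[0])  # [1, 2, 3, 4, 5, 6]
```

After the operation, every simulated rank holds the same full-length list, which is the property that distinguishes all-gather from a plain gather.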
tensor_model_parallel_all_reduce(input_)
All-reduce the input tensor across the model parallel group.
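An all-reduce combines same-shaped inputs from every rank and gives each rank the combined result. The following is a minimal pure-Python simulation, not vLLM code: the helper name `simulate_all_reduce` is hypothetical, and it assumes an element-wise sum as the reduction, the usual choice when combining tensor-parallel partial results.

```python
def simulate_all_reduce(per_rank):
    """Simulate a sum all-reduce.

    per_rank[r] is the 1-D list held by rank r; all lists have equal length.
    Every rank ends up with the element-wise sum over all ranks.
    """
    summed = [sum(values) for values in zip(*per_rank)]
    # Each rank receives its own copy of the reduced result.
    return [list(summed) for _ in per_rank]


per_rank = [[1, 2], [10, 20], [100, 200]]  # three simulated ranks
reduced = simulate_all_reduce(per_rank)
print(reduced[0])  # [111, 222]
```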
tensor_model_parallel_gather(input_, dst=0, dim=-1)
Gather the input tensor across the model parallel group.
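Unlike all-gather, a gather delivers the concatenated result to a single destination rank. The sketch below simulates that in pure Python without calling vLLM; the helper name `simulate_gather` is hypothetical, and returning `None` on non-destination ranks is an assumption of this sketch rather than a documented guarantee.

```python
def simulate_gather(shards, dst=0):
    """Simulate a gather of 1-D shards to rank `dst`.

    shards[r] is the list held by rank r. Only rank `dst` receives the
    concatenation; other ranks get None (an assumption of this sketch).
    """
    gathered = [x for shard in shards for x in shard]
    return [gathered if rank == dst else None for rank in range(len(shards))]


shards = [[1, 2], [3, 4], [5, 6]]  # three simulated ranks
outputs = simulate_gather(shards, dst=0)
print(outputs)  # [[1, 2, 3, 4, 5, 6], None, None]
```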
tensor_model_parallel_reduce_scatter(input_, dim=-1)
Reduce-scatter the input tensor across the model parallel group.
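A reduce-scatter can be thought of as a reduction followed by a split: inputs are combined across ranks, then each rank keeps only its own chunk of the result. The following pure-Python sketch simulates this under stated assumptions: the helper name `simulate_reduce_scatter` is hypothetical, the reduction is an element-wise sum, and the input length must divide evenly by the number of ranks.

```python
def simulate_reduce_scatter(per_rank):
    """Simulate a sum reduce-scatter over 1-D inputs.

    per_rank[r] is the full-length list held by rank r. The inputs are
    summed element-wise, then rank r keeps the r-th contiguous chunk.
    """
    world_size = len(per_rank)
    length = len(per_rank[0])
    if length % world_size != 0:
        raise ValueError("input length must be divisible by the number of ranks")
    summed = [sum(values) for values in zip(*per_rank)]
    chunk = length // world_size
    return [summed[r * chunk:(r + 1) * chunk] for r in range(world_size)]


per_rank = [[1, 2, 3, 4], [10, 20, 30, 40]]  # two simulated ranks
scattered = simulate_reduce_scatter(per_rank)
print(scattered)  # [[11, 22], [33, 44]]
```

Concatenating the per-rank outputs in rank order recovers what a sum all-reduce would have produced, which is why the two operations are often paired with all-gather.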