vllm.lora.layers.replicated_linear ¶
Classes:
ReplicatedLinearWithLoRA ¶
Bases: BaseLinearLayerWithLoRA
Methods:
-
forward–Forward of ReplicatedLinearWithLoRA
-
slice_lora_a–Slice lora a if splitting for tensor parallelism.
-
slice_lora_b–Slice lora b if splitting with tensor parallelism.
Source code in vllm/lora/layers/replicated_linear.py
forward(input_) ¶
Forward of ReplicatedLinearWithLoRA
Parameters:
Returns:
Source code in vllm/lora/layers/replicated_linear.py
slice_lora_a(lora_a) ¶
Slice lora a if splitting for tensor parallelism.
slice_lora_b(lora_b) ¶
Slice lora b if splitting with tensor parallelism.