vllm.model_executor.models.blip ¶
Minimal implementation of BlipVisionModel intended to be only used within a vision language model.
Classes:
-
BlipAttention–Multi-headed attention from 'Attention Is All You Need' paper
-
BlipEncoder–Transformer encoder consisting of
config.num_hidden_layersself
BlipAttention ¶
Bases: Module
Multi-headed attention from 'Attention Is All You Need' paper
Methods:
-
forward–Input shape: Batch x Time x Channel
Source code in vllm/model_executor/models/blip.py
forward(hidden_states) ¶
Input shape: Batch x Time x Channel
Source code in vllm/model_executor/models/blip.py
BlipEncoder ¶
Bases: Module
Transformer encoder consisting of config.num_hidden_layers self attention layers. Each layer is a [BlipEncoderLayer].
Parameters:
-
(config¶BlipVisionConfig) –BlipConfig