vllm.model_executor.models.qianfan_ocr ¶
Classes:
-
QianfanOCRForConditionalGeneration–QianfanOCR multimodal model.
-
QianfanOCRProcessingInfo–Image-only ProcessingInfo for QianfanOCR (no video support).
QianfanOCRForConditionalGeneration ¶
Bases: InternVLChatModel
QianfanOCR multimodal model.
Identical in structure to InternVLChatModel (InternViT vision encoder + pixel-shuffle MLP connector + Qwen3 language model). This class exists solely to register the QianfanOCRForConditionalGeneration architecture name that appears in the model's config.json.
Source code in vllm/model_executor/models/qianfan_ocr.py
QianfanOCRProcessingInfo ¶
Bases: BaseInternVLProcessingInfo
Image-only ProcessingInfo for QianfanOCR (no video support).