vllm.model_executor.models.midashenglm
Inference-only MiDashengLM model compatible with HuggingFace weights.
Classes:
- MiDashengLMAudioInputs: Tensor schema for the audio inputs; its dimensions are listed under the class entry.
Functions:
- calculate_mel_frames_dasheng: Calculate the number of Mel-spectrogram frames.
MiDashengLMAudioInputs
Bases: TensorSchema
Dimensions
- bn: Batch size * number of audios
- p: Number of sampling points
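To make the two dimensions concrete, here is a minimal sketch of how a nested batch of waveforms could be flattened into a `(bn, p)` layout. The helper name `flatten_audio_batch` is hypothetical and is not part of vLLM. The sketch also assumes every audio already has the same number of sampling points; the real model code may pad or validate differently.

```python
from typing import List, Tuple


def flatten_audio_batch(
    batch: List[List[List[float]]],
) -> Tuple[List[List[float]], Tuple[int, int]]:
    """Flatten [request][audio][sample] into rows with shape (bn, p).

    Hypothetical helper for illustration only; not a vLLM API.
    """
    # bn = batch size * number of audios: one row per audio clip.
    rows = [audio for request in batch for audio in request]
    # p = number of sampling points per audio clip.
    p = len(rows[0]) if rows else 0
    if any(len(row) != p for row in rows):
        raise ValueError("all audios must have the same number of sampling points")
    return rows, (len(rows), p)


# Two requests with two audios each, eight sampling points per audio.
batch = [
    [[0.0] * 8, [0.1] * 8],
    [[0.2] * 8, [0.3] * 8],
]
rows, shape = flatten_audio_batch(batch)
print(shape)
```

Here `bn` is 4 (2 requests times 2 audios) and `p` is 8.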
Source code in vllm/model_executor/models/midashenglm.py
calculate_mel_frames_dasheng(audio_length_samples, n_fft=512, hop_size=160, dasheng_subsampling=4, center=True, model_subsampling=5)
Calculate the number of Mel-spectrogram frames.
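The page gives only the signature, so the following is a sketch of how such a count could be computed, not the source of `calculate_mel_frames_dasheng`. It assumes the usual STFT frame-count convention (with `center=True` the signal is padded by `n_fft // 2` on each side) and assumes the two subsampling factors are applied as successive floor divisions. The name `estimate_mel_frames` is hypothetical.

```python
def estimate_mel_frames(
    audio_length_samples: int,
    n_fft: int = 512,
    hop_size: int = 160,
    dasheng_subsampling: int = 4,
    center: bool = True,
    model_subsampling: int = 5,
) -> int:
    """Estimate the frame count after STFT and two subsampling stages.

    Illustrative only: the rounding and padding rules are assumptions.
    """
    if center:
        # Assumed: padding n_fft // 2 on both sides adds n_fft samples in total.
        audio_length_samples += n_fft
    if audio_length_samples < n_fft:
        return 0  # Too short for even one analysis window.
    # Standard STFT frame count: one frame, plus one per full hop.
    stft_frames = 1 + (audio_length_samples - n_fft) // hop_size
    # Assumed: each subsampling stage floors the frame count.
    return stft_frames // dasheng_subsampling // model_subsampling


# One second of 16 kHz audio with the default parameters.
print(estimate_mel_frames(16000))
```

Under these assumptions, 16000 samples give 101 STFT frames, then 25 after the first subsampling stage and 5 after the second.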