larger models

by ctranslate2-4you - opened Jan 30, 2024

Jan 30, 2024

Any plans to be able to utilize larger models, like large-v2...or quantized models like those implemented by ctranslate2/faster-whisper/whisperx and so on...Or perhaps it already exists and I just missed it?

jpc

WhisperSpeech org Jan 31, 2024

Hey, we did not see performance to improve a lot when moving to larger models. We'll probably revisit this once we have more conditioning options (emotions, prosody, etc.).

For CTranslate2/fast-whisper we'd love to have our models running there but we did not have the resources to do it ourselves. For now we rely on torch.compile to improve inference performance.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment