Whisper ONNX Optimized Models
Optimized Whisper ONNX models packaged for easy deployment. Each zip contains all necessary files for inference.
Models Available
Model | Language | Size | Target Use | Download |
---|---|---|---|---|
Small English | English-only | 107MB | Fast English transcription | whisper-small-en-onnx.zip |
Small Multilingual | 99 languages | 245MB | Fast multilingual transcription | whisper-small-multilingual-onnx.zip |
Medium English | English-only | 247MB | High quality English transcription | whisper-medium-en-onnx.zip |
Medium Multilingual | 99 languages | 602MB | High quality multilingual | whisper-medium-multilingual-onnx.zip |
Large v3 Turbo | 99 languages | 646MB | Best quality, fastest large model | whisper-large-v3-turbo-onnx.zip |
Contents of Each Zip
Each zip file contains 6 files needed for inference:
ONNX Model Files
- encoder_model_quantized.onnx - Audio encoder (processes mel spectrograms)
- decoder_with_past_model_quantized.onnx - Text decoder (generates the transcription), optimized with KV caching
Configuration Files
- config.json - Model configuration
- generation_config.json - Generation parameters
- preprocessor_config.json - Audio preprocessing settings
- tokenizer.json - Tokenizer vocabulary
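
As a quick sanity check after downloading, the file list above can be compared against the contents of a zip. The following is a minimal sketch using only the Python standard library. The helper name `missing_files` is hypothetical, and it assumes the files may sit either at the archive root or inside a folder, so it compares base names only.

```python
import zipfile
from pathlib import PurePosixPath

# File names as listed in this README.
EXPECTED_FILES = {
    "encoder_model_quantized.onnx",
    "decoder_with_past_model_quantized.onnx",
    "config.json",
    "generation_config.json",
    "preprocessor_config.json",
    "tokenizer.json",
}


def missing_files(zip_path):
    """Return the expected file names that are absent from the archive.

    Hypothetical helper: the internal layout of the zip is an assumption,
    so only base names are compared, not full paths.
    """
    with zipfile.ZipFile(zip_path) as archive:
        present = {PurePosixPath(name).name for name in archive.namelist()}
    return sorted(EXPECTED_FILES - present)
```

Calling `missing_files("whisper-small-en-onnx.zip")` on a complete download should return an empty list. Any names it does return point to files that need to be fetched again before running inference.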
Model Sources
These models are repackaged from:
- Distil-Whisper (English models)
- ONNX Community (Multilingual models)
License
Models inherit their original licenses:
- Distil-Whisper models: MIT License
- Whisper models: MIT License
Version History
- v1.0.0 - Initial release with 5 optimized models
Model tree for SimFonX/whisper-onnx-optimized
Base model
openai/whisper-small