SimFonX's picture
Update README.md
54d0c7f verified
---
license: mit
language:
- en
- de
- fr
- es
- pt
- it
base_model:
- openai/whisper-small
pipeline_tag: automatic-speech-recognition
---
# Whisper ONNX Optimized Models
Optimized Whisper ONNX models packaged for easy deployment. Each zip contains all necessary files for inference.
## Models Available
| Model | Language | Size | Target Use | Download |
|-------|----------|------|------------|----------|
| **Small English** | English-only | 107MB | Fast English transcription | [whisper-small-en-onnx.zip](small-en/whisper-small-en-onnx.zip) |
| **Small Multilingual** | 99 languages | 245MB | Fast multilingual transcription | [whisper-small-multilingual-onnx.zip](small-multilingual/whisper-small-multilingual-onnx.zip) |
| **Medium English** | English-only | 247MB | High quality English transcription | [whisper-medium-en-onnx.zip](medium-en/whisper-medium-en-onnx.zip) |
| **Medium Multilingual** | 99 languages | 602MB | High quality multilingual | [whisper-medium-multilingual-onnx.zip](medium-multilingual/whisper-medium-multilingual-onnx.zip) |
| **Large v3 Turbo** | 99 languages | 646MB | Best quality, fastest large model | [whisper-large-v3-turbo-onnx.zip](large-v3-turbo/whisper-large-v3-turbo-onnx.zip) |
## Contents of Each Zip
Each zip file contains 6 files needed for inference:
### ONNX Model Files
- `encoder_model_quantized.onnx` - Audio encoder (processes mel spectrograms)
- `decoder_with_past_model_quantized.onnx` - Text decoder (generates transcription), optimized decoder with KV caching
### Configuration Files
- `config.json` - Model configuration
- `generation_config.json` - Generation parameters
- `preprocessor_config.json` - Audio preprocessing settings
- `tokenizer.json` - Tokenizer vocabulary
## Model Sources
These models are repackaged from:
- [Distil-Whisper](https://huggingface.co/distil-whisper) (English models)
- [ONNX Community](https://huggingface.co/onnx-community) (Multilingual models)
## License
Models inherit their original licenses:
- Distil-Whisper models: MIT License
- Whisper models: MIT License
## Version History
- **v1.0.0** - Initial release with 5 optimized models