File size: 943 Bytes
1aa6801 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 |
---
license: apache-2.0
language:
- bn
base_model:
- sha1779/BengaliRegionalASR
pipeline_tag: automatic-speech-recognition
---
This is the CTranslate2 version which is faster than base version.
## requirements
```bash
pip install ctranslate2
```
## Base model to Ctranslate format conversion
```bash
!ct2-transformers-converter --model sha1779/BengaliRegionalASR --output_dir sha1779/Faster_BengaliRegionalASR --copy_files tokenizer.json preprocessor_config.json --quantization float16
```
## Run the model
```bash
pip install faster-whisper
```
```python
from faster_whisper import WhisperModel
model_size = "sha1779/Faster_BengaliRegionalASR"
model = WhisperModel(model_size, device="cuda", compute_type="float16")
segments, info = model.transcribe("audio.mp3", beam_size=5, language="en", condition_on_previous_text=False)
for segment in segments:
print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
```
|