whisper-large-icelandic-62640-steps-967h-ct2

This is a faster-whisper version of language-and-voice-lab/whisper-large-icelandic-62640-steps-967h.

The model was created like described in faster-whisper:

ct2-transformers-converter --model language-and-voice-lab/whisper-large-icelandic-62640-steps-967h \
    --output_dir whisper-large-icelandic-62640-steps-967h-ct2 \
    --quantization float16

Usage

from faster_whisper import WhisperModel

model_size = "whisper-large-icelandic-62640-steps-967h-ct2"

# Run on GPU with FP16
model = WhisperModel(model_size, device="cuda", compute_type="float16")

# or run on GPU with INT8
# model = WhisperModel(model_size, device="cuda", compute_type="int8_float16")
# or run on CPU with INT8
# model = WhisperModel(model_size, device="cpu", compute_type="int8")

segments, info = model.transcribe("audio.mp3", beam_size=5)

print("Detected language '%s' with probability %f" % (info.language, info.language_probability))

for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))

BibTeX entry and citation info

When publishing results based on these models please refer to:

@misc{mena2023whisperlarge62640icelandicct2,
    title={Acoustic Model in Icelandic: whisper-large-icelandic-62640-steps-967h-ct2.}, 
    author={Hernandez Mena, Carlos Daniel},
    url={https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-62640-steps-967h-ct2},
    year={2023}
}
Downloads last month
12
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train language-and-voice-lab/whisper-large-icelandic-62640-steps-967h-ct2