Marianoleiras
/

whisper-small-es-ja

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Marianoleiras commited on Jan 13

Commit

a794f21

·

verified ·

1 Parent(s): dde4de1

Update README.md

Files changed (1) hide show

README.md +3 -8

README.md CHANGED Viewed

@@ -22,15 +22,10 @@ should probably proofread and complete it, then remove this comment. -->
 # whisper-small-es-ja
 ## Model Overview
-This model is a fine-tuned version of OpenAI's Whisper-small, specifically trained on the **Marianoleiras/voxpopuli_es-ja** dataset for Spanish-to-Japanese speech-to-text (STT) tasks.
-It employs the Whisper architecture, which is known for its robustness in multilingual speech recognition and translation scenarios.
-The primary goal of this model is to enable accurate end-to-end transcription and translation of spoken Spanish into written Japanese.
-It was developed as part of a **three-week workshop organized by Yasmin Moslem**, focusing on speech-to-text pipelines.
-The workshop involved:
-1. **Dataset creation** during the first week.
-2. **Model training and optimization** during the second week.
-3. **In-depth exploration and evaluation** in the third week.
 The model achieves competitive performance metrics on the provided dataset:

 # whisper-small-es-ja
 ## Model Overview
+This model was developed as part of a workshop organized by Yasmin Moslem, focusing on **speech-to-text pipelines**.
+The workshop's primary goal was to enable accurate transcription and translation of spoken source languages into written target languages while learning about end-to-end and cascaded approaches in the process.
+This model represents an **end-to-end solution** for Spanish-to-Japanese speech-to-text (STT) tasks and is a fine-tuned version of OpenAI's Whisper-small, specifically trained on the **[Marianoleiras/voxpopuli_es-ja](https://huggingface.co/datasets/Marianoleiras/voxpopuli_es-ja)** dataset for Spanish-to-Japanese speech-to-text (STT) tasks.
 The model achieves competitive performance metrics on the provided dataset: