Update README.md
README.md
CHANGED
@@ -60,7 +60,7 @@ This is a fine-tuned model for Korean speech-to-text translation, from [microsof
 
 Total 35K samples. Each sample is a pair of Korean speech and its transcription. Dataset was sampled 16kHz.
 
-The model was trained on a single A100 80GB GPU for
+The model was trained on a single A100 80GB GPU for 4 epochs with a batch size of 16 using the `sample_finetune_speech.py` script from [microsoft/Phi-4-multimodal-instruct](https://huggingface.co/microsoft/Phi-4-multimodal-instruct)
 
 Note that this model is just a PoC/experimental purpose, and not intended to be used in production. More high-quality data, tuning, ablation studies, and experiments are needed.
 
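
For a rough sense of what the added line implies, the training length can be estimated from the figures in the README. This is a back-of-the-envelope sketch, not taken from `sample_finetune_speech.py`: it assumes "35K" means exactly 35,000 samples, that 16 is the effective batch size (no gradient accumulation), and that the final partial batch of each epoch is kept. The function name `optimizer_steps` is hypothetical.

```python
import math


def optimizer_steps(num_samples: int, batch_size: int, epochs: int) -> tuple[int, int]:
    """Return (steps_per_epoch, total_steps) for a plain epoch-based loop.

    Assumes one optimizer step per batch and that the last, partial
    batch of each epoch is still used (hence the ceiling).
    """
    steps_per_epoch = math.ceil(num_samples / batch_size)
    return steps_per_epoch, steps_per_epoch * epochs


# Figures from the README; 35_000 is an assumed exact value for "35K".
per_epoch, total = optimizer_steps(num_samples=35_000, batch_size=16, epochs=4)
print(f"steps per epoch: {per_epoch}, total steps: {total}")
```

If the script uses gradient accumulation or drops the last batch, the actual step count will differ from this estimate.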