Commit 555a6ca by Harveenchadha (1 parent: 5d69493)
Updating paths
README.md CHANGED
@@ -39,10 +39,10 @@ This model was trained on 4200 hours of Hindi Labelled Data. The labelled data i
 
 Models were trained using experimental platform setup by Vakyansh team at Ekstep. Here is the [training repository](https://github.com/Open-Speech-EkStep/vakyansh-wav2vec2-experimentation).
 
-In case you want to explore training logs on wandb they are [here](https://wandb.ai/harveenchadha/
+In case you want to explore training logs on wandb they are [here](https://wandb.ai/harveenchadha/tamil-finetuning-multilingual).
 
 
-## [Colab Demo](https://
+## [Colab Demo](https://github.com/harveenchadha/bol/blob/main/demos/hf/tamil/hf_tamil_tnm_4200_demo.ipynb)
 
 ## Usage
 
@@ -56,8 +56,8 @@ import argparse
 
 def parse_transcription(wav_file):
     # load pretrained model
-    processor = Wav2Vec2Processor.from_pretrained("Harveenchadha/vakyansh-wav2vec2-
-    model = Wav2Vec2ForCTC.from_pretrained("Harveenchadha/vakyansh-wav2vec2-
+    processor = Wav2Vec2Processor.from_pretrained("Harveenchadha/vakyansh-wav2vec2-tamil-tam-250")
+    model = Wav2Vec2ForCTC.from_pretrained("Harveenchadha/vakyansh-wav2vec2-tamil-tam-250")
 
     # load audio
     audio_input, sample_rate = sf.read(wav_file)
@@ -129,7 +129,7 @@ print("WER: {:2f}".format(100 * wer.compute(predictions=result["pred_strings"],
 
 **Test Result**: 53.64 %
 
-[**Colab Evaluation**](https://
+[**Colab Evaluation**](https://github.com/harveenchadha/bol/blob/main/demos/hf/tamil/hf_vakyansh_tamil_tnm_4200_evaluation_common_voice.ipynb)
 
 ## Credits
 Thanks to Ekstep Foundation for making this possible. The vakyansh team will be open sourcing speech models in all the Indic Languages.