NbAiLab
/

Wav2Vec-Template

Automatic Speech Recognition

xxx-robust-speech-event

Model card Files Files and versions Community

pere commited on Feb 10, 2022

Commit

f012c78

•

1 Parent(s): 7041362

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -49,7 +49,7 @@ The following people contributed to building this model: Rolv-Arild Braaten, Per
 ## Training procedure
 To reproduce these results, we strongly recommend that you follow the [instructions from HuggingFace](https://github.com/huggingface/transformers/tree/master/examples/research_projects/robust-speech-event#talks) to train a simple Swedish model.
-When you have verified that you are able to do this, create a fresh new repo. You can then start by copying the files ´´´run.sh´´´ and ´´´run_speech_recognition_ctc.py´´´ from our repo. Running this will create all the other necessary files, and will let you reproduce. With some tweaks to the hyperparameters, you might even be able to build an even better ASR. Good luck!
 ### 5-gram Language Model
 Adding a language model will improve the results of the model. 🤗 has provided another [very nice blog](https://huggingface.co/blog/wav2vec2-with-ngram) about how to add a 5-gram language model to improve the ASR model. You can build this from your own corpus, for instance by extracting some suitable text from the [Norwegian Colossal Corpus](https://huggingface.co/datasets/NbAiLab/NCC). You can also skip some of the steps in the guide, and copy the [5-gram model from this repo](https://huggingface.co/NbAiLab/XLSR-300M-bokmaal/tree/main/language_model).

 ## Training procedure
 To reproduce these results, we strongly recommend that you follow the [instructions from HuggingFace](https://github.com/huggingface/transformers/tree/master/examples/research_projects/robust-speech-event#talks) to train a simple Swedish model.
+When you have verified that you are able to do this, create a fresh new repo. You can then start by copying the files ```run.sh``` and ```run_speech_recognition_ctc.py``` from our repo. Running this will create all the other necessary files, and will let you reproduce. With some tweaks to the hyperparameters, you might even be able to build an even better ASR. Good luck!
 ### 5-gram Language Model
 Adding a language model will improve the results of the model. 🤗 has provided another [very nice blog](https://huggingface.co/blog/wav2vec2-with-ngram) about how to add a 5-gram language model to improve the ASR model. You can build this from your own corpus, for instance by extracting some suitable text from the [Norwegian Colossal Corpus](https://huggingface.co/datasets/NbAiLab/NCC). You can also skip some of the steps in the guide, and copy the [5-gram model from this repo](https://huggingface.co/NbAiLab/XLSR-300M-bokmaal/tree/main/language_model).