Update README.md
Browse files
README.md
CHANGED
@@ -35,7 +35,7 @@ Translate a sentence using python
|
|
35 |
import ctranslate2
|
36 |
import pyonmttok
|
37 |
from huggingface_hub import snapshot_download
|
38 |
-
model_dir = snapshot_download(repo_id="projecte-aina/
|
39 |
|
40 |
tokenizer=pyonmttok.Tokenizer(mode="none", sp_model_path = model_dir + "/spm.model")
|
41 |
tokenized=tokenizer.tokenize("Welcome to the Aina Project!")
|
@@ -89,7 +89,7 @@ The model was trained on a combination of the following datasets:
|
|
89 |
|
90 |
#### Tokenization
|
91 |
|
92 |
-
All data is tokenized using sentencepiece, using 50 thousand token sentencepiece model
|
93 |
This model is included.
|
94 |
|
95 |
#### Hyperparameters
|
|
|
35 |
import ctranslate2
|
36 |
import pyonmttok
|
37 |
from huggingface_hub import snapshot_download
|
38 |
+
model_dir = snapshot_download(repo_id="projecte-aina/aina-translator-en-ca", revision="main")
|
39 |
|
40 |
tokenizer=pyonmttok.Tokenizer(mode="none", sp_model_path = model_dir + "/spm.model")
|
41 |
tokenized=tokenizer.tokenize("Welcome to the Aina Project!")
|
|
|
89 |
|
90 |
#### Tokenization
|
91 |
|
92 |
+
All data is tokenized with a 50-thousand-token SentencePiece model learned from the combination of all filtered training data.
|
93 |
This model is included.
|
94 |
|
95 |
#### Hyperparameters
|