Update README.md
README.md
@@ -24,7 +24,7 @@ Finetuned https://huggingface.co/mesolitica/nanot5-small-malaysian-cased using 2
 - Better Cantonese translation compared to V2.
 - Better Tamil and Tanglish translation compared to V2.
 
-Wandb at https://wandb.ai/huseinzol05/nanot5-small-malaysian-cased-translation-
+Wandb at https://wandb.ai/huseinzol05/nanot5-small-malaysian-cased-translation-v6-multipack-post, **still training**.
 
 ## Public API
 
@@ -697,4 +697,5 @@ Or you can finish the PR at https://github.com/huggingface/transformers/pull/311
 
 ## how to finetune your own dataset?
 
-We finetuned using T5 SDPA multipacking forked at https://github.com/mesolitica/t5-sdpa-multipack, super undocumented, but the scripts from https://github.com/huggingface/transformers/tree/main/examples/pytorch/translation should also work.
+1. We finetuned using T5 SDPA multipacking forked at https://github.com/mesolitica/t5-sdpa-multipack, super undocumented, but the scripts from https://github.com/huggingface/transformers/tree/main/examples/pytorch/translation should also work (see the sketch below).
+2. Training script at https://github.com/mesolitica/malaya/blob/master/session/translation/end-to-end/nanot5-small-multipack-compile.sh
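
For reference, the stock Hugging Face example script mentioned in point 1 can be invoked roughly as below. This is a minimal sketch, not the command used for this model: the `train.json`/`eval.json` files, the `en`/`ms` language codes, the `--source_prefix` value, and all hyperparameters are illustrative assumptions, not taken from this repo.

```bash
# Sketch: finetune with transformers' stock translation example script.
# Assumes the transformers repo is cloned locally. train.json / eval.json
# are hypothetical JSON-lines files where each record looks like:
#   {"translation": {"en": "hello", "ms": "helo"}}
python examples/pytorch/translation/run_translation.py \
    --model_name_or_path mesolitica/nanot5-small-malaysian-cased \
    --do_train \
    --do_eval \
    --source_lang en \
    --target_lang ms \
    --source_prefix "terjemah ke Melayu: " \
    --train_file train.json \
    --validation_file eval.json \
    --output_dir ./nanot5-small-finetuned \
    --per_device_train_batch_size 16 \
    --learning_rate 2e-4 \
    --num_train_epochs 3 \
    --predict_with_generate \
    --overwrite_output_dir
```

Note that the stock script does not do the multipacking from point 1: it pads each batch normally, so it trades some training throughput for simplicity.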