The model architecture and config are the same as M2M-100.
SMaLL-100 is a seq-to-seq model for the translation task. The input to the model is `source: [tgt_lang_code] + src_tokens + [EOS]` and the target is `tgt_tokens + [EOS]`.
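The sequence layout described above can be illustrated with a small plain-Python sketch. The token strings and the `__th__` language-code form are hypothetical placeholders, not the real tokenizer vocabulary; in practice `SMALL100Tokenizer` builds these sequences for you.

```python
# Illustration of the SMaLL-100 sequence layout with made-up tokens.
# The real SMALL100Tokenizer handles this automatically; the token
# strings and "__th__" code below are hypothetical placeholders.

EOS = "</s>"

def build_source(tgt_lang_code, src_tokens):
    # source: [tgt_lang_code] + src_tokens + [EOS]
    return [tgt_lang_code] + src_tokens + [EOS]

def build_target(tgt_tokens):
    # target: tgt_tokens + [EOS]
    return tgt_tokens + [EOS]

src = build_source("__th__", ["Hello", "world"])
tgt = build_target(["สวัสดี", "โลก"])
print(src)  # ['__th__', 'Hello', 'world', '</s>']
print(tgt)  # ['สวัสดี', 'โลก', '</s>']
```

Note that, unlike vanilla M2M-100, the *target* language code is placed on the source side, which is what lets a single encoder pass steer the output language.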
# `small-100-th` is a fine-tuned version of SMaLL-100 for Thai

The training dataset can be acquired from [Vistec](https://github.com/vistec-AI/thai2nmt/releases/tag/scb-mt-en-th-2020%2Bmt-opus_v1.0).
## small-100-th inference
```python
from transformers import M2M100ForConditionalGeneration
from tokenization_small100 import SMALL100Tokenizer
```
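A fuller inference sketch, building on the imports above, might look like the following. The checkpoint path `./small-100-th` is an assumption (the README does not give the actual checkpoint location), and setting `tokenizer.tgt_lang` follows the usual SMaLL-100 tokenizer convention of prepending the target language code to the source.

```python
from transformers import M2M100ForConditionalGeneration
from tokenization_small100 import SMALL100Tokenizer

# "./small-100-th" is a hypothetical local checkpoint path -- replace it
# with wherever the fine-tuned small-100-th weights actually live.
model = M2M100ForConditionalGeneration.from_pretrained("./small-100-th")
tokenizer = SMALL100Tokenizer.from_pretrained("./small-100-th")

# SMALL100Tokenizer prepends the *target* language code to the source text,
# matching the `source: [tgt_lang_code] + src_tokens + [EOS]` format above.
tokenizer.tgt_lang = "th"

inputs = tokenizer("Life is like a box of chocolates.", return_tensors="pt")
generated = model.generate(**inputs)
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```

Since translation always targets Thai for this fine-tune, `tgt_lang` can be set once and reused across inputs.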