gamepollakrit committed on
Commit cc84426 · verified · 1 Parent(s): 4f07ac5

Update README.md

Files changed (1): README.md +59 -2
README.md CHANGED
@@ -2,6 +2,63 @@
  {}
  ---

- # TSUNAMI: Transformative Semantic Understanding and Natural Augmentation Model for Intelligence

- **TSUNAMI** full name Created by ChatGPT
  {}
  ---

+ <img src="./Tsunami.webp" alt="Tsunami Model" width="800" style="margin-left:auto; margin-right:auto; display:block;"/>

+ # Tsunami-7B-Instruct
+ **TSUNAMI**: Transformative Semantic Understanding and Natural Augmentation Model for Intelligence.
+
+ The full name **TSUNAMI** was created by ChatGPT.
+
+ ---
+
+ ### Information
+ **Tsunami-7B-Instruct** is a Thai large language model fine-tuned from **Qwen2.5-7B** on around 60,000 rows of Thai-domain data.
+
+ ---
+
+ ### Prompt Template
+
+ This model uses the `ChatML` prompt template:
+
+ ```
+ <|im_start|>system
+ {System}<|im_end|>
+ <|im_start|>user
+ {User}<|im_end|>
+ <|im_start|>assistant
+ {Assistant}
+ ```
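+
+ The `{System}`, `{User}`, and `{Assistant}` slots are placeholders for the actual turn contents. Assuming the repository's tokenizer ships this ChatML template in its config (as Qwen2.5-based tokenizers typically do), `tokenizer.apply_chat_template` renders it for you; a minimal sketch (the full example in the next section does this end-to-end):
+
+ ```python
+ from transformers import AutoTokenizer
+
+ tokenizer = AutoTokenizer.from_pretrained("Tsunami/Tsunami-0.5-7B-Instruct")
+ messages = [
+     {"role": "system", "content": "You are a helpful assistant."},
+     {"role": "user", "content": "Hello"},
+ ]
+ # Should print the ChatML string above with the slots filled in,
+ # ending with an open assistant turn ready for generation.
+ print(tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True))
+ ```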
+
+ ### How to use
+
+ ```python
+ import torch
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ model_name = "Tsunami/Tsunami-0.5-7B-Instruct"
+
+ model = AutoModelForCausalLM.from_pretrained(
+     model_name,
+     torch_dtype="auto",
+     device_map="auto"
+ )
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
+
+ messages = [
+     {"role": "system", "content": "You are a helpful assistant."},
+     {"role": "user", "content": "สวัสดีครับ"}  # "Hello" in Thai
+ ]
+ # Render the ChatML prompt and append an open assistant turn
+ text = tokenizer.apply_chat_template(
+     messages,
+     tokenize=False,
+     add_generation_prompt=True
+ )
+
+ inputs = tokenizer(text, return_tensors="pt")
+ inputs = inputs.to(model.device)
+ with torch.no_grad():
+     output = model.generate(**inputs, max_new_tokens=512)
+
+ # Decode only the newly generated tokens, skipping the prompt
+ response = tokenizer.decode(output[0, len(inputs['input_ids'][0]):], skip_special_tokens=True)
+ print(response)
+ ```
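+
+ Unless the repository's `generation_config.json` specifies otherwise, `generate` decodes greedily. A hedged variant that opts into sampling via standard `generate` arguments (the values here are illustrative, not settings published for this model):
+
+ ```python
+ # Illustrative sampling configuration; tune for your use case
+ output = model.generate(
+     **inputs,
+     max_new_tokens=512,
+     do_sample=True,
+     temperature=0.7,
+     top_p=0.9,
+ )
+ ```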