ludobico
/

Ko-Llama3-Luxia-8B-it

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

lubocido commited on Jul 11

Commit

7f2d069

•

1 Parent(s): dc3dfdd

Update README.md

Files changed (1) hide show

README.md +10 -2

README.md CHANGED Viewed

@@ -2,10 +2,18 @@
 Saltlux, AI Labs 에서 개발한 [saltlux/Ko-Llama3-Luxia-8B](https://huggingface.co/saltlux/Ko-Llama3-Luxia-8B) 모델을 Instruction Fine tuning한 모델입니다.
 사용된 데이터셋으로 [maywell/ko_wikidata_QA](https://huggingface.co/datasets/maywell/ko_wikidata_QA)를 사용하였으며 SFTTrainer를 통해 3ep로 학습했습니다.
-instruction prompt는 다음과 같습니다.
 ```python
-<|im_start|>user\n{question}<|im_end|>\n<|im_start|>assistant\n{answer}<|im_end|>
 ```
 ## HyperParameter

 Saltlux, AI Labs 에서 개발한 [saltlux/Ko-Llama3-Luxia-8B](https://huggingface.co/saltlux/Ko-Llama3-Luxia-8B) 모델을 Instruction Fine tuning한 모델입니다.
 사용된 데이터셋으로 [maywell/ko_wikidata_QA](https://huggingface.co/datasets/maywell/ko_wikidata_QA)를 사용하였으며 SFTTrainer를 통해 3ep로 학습했습니다.
+instruction prompt는 Qwen2 모델과 동일하게 적용시켰습니다.
 ```python
+<|im_start|>system
+You are a helpful assistant.<|im_end|>
+<|im_start|>user
+What is the Qwen2?<|im_end|>
+<|im_start|>assistant
+Qwen2 is the new series of Qwen large language models<|im_end|>
+<|im_start|>user
+Tell me more<|im_end|>
+<|im_start|>assistant
 ```
 ## HyperParameter