ganchengguang commited on
Commit
55c25e9
1 Parent(s): 08f07be

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -0
README.md CHANGED
@@ -1,3 +1,27 @@
1
  ---
2
  license: mit
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
+ language:
4
+ - ja
5
+ - en
6
+ - zh
7
+ tags:
8
+ - LLaMA2
9
+ - Japanese
10
+ - LLM
11
  ---
12
+
13
+ This model is traned with [llm-japanese-dataset](https://huggingface.co/datasets/izumi-lab/llm-japanese-dataset) dataset. And this model used a few of dataset by 50000 chat samples and 280000 non chat samples.
14
+ Improved performance in Chinese and Japanese.
15
+ Use the QLoRA to fine-tune the vanilla [LLaMA2-13B](https://huggingface.co/NousResearch/Llama-2-7b-hf).
16
+ And you can use test.py to test the model.
17
+
18
+ ### Recommend Generation parameters:
19
+ * temperature: 0.5~0.7
20
+ * top p: 0.65~1.0
21
+ * top k: 30~50
22
+ * repeat penalty: 1.03~1.17
23
+
24
+
25
+
26
+
27
+ Contribute by Yokohama Nationaly University Mori Lab.