ParasiticRogue commited on
Commit
90d89df
·
verified ·
1 Parent(s): 6319458

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +60 -0
README.md ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - Qwen2.5-32B-Instruct
4
+ - EVA-UNIT-01/EVA-Qwen2.5-32B-v0.0
5
+ license: apache-2.0
6
+ license_name: qwen
7
+ license_link: https://huggingface.co/Qwen/Qwen2.5-32B-Instruct/blob/main/LICENSE
8
+ language:
9
+ - en
10
+ pipeline_tag: text-generation
11
+ tags:
12
+ - chat
13
+ - merge
14
+ - roleplay
15
+ - not-for-all-audiences
16
+ library_name: transformers
17
+ ---
18
+
19
+ # EVA-Instruct-32B
20
+
21
+ della_linear merge done at a 40/60 split using EVA-Qwen2.5-32B-v0.0 and Qwen2.5-32B-Instruct. Seems pretty good on creative ventures so far.
22
+
23
+ Big thanks to the Qwen and EVA-UNIT-01 teams for the models used!
24
+
25
+ EXL2 quant:
26
+
27
+ https://huggingface.co/ParasiticRogue/EVA-Instruct-32B-exl2-4.0
28
+
29
+ ## Settings
30
+
31
+ Temperature @ 0.7
32
+
33
+ Min-P @ 0.02
34
+
35
+ Smoothing Factor @ 0.3
36
+
37
+ DRY Multiplier (plus standard DRY settings) @ 0.8
38
+
39
+ Skip Special Tokens @ On
40
+
41
+ Everything else @ Off
42
+
43
+ ### Prompt Format: ChatML
44
+
45
+ ```
46
+ <|im_start|>system
47
+ System prompt<|im_end|>
48
+ <|im_start|>user
49
+ User prompt<|im_end|>
50
+ <|im_start|>assistant
51
+ Bot response
52
+ ```
53
+
54
+ ### Models Merged
55
+
56
+ The following models were included in the merge:
57
+
58
+ https://huggingface.co/Qwen2.5-32B-Instruct
59
+
60
+ https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-32B-v0.0