AuriAetherwiing
commited on
Commit
•
8d313db
1
Parent(s):
03511c1
Update README.md
Browse files
README.md
CHANGED
@@ -26,7 +26,11 @@ model-index:
|
|
26 |
It uses Celeste 70B 0.1 data mixture, greatly expanding it to improve versatility, creativity and "flavor" of the resulting model.<br>
|
27 |
</p>
|
28 |
|
29 |
-
<p>
|
|
|
|
|
|
|
|
|
30 |
|
31 |
<p>
|
32 |
<p>Prompt format is ChatML.</p><br>
|
|
|
26 |
It uses Celeste 70B 0.1 data mixture, greatly expanding it to improve versatility, creativity and "flavor" of the resulting model.<br>
|
27 |
</p>
|
28 |
|
29 |
+
<p>This model is available for inference on <a href=https://featherless.ai/models/EVA-UNIT-01/EVA-Qwen2.5-32B-v0.1>FeatherlessAI</a></p>
|
30 |
+
|
31 |
+
<p>Dedicated to Nev.</p>
|
32 |
+
|
33 |
+
<p><b>Version notes for 0.1</b>: Additional round of cleaning for the datasets, new subsets of 4o-WritingPrompts and Charcards, picking the most diverse samples from them, plus added a small subset of SystemChat2.0 to improve instruction following and sliglthy increased sequence length. Additionally, fixed the training config mistake from 32B 0.0, layernorm layers stay frozen this time. Unfreezing them caused positivity bias to appear in 32B 0.0 for some reason.</p>
|
34 |
|
35 |
<p>
|
36 |
<p>Prompt format is ChatML.</p><br>
|