add branch infos
Browse files
README.md
CHANGED
@@ -14,6 +14,21 @@ This is the model for Draco-8x7B. I used [this repo](https://bit.ly/weyaxi-moe-r
|
|
14 |
|
15 |
This model's experts are not using any merged models.
|
16 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
17 |
# 💬 Prompt Template(s):
|
18 |
|
19 |
This model includes many models, so providing only one prompt template is not enough. You can use and try these prompt templates and decide which works best for you.
|
|
|
14 |
|
15 |
This model's experts are not using any merged models.
|
16 |
|
17 |
+
# 📚 Other branches (Number of Experts Per Token)
|
18 |
+
|
19 |
+
Other branches that this repository contains differ only slightly (from a git diff perspective) in terms of the number of experts per token.
|
20 |
+
|
21 |
+
Usually, a higher value for the number of experts per token will result in better performance, but it may also lead to increased inference time.
|
22 |
+
|
23 |
+
| Number of experts per token | Link of the branch |
|
24 |
+
| ---------------------------- | -------------------------------------------------------------------------------------------|
|
25 |
+
| 2 | [Main](https://huggingface.co/Weyaxi/Draco-8x7B/tree/main) |
|
26 |
+
| 3 | [3-experts-per-token](Link_To_3_Experts_Per_Token) |
|
27 |
+
| 4 | [4-experts-per-token](https://huggingface.co/Weyaxi/Draco-8x7B/tree/4-experts-per-token) |
|
28 |
+
| 6 | [6-experts-per-token](https://huggingface.co/Weyaxi/Draco-8x7B/tree/6-experts-per-token) |
|
29 |
+
| 8 | [8-experts-per-token](https://huggingface.co/Weyaxi/Draco-8x7B/tree/8-experts-per-token) |
|
30 |
+
|
31 |
+
|
32 |
# 💬 Prompt Template(s):
|
33 |
|
34 |
This model includes many models, so providing only one prompt template is not enough. You can use and try these prompt templates and decide which works best for you.
|