Doctor-Shotgun committed c2dabd4 (1 parent: 329077d)
Update README.md
README.md
CHANGED
@@ -9,7 +9,7 @@ tags:
 license: apache-2.0
 ---

-# Mixtral-8x7B-Instruct-
+# Mixtral-8x7B-Instruct-v0.1-limarp

 Experimental model, using a limarp qlora trained at 10k ctx length (greater than size of the longest limarp sample when tokenized via mistral's tokenizer) on [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) and then fused to [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) at 0.5 weight.

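Note on "fused ... at 0.5 weight" in the README paragraph: the README does not say which tool performed the merge, so the following is only a minimal sketch of one common reading. It assumes the usual LoRA update, where the low-rank product `B @ A` is scaled by `lora_alpha / rank` and added to the base weight, with an extra multiplier of 0.5 applied to that delta. The function names, the tiny matrices, and the `lora_alpha` and `rank` values are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: fuse a LoRA delta into a base weight matrix at a
# chosen merge weight. Names and values are hypothetical, not taken from the
# model repository or any specific merging tool.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]

def fuse_lora(base, lora_b, lora_a, lora_alpha, rank, weight):
    """Return base + weight * (lora_alpha / rank) * (lora_b @ lora_a)."""
    scale = weight * lora_alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base, delta)
    ]

# Toy 2x2 base weight with a rank-1 adapter.
base = [[1.0, 0.0], [0.0, 1.0]]
lora_b = [[1.0], [2.0]]   # shape (2, rank)
lora_a = [[3.0, 4.0]]     # shape (rank, 2)

# weight=0.5 applies half of the adapter's delta (assumed meaning of "0.5 weight").
fused = fuse_lora(base, lora_b, lora_a, lora_alpha=1.0, rank=1, weight=0.5)
print(fused)
```

With `weight=1.0` the same function gives a full-strength merge, so the 0.5 setting can be read as interpolating halfway between the untouched Instruct weights and the fully merged ones.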