Update README.md
README.md CHANGED
@@ -8,7 +8,7 @@ license: apache-2.0
 **Creator** Nicolas Mejia-Petit
 
 ### Overview
-Just one day after the release of **Mixtral-8x-22b**, we are excited to introduce our handcrafted experimental model, **Mistral-22b-V.01**. This model is a culmination of equal knowledge distilled from all experts into a single, dense 22b model. This model is not a single trained expert; rather, it's a compressed MOE model, turning it into a dense 22b model.
+Just one day after the release of **Mixtral-8x-22b**, we are excited to introduce our handcrafted experimental model, **Mistral-22b-V.01**. This model is a culmination of equal knowledge distilled from all experts into a single, dense 22b model. This model is not a single trained expert; rather, it's a compressed MOE model, turning it into a dense 22b model. This is the first working MOE to Dense model conversion.
 
 ### Capabilities
 - **Math Proficiency**: The model exhibits strong mathematical abilities.
@@ -29,5 +29,5 @@ Stay tuned for the release of **V.2** tomorrow, which will feature enhancements
 The decision to release this experimental version was prompted by someone attempting to replicate my experiment based on my tweets. We wanted to ensure our community has access to the official version first.
 
 ### Stay Updated
-Keep an eye out for **V.2**, it's going to be a game-changer! 🌟Paper Coming Soon🌟
+Keep an eye out for **V.2**, it's going to be a game-changer! It is currently training and will be done in the next ~24 hours. 🌟Paper Coming Soon🌟
 