Update README.md
README.md CHANGED
@@ -10,7 +10,6 @@ This is a direct extraction of the 8 experts from [Mixtral-8x7b-Instruct-v0.1](h
 It is 2 experts per token. Performance is identical to instruct, if not a little better. Evals will come; it is more malleable to training. This is our first experiment with expert extraction and modification; more to come. Enjoy.
 
 
-```markdown
 ## Instruction Format
 To leverage instruction fine-tuning, your prompts should be enclosed with `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin-of-sentence id, while subsequent instructions should not. Assistant generation will conclude with an end-of-sentence token id.
 
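For reference, the "2 experts per token" routing matches the stock Mixtral architecture. A minimal sketch, assuming this model keeps a Mixtral-style config as in the upstream model; the values printed below are the `MixtralConfig` defaults, not read from this repo:

```python
from transformers import MixtralConfig

# Default Mixtral configuration: 8 local experts, with the router
# activating the top 2 experts for every token.
config = MixtralConfig()
print(config.num_local_experts)    # 8
print(config.num_experts_per_tok)  # 2
```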
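And a minimal sketch of the prompt layout described under Instruction Format: `[INST] ... [/INST]` wraps each instruction, the begin-of-sentence id is added only once (here left to the tokenizer via `add_special_tokens=True`), and `</s>` closes each completed assistant turn. The helper name `build_prompt` is illustrative, not part of this repo:

```python
def build_prompt(turns):
    """Format (instruction, reply) pairs; reply may be None for the open turn."""
    prompt = ""
    for instruction, reply in turns:
        prompt += f"[INST] {instruction} [/INST]"
        if reply is not None:
            # a finished assistant turn ends with the end-of-sentence token
            prompt += f" {reply}</s>"
    return prompt

# Single-turn usage; encode with add_special_tokens=True so the tokenizer
# prepends the begin-of-sentence id exactly once.
print(build_prompt([("What is a mixture of experts?", None)]))
```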