An instruction-tuned fine-tune of migtissera/Tess-XS-v1-3-yarn-128K.
It works well with long system prompts.
It is not a general-purpose model: it is intended for reasoning and text comprehension, not for tasks such as storytelling.
This model was trained on a private dataset. The high GSM8K score is NOT due to the MetaMath dataset.
Prompt Format:
SYSTEM: <ANY SYSTEM CONTEXT>
USER:
ASSISTANT:
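
The format above can be assembled programmatically. The following is a minimal sketch, not an official helper: the function name `build_prompt` is hypothetical, and it assumes the user message goes on the same line as `USER:` after a single space and that the prompt ends right after `ASSISTANT:` so the model writes the reply. The card does not specify these whitespace details, so adjust them if outputs look off.

```python
def build_prompt(system: str, user: str) -> str:
    """Assemble a single-turn prompt in the SYSTEM/USER/ASSISTANT layout.

    Assumption: each role label is followed by one space and its text,
    with roles separated by newlines. The model card shows only the
    labels, so the exact spacing here is a guess.
    """
    return f"SYSTEM: {system}\nUSER: {user}\nASSISTANT:"


if __name__ == "__main__":
    # Example: a system context plus one comprehension question.
    print(build_prompt(
        "You answer questions using only the supplied passage.",
        "What is the main claim of the passage?",
    ))
```

The returned string can be passed as-is to whichever inference backend loads the model; long system contexts go in the `system` argument.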
Model tree for LoneStriker/Metis-0.5-3.0bpw-h6-exl2
Base model
migtissera/Tess-XS-v1-3-yarn-128K