ReWiz
Collection
The ReWiz series is based on a subset of data from 3 different data sets, which has been used for fine tuning.
•
11 items
•
Updated
•
1
Half the data was geared towards better reasoning (EvolKit-20k and reasoning-base-20k), the other half will help to de-censor the model (WizardLM data set).
There is a separate upload for that! Download theprint/ReWiz-Nemo-12B-Instruct-GGUF instead.
This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 15.63 |
IFEval (0-Shot) | 10.62 |
BBH (3-Shot) | 29.93 |
MATH Lvl 5 (4-Shot) | 7.18 |
GPQA (0-shot) | 9.84 |
MuSR (0-shot) | 10.23 |
MMLU-PRO (5-shot) | 25.99 |
Base model
unsloth/Mistral-Nemo-Instruct-2407-bnb-4bit