Update README.md
README.md
CHANGED
@@ -28,7 +28,7 @@ This model is a fine-tuned version of TODO on [ReBatch/ultrafeedback_nl](https:/
 
 ## Model description
 
-This model is a Dutch chat model, originally developed from Mistral 7B v0.3 Instruct and further finetuned first with SFT on a chat dataset and then with a DPO on a feedback Chat dataset.
+This model is a Dutch chat model, originally developed from Mistral 7B v0.3 Instruct and further finetuned with QLoRA. first with SFT on a chat dataset and then with a DPO on a feedback Chat dataset.
 
 
 ## Intended uses & limitations