ReBatch/Reynaerde-7B-Chat

alignment-handbook

vandeju committed on Jun 6, 2024

Commit fa5ccf0 · verified · 1 Parent(s): f325fbf

Update README.md

Files changed (1)

README.md +1 -1

@@ -28,7 +28,7 @@ This model is a fine-tuned version of TODO on [ReBatch/ultrafeedback_nl](https:/
 ## Model description
-This model is a Dutch chat model, originally developed from Mistral 7B v0.3 Instruct and further finetuned first with SFT on a chat dataset and then with a DPO on a feedback Chat dataset.
+This model is a Dutch chat model, originally developed from Mistral 7B v0.3 Instruct and further finetuned with QLoRA. first with SFT on a chat dataset and then with a DPO on a feedback Chat dataset.
 ## Intended uses & limitations