Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,7 @@ Mathmate-7B-DELLA-ORPO is a finetuned version of [Haleshot/Mathmate-7B-DELLA](ht
|
|
17 |
## Model Details
|
18 |
|
19 |
- **Base Model:** [Haleshot/Mathmate-7B-DELLA](https://huggingface.co/Haleshot/Mathmate-7B-DELLA)
|
20 |
-
- **Finetuning Method:** ORPO (
|
21 |
- **Training Dataset:** [argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo)
|
22 |
|
23 |
## Finetuning
|
|
|
17 |
## Model Details
|
18 |
|
19 |
- **Base Model:** [Haleshot/Mathmate-7B-DELLA](https://huggingface.co/Haleshot/Mathmate-7B-DELLA)
|
20 |
+
- **Finetuning Method:** ORPO (Odds Ratio Preference Optimization)
|
21 |
- **Training Dataset:** [argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo)
|
22 |
|
23 |
## Finetuning
|