Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ license: apache-2.0
|
|
12 |
|
13 |
# Mathmate-7B-DELLA-ORPO
|
14 |
|
15 |
-
Mathmate-7B-DELLA-ORPO is a finetuned version of [Haleshot/Mathmate-7B-DELLA](https://huggingface.co/Haleshot/Mathmate-7B-DELLA) using the ORPO (
|
16 |
|
17 |
## Model Details
|
18 |
|
|
|
12 |
|
13 |
# Mathmate-7B-DELLA-ORPO
|
14 |
|
15 |
+
Mathmate-7B-DELLA-ORPO is a finetuned version of [Haleshot/Mathmate-7B-DELLA](https://huggingface.co/Haleshot/Mathmate-7B-DELLA) using the ORPO (Odds Ratio Preference Optimization) technique. This model has been specifically tuned to improve its performance on mathematical reasoning tasks based on human preferences.
|
16 |
|
17 |
## Model Details
|
18 |
|