Haleshot commited on
Commit
80cab8f
1 Parent(s): 90127cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ Mathmate-7B-DELLA-ORPO is a finetuned version of [Haleshot/Mathmate-7B-DELLA](ht
17
  ## Model Details
18
 
19
  - **Base Model:** [Haleshot/Mathmate-7B-DELLA](https://huggingface.co/Haleshot/Mathmate-7B-DELLA)
20
- - **Finetuning Method:** ORPO (Offline Ranked Preference Optimization)
21
  - **Training Dataset:** [argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo)
22
 
23
  ## Finetuning
 
17
  ## Model Details
18
 
19
  - **Base Model:** [Haleshot/Mathmate-7B-DELLA](https://huggingface.co/Haleshot/Mathmate-7B-DELLA)
20
+ - **Finetuning Method:** ORPO (Odds Ratio Preference Optimization)
21
  - **Training Dataset:** [argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo)
22
 
23
  ## Finetuning