Update README.md
README.md
---
---

# 🦜 EmertonOmniBeagle-7B-dpo

EmertonOmniBeagle-7B-dpo is a DPO fine-tune of [mlabonne/OmniBeagle14-7B](https://huggingface.co/mlabonne/OmniBeagle-7B) using the [yleo/emerton_dpo_pairs](https://huggingface.co/datasets/yleo/emerton_dpo_pairs) preference dataset, which was created from [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) by replacing each GPT-3.5 answer with a GPT-4 Turbo answer. The GPT-4 Turbo response is then labeled as chosen, while the original GPT-4 response is labeled as rejected.
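The pair construction described above can be sketched as follows. This is a minimal illustration, not the actual build script: the field names (`system`, `question`, `chosen`, `rejected`) are assumptions based on the Intel/orca_dpo_pairs layout, and the helper name is hypothetical.

```python
# Illustrative sketch of the preference-pair construction described above.
# Field names follow the assumed Intel/orca_dpo_pairs schema; the real
# emerton_dpo_pairs build process may differ.

def build_preference_pair(orca_example: dict, gpt4_turbo_answer: str) -> dict:
    """Swap in a GPT-4 Turbo answer as 'chosen' and demote the original
    GPT-4 answer (orca's 'chosen' field) to 'rejected'."""
    return {
        "system": orca_example.get("system", ""),
        "question": orca_example["question"],
        "chosen": gpt4_turbo_answer,         # new GPT-4 Turbo response
        "rejected": orca_example["chosen"],  # original GPT-4 response
    }
```

Applied over every row of the source dataset (e.g. with `datasets.Dataset.map`), this yields pairs where GPT-4 Turbo is always preferred over GPT-4, which is the signal DPO optimizes against.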