v000000
/

Qwen2.5-Lumen-14B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

v000000 commited on Sep 20, 2024

Commit

e21cb98

·

verified ·

1 Parent(s): 3debd31

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -52,6 +52,8 @@ Trained [Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
 * Merged all <b>DPO checkpoints</b> and <b>SLERP</b> variations with <b>MODEL_STOCK</b> to analyze geometric properties and get the most *performant* aspects of all runs/merges. *Model Stock* was chosen due to the similarity between the merged models.
 ## Recipe
 ```yaml

 * Merged all <b>DPO checkpoints</b> and <b>SLERP</b> variations with <b>MODEL_STOCK</b> to analyze geometric properties and get the most *performant* aspects of all runs/merges. *Model Stock* was chosen due to the similarity between the merged models.
+* This was chosen due to the fact that evaluation for *ORPO* is unclear, so it's hard to know which runs are the best.
 ## Recipe
 ```yaml