M4-ai
/

tau-0.5B-instruct-DPOP

Text Generation

text-generation-inference

Model card Files Files and versions Community

Locutusque commited on Mar 10, 2024

Commit

a5a8c7c

·

verified ·

1 Parent(s): 5bb9761

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ language:
 - **Model Size:** 0.5B parameters
 - **Model Type:** Instruction-following Language Model
 - **Training Data**: About 700 high quality preference entries annotated by GPT-4.
-- **Training Procedure**: The DPO-Positive algorithm introduced abacusai was used to train this model.
 ## Model Use
 tau-instruct-0.5B-DPOP is an instruction-following language model designed to follow user instructions and provide assistance across a wide range of tasks, including but not limited to:

 - **Model Size:** 0.5B parameters
 - **Model Type:** Instruction-following Language Model
 - **Training Data**: About 700 high quality preference entries annotated by GPT-4.
+- **Training Procedure**: The DPO-Positive algorithm introduced by abacusai was used to train this model.
 ## Model Use
 tau-instruct-0.5B-DPOP is an instruction-following language model designed to follow user instructions and provide assistance across a wide range of tasks, including but not limited to: