sabersaleh
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -3,7 +3,7 @@ license: mit
|
|
3 |
datasets:
|
4 |
- HuggingFaceH4/ultrafeedback_binarized
|
5 |
base_model:
|
6 |
-
-
|
7 |
---
|
8 |
|
9 |
-
This is an aligned model based on princeton-nlp/Llama-3-Base-8B-SFT. This model is aligned using the Ultrafeedback dataset, fine-tuned through the Simple Preference Optimization (SimPO) loss. The optimization process was conducted with a single epoch.
|
|
|
3 |
datasets:
|
4 |
- HuggingFaceH4/ultrafeedback_binarized
|
5 |
base_model:
|
6 |
+
- princeton-nlp/Llama-3-Base-8B-SFT
|
7 |
---
|
8 |
|
9 |
+
This is an aligned model based on princeton-nlp/Llama-3-Base-8B-SFT. This model is aligned using the Ultrafeedback dataset, fine-tuned through the Simple Preference Optimization (SimPO) loss. The optimization process was conducted with a single epoch.
|