Safetensors
llama
sabersaleh commited on
Commit
0d95e1b
·
verified ·
1 Parent(s): b6e9e64

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - HuggingFaceH4/ultrafeedback_binarized
5
+ base_model:
6
+ - meta-llama/Llama-3.1-8B
7
+ ---
8
+
9
+ This is an aligned model based on princeton-nlp/Llama-3-Base-8B-SFT. This model is aligned using the Ultrafeedback dataset, fine-tuned through the Simple Preference Optimization (SimPO) loss. The optimization process was conducted with a single epoch.