Files changed (1)
  1. README.md +13 -0
README.md CHANGED
@@ -1 +1,14 @@
+ ---
+ license: llama3
+ library_name: transformers
+ pipeline_tag: text-generation
+ tags:
+ - ORPO
+ datasets:
+ - princeton-nlp/llama3-ultrafeedback-armorm
+ language:
+ - en
+ base_model:
+ - meta-llama/Meta-Llama-3-8B-Instruct
+ ---
  This is a model released from the preprint: [SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734). Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details.