princeton-nlp
/

Mistral-7B-Base-SFT-CPO

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mistral-7B-Base-SFT-CPO / README.md

CombinHorizon's picture

Update README.md

7f67394 verified about 2 months ago

|

335 Bytes

	---
	license: apache-2.0
	library_name: transformers
	pipeline_tag: text-generation
	tags:
	- CPO
	---
	This is a model released from the preprint: [SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734). Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details.