---
library_name: transformers
tags: []
---

## Model Details

### Model Description

A Llama-3.1-8B model fine-tuned with the ORPO trainer.

## Training Details

### Training Data

This model was fine-tuned on [mlabonne/orpo-dpo-mix-40k](https://huggingface.co/datasets/mlabonne/orpo-dpo-mix-40k).

### Training Procedure

The model was trained with the ORPO trainer on only the first 5K rows of the dataset (5K out of 40K).
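
As a rough illustration of the row selection described above, the first 5K rows could be picked with the split-slicing syntax of the `datasets` library. This is a minimal sketch, not the original training script: the split name `train` and the helper names `first_rows_split` and `load_training_subset` are assumptions introduced here for illustration.

```python
# Sketch only: the split name "train" and both helper names are assumptions,
# not taken from the original training script.

DATASET_ID = "mlabonne/orpo-dpo-mix-40k"
NUM_ROWS = 5_000  # "first 5K rows" per the training procedure


def first_rows_split(split: str, num_rows: int) -> str:
    """Build a split-slicing string such as 'train[:5000]'."""
    if num_rows <= 0:
        raise ValueError("num_rows must be positive")
    return f"{split}[:{num_rows}]"


def load_training_subset():
    """Download the dataset and keep only its first NUM_ROWS rows."""
    # Imported lazily so the helper above works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=first_rows_split("train", NUM_ROWS))
```

Calling `load_training_subset()` requires the `datasets` package and network access. The returned subset could then be passed to an ORPO trainer as its training dataset.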