---
base_model: mlabonne/TwinLlama-3.1-8B
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
- dpo
---
![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)
# QuantFactory/TwinLlama-3.1-8B-DPO-GGUF
This is a quantized version of [mlabonne/TwinLlama-3.1-8B-DPO](https://huggingface.co/mlabonne/TwinLlama-3.1-8B-DPO), created using llama.cpp.
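As a rough illustration of how a GGUF file from this repository might be loaded, the sketch below uses the `llama-cpp-python` and `huggingface_hub` packages. It is not taken from the original card. The quantization tag `Q4_K_M`, the context size, and the helper names `pick_gguf`, `load_model`, and `generate` are assumptions made for the example; check the repository's file list for the quantizations that are actually published.

```python
from fnmatch import fnmatch

REPO_ID = "QuantFactory/TwinLlama-3.1-8B-DPO-GGUF"


def pick_gguf(filenames, quant="Q4_K_M"):
    """Return the first .gguf filename that contains the quantization tag.

    `quant` is an assumed tag; the repository may publish different ones.
    """
    pattern = f"*{quant}*.gguf"
    matches = sorted(name for name in filenames if fnmatch(name, pattern))
    if not matches:
        raise FileNotFoundError(f"no file matching {pattern!r}")
    return matches[0]


def load_model(quant="Q4_K_M", n_ctx=2048):
    """Download (or reuse the cached copy of) one GGUF file and load it."""
    # Imported lazily so pick_gguf can be used without these packages installed.
    from huggingface_hub import list_repo_files
    from llama_cpp import Llama

    filename = pick_gguf(list_repo_files(REPO_ID), quant)
    return Llama.from_pretrained(
        repo_id=REPO_ID,
        filename=filename,
        n_ctx=n_ctx,      # assumed context window for the example
        verbose=False,
    )


def generate(prompt, max_tokens=64):
    """Run a plain text completion and return the generated string."""
    llm = load_model()
    result = llm(prompt, max_tokens=max_tokens)
    return result["choices"][0]["text"]
```

Calling `print(generate("GGUF is a file format that"))` would download the selected file on first use, so expect a multi-gigabyte transfer for an 8B model. Listing the repository files before loading avoids hard-coding a filename that may not exist.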
# Original Model Card
# Uploaded model
- **Developed by:** mlabonne
- **License:** apache-2.0
- **Finetuned from model:** mlabonne/TwinLlama-3.1-8B
This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)