QuantFactory
/

TwinLlama-3.1-8B-DPO-GGUF

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

TwinLlama-3.1-8B-DPO-GGUF / README.md

aashish1904's picture

Upload README.md with huggingface_hub

c8a33a6 verified 3 months ago

|

1.02 kB


	---

	base_model: mlabonne/TwinLlama-3.1-8B
	language:
	- en
	license: apache-2.0
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- llama
	- trl
	- dpo

	---

	![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)

	# QuantFactory/TwinLlama-3.1-8B-DPO-GGUF
	This is quantized version of [mlabonne/TwinLlama-3.1-8B-DPO](https://huggingface.co/mlabonne/TwinLlama-3.1-8B-DPO) created using llama.cpp

	# Original Model Card


	# Uploaded model

	- Developed by: mlabonne
	- License: apache-2.0
	- Finetuned from model : mlabonne/TwinLlama-3.1-8B

	This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)