voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized

The Model voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized was converted to MLX format from Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R using mlx-lm version 0.13.0.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized")
response = generate(model, tokenizer, prompt="hello", verbose=True)
Downloads last month
5
Safetensors
Model size
8.03B params
Tensor type
FP16
ยท
Inference API
Unable to determine this modelโ€™s pipeline type. Check the docs .

Spaces using voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized 2