Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sonthenguyen
/
zephyr-sft-bnb-4bit-DPO-mtbo-137steps
like
0
Text Generation
Transformers
Safetensors
English
mistral
text-generation-inference
unsloth
trl
dpo
conversational
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
zephyr-sft-bnb-4bit-DPO-mtbo-137steps
1 contributor
History:
4 commits
sonthenguyen
Trained with Unsloth
0f09f5d
verified
20 days ago
.gitattributes
Safe
1.52 kB
initial commit
20 days ago
README.md
Safe
589 Bytes
Trained with Unsloth
20 days ago
config.json
Safe
746 Bytes
Trained with Unsloth
20 days ago
generation_config.json
Safe
155 Bytes
Trained with Unsloth
20 days ago
model-00001-of-00003.safetensors
4.94 GB
LFS
Trained with Unsloth
20 days ago
model-00002-of-00003.safetensors
5 GB
LFS
Trained with Unsloth
20 days ago
model-00003-of-00003.safetensors
4.54 GB
LFS
Trained with Unsloth
20 days ago
model.safetensors.index.json
Safe
24 kB
Trained with Unsloth
20 days ago
special_tokens_map.json
Safe
107 Bytes
Upload tokenizer
20 days ago
tokenizer.json
Safe
3.51 MB
Upload tokenizer
20 days ago
tokenizer_config.json
Safe
1.79 kB
Upload tokenizer
20 days ago