sonthenguyen/zephyr-sft-bnb-4bit-DPO_dismissive_mtb_top3_corrupted_10112024_completereplacebyrefusal
Tags: Text Generation, Transformers, Safetensors, English, mistral, text-generation-inference, unsloth, trl, dpo, conversational, Inference Endpoints
License: apache-2.0
1 contributor
History: 4 commits
Latest commit: c70ccaa (verified), "Trained with Unsloth", by sonthenguyen, 10 days ago
File                               Size       Badge  Last commit           Updated
.gitattributes                     1.52 kB    Safe   initial commit        10 days ago
README.md                          589 Bytes  Safe   Trained with Unsloth  10 days ago
config.json                        746 Bytes  Safe   Trained with Unsloth  10 days ago
generation_config.json             155 Bytes  Safe   Trained with Unsloth  10 days ago
model-00001-of-00003.safetensors   4.94 GB    LFS    Trained with Unsloth  10 days ago
model-00002-of-00003.safetensors   5 GB       LFS    Trained with Unsloth  10 days ago
model-00003-of-00003.safetensors   4.54 GB    LFS    Trained with Unsloth  10 days ago
model.safetensors.index.json       24 kB      Safe   Trained with Unsloth  10 days ago
special_tokens_map.json            107 Bytes  Safe   Upload tokenizer      10 days ago
tokenizer.json                     3.51 MB    Safe   Upload tokenizer      10 days ago
tokenizer_config.json              1.79 kB    Safe   Upload tokenizer      10 days ago