Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
CultriX
/
Qwen2.5-14B-Wernicke-DPO-LoRA
like
2
Transformers
Safetensors
English
text-generation-inference
unsloth
qwen2
trl
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
0c14fc1
Qwen2.5-14B-Wernicke-DPO-LoRA
1 contributor
History:
4 commits
CultriX
Upload model trained with Unsloth
0c14fc1
verified
21 days ago
.gitattributes
Safe
1.57 kB
Upload model trained with Unsloth
21 days ago
README.md
Safe
576 Bytes
Upload README.md with huggingface_hub
21 days ago
adapter_config.json
Safe
728 Bytes
Upload model trained with Unsloth
21 days ago
adapter_model.safetensors
Safe
1.1 GB
LFS
Upload model trained with Unsloth
21 days ago
added_tokens.json
Safe
632 Bytes
Upload model trained with Unsloth
21 days ago
merges.txt
Safe
1.67 MB
Upload model trained with Unsloth
21 days ago
special_tokens_map.json
Safe
502 Bytes
Upload model trained with Unsloth
21 days ago
tokenizer.json
Safe
11.4 MB
LFS
Upload model trained with Unsloth
21 days ago
tokenizer_config.json
Safe
7.51 kB
Upload model trained with Unsloth
21 days ago
vocab.json
Safe
2.78 MB
Upload model trained with Unsloth
21 days ago