Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
robinsmits
/
Qwen1.5-7B-Dutch-Chat-Dpo
like
0
Text Generation
PEFT
TensorBoard
Safetensors
BramVanroy/ultra_feedback_dutch_cleaned
Dutch
trl
dpo
conversational
Generated from Trainer
qwen2
arxiv:
2309.16609
License:
cc-by-nc-4.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Use this model
refs/pr/1
Qwen1.5-7B-Dutch-Chat-Dpo
/
tokenizer.json
robinsmits
Training in progress, step 30
50a6639
verified
10 months ago
raw
Copy download link
history
Safe
7.03 MB
File too large to display, you can
check the raw version
instead.