Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sfulay
/
zephyr-7b-dpo-full-gpt-reward-scale-1-rpo
like
0
Safetensors
mistral
Model card
Files
Files and versions
Community
Train
main
zephyr-7b-dpo-full-gpt-reward-scale-1-rpo
Commit History
Training in progress, step 200
8a9a4e7
verified
sfulay
commited on
Sep 3, 2024
Training in progress, step 100
f47d3a4
verified
sfulay
commited on
Sep 3, 2024
initial commit
fdc2ccc
verified
sfulay
commited on
Sep 2, 2024