zephyr-7b-dpo-full-gpt-reward-scale-05 / model-00002-of-00003.safetensors

Commit History

Model save
a7ae430
verified

sfulay commited on

Training in progress, step 436
7b08247
verified

sfulay commited on

Training in progress, step 400
03f77ed
verified

sfulay commited on

Training in progress, step 300
3f979a2
verified

sfulay commited on

Training in progress, step 200
89a8449
verified

sfulay commited on

Training in progress, step 100
b54d083
verified

sfulay commited on