dctanner
/

sablo-pebble-mistral-dpo-lora-HelpSteer_binarized

alignment-handbook

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

sablo-pebble-mistral-dpo-lora-HelpSteer_binarized / runs /Jan18_15-28-29_08134be46a59

1 contributor

History: 11 commits

dctanner's picture

Model save

e291de3 verified 11 months ago

events.out.tfevents.1705591929.08134be46a59.1750.0

73.3 kB
LFS

Model save 11 months ago
events.out.tfevents.1705609138.08134be46a59.1750.1

828 Bytes
LFS

Model save 11 months ago