dctanner/sablo-pebble-mistral-dpo-lora-HelpSteer_binarized-2

alignment-handbook

Generated from Trainer


sablo-pebble-mistral-dpo-lora-HelpSteer_binarized-2/runs/Jan19_12-34-55_08134be46a59

1 contributor

History: 11 commits

Model save

50149aa verified 11 months ago

events.out.tfevents.1705667918.08134be46a59.4105.0   73.3 kB     LFS   Model save   11 months ago
events.out.tfevents.1705685379.08134be46a59.4105.1   828 Bytes   LFS   Model save   11 months ago