logical-reasoning/data/internlm2_5-7b-chat_shots_metrics.csv
10-shot results ready for 7/8 B models
3db2ae5
shots,model,run,accuracy,precision,recall,f1,ratio_valid_classifications
0,internlm2_5-7b-chat,internlm/internlm2_5-7b-chat/shots-00,0.705,0.7398041613378253,0.705,0.6906357423169466,1.0
10,internlm2_5-7b-chat,internlm/internlm2_5-7b-chat/shots-10,0.5533333333333333,0.7301739373336078,0.5533333333333333,0.625097481985829,0.9883333333333333