logical-reasoning/data/internlm2_5-20b-chat_shots_metrics.csv
shots,model,run,accuracy,precision,recall,f1,ratio_valid_classifications
0,internlm2_5-20b-chat,internlm/internlm2_5-20b-chat/shots-00,0.575,0.7745319004159336,0.575,0.6416875854199033,0.6726666666666666