logical-reasoning/data/Qwen2.5-72B-Instruct_shots_metrics.csv
shots,model,run,accuracy,precision,recall,f1,ratio_valid_classifications
0,Qwen2.5-72B-Instruct,Qwen/Qwen2.5-72B-Instruct/shots-00,0.7956666666666666,0.8098073411161181,0.7956666666666666,0.7771317592221199,0.994
5,Qwen2.5-72B-Instruct,Qwen/Qwen2.5-72B-Instruct/shots-05,0.819,0.8182324679666184,0.819,0.8095367865845521,0.9416666666666667