For n-shot evaluations, is it possible to access the specific examples used in the evals? For example, SQuAD2 is run 4-shot: is it possible to find out which 4 examples were used?