adjust spacing
Browse files
README.md
CHANGED
@@ -51,7 +51,7 @@ As these two datasets were originally in English, the linguists and native speak
|
|
51 |
|
52 |
IFEval evaluates a model's ability to adhere to constraints provided in the prompt, for example beginning a response with a specific word/phrase or answering with a certain number of sections. The metric used is accuracy normalized by language (if the model performs the task correctly but responds in the wrong language, it is judged to have failed the task).
|
53 |
|
54 |
-
| **Model** | **Indonesian
|
55 |
|:--------------------------------:|:------------------:|:------------------:|:---------------:|
|
56 |
| gemma-2-9b-it | 87.62 | 77.14 | 84.76 |
|
57 |
| Meta-Llama-3.1-8B-Instruct | 67.62 | 67.62 | 84.76 |
|
|
|
51 |
|
52 |
IFEval evaluates a model's ability to adhere to constraints provided in the prompt, for example beginning a response with a specific word/phrase or answering with a certain number of sections. The metric used is accuracy normalized by language (if the model performs the task correctly but responds in the wrong language, it is judged to have failed the task).
|
53 |
|
54 |
+
| **Model** | **Indonesian(%)** | **Vietnamese(%)** | **English(%)** |
|
55 |
|:--------------------------------:|:------------------:|:------------------:|:---------------:|
|
56 |
| gemma-2-9b-it | 87.62 | 77.14 | 84.76 |
|
57 |
| Meta-Llama-3.1-8B-Instruct | 67.62 | 67.62 | 84.76 |
|