Update README.md
README.md
CHANGED
@@ -109,7 +109,7 @@ We evaluated the models on the following datasets:
 - **llama-3.2-3b** performs slightly better on most tasks, especially in **Bangla MMLU**, where it scores 0.36 in 0-shot and 0.38 in 5-shot, compared to **titulm-llama-3.2-3b-v2.0** scoring lower at 0.26 and 0.24.
 
 
-| Model | Shots | Bangla MMLU | BoolQ BN | Commonsense QA | OpenBook QA | PIQA BN |
+| Model | Shots | Bangla MMLU | BoolQ BN | Commonsense QA BN | OpenBook QA BN | PIQA BN |
 |-------------------------------|---------|-------------|----------|----------------|-------------|---------|
 | llama-3.2-3b | 0-shot | **0.36** | 0.55 | 0.26 | 0.31 | 0.56 |
 | | 5-shot | **0.38** | - | 0.29 | 0.32 | 0.58 |