Updated README
Browse files
README.md
CHANGED
@@ -14,7 +14,7 @@ base_model:
|
|
14 |
---
|
15 |
|
16 |
|
17 |
-
# JAIS
|
18 |
|
19 |
|
20 |
## Model Overview
|
@@ -317,6 +317,7 @@ The Atlas-Chat models were evaluated on a comprehensive suite of tasks using var
|
|
317 |
* **DarijaMMLU:** A Darija version of ArabicMMLU and MMLU benchmarks translated from MSA and English respectively.
|
318 |
* **DarijaHellaSwag:** A Darija version of HellaSwag.
|
319 |
* **Belebele Ary_Arab:** Belebele is a multiple-choice machine reading comprehension dataset published by Facebook spanning 122 language variants. The Evaluation is done on the Ary_Arab part of Belebele that refers to Darija.
|
|
|
320 |
* **Sentiment Analysis.**
|
321 |
* **Translation:** Including six directions and four languages: Darija, MSA, English and French.
|
322 |
* **Transliteration:** Transforming a sentence from Darija (written in Arabic characters) to Arabizi (Written in Latin characters) and vice-versa.
|
@@ -331,42 +332,49 @@ The models were compared against a collection of existing open-source Arabic mod
|
|
331 |
<td><a href="https://huggingface.co/datasets/MBZUAI-Paris/DarijaMMLU" target="_blank">DarijaMMLU</a></td>
|
332 |
<td><a href="MBZUAI-Paris/DarijaHellaSwag" target="_blank">DarijaHellaSwag</a></td>
|
333 |
<td ><a href="https://huggingface.co/datasets/facebook/belebele/viewer/ary_Arab" target="_blank">Belebele Ary</a></td>
|
|
|
334 |
</tr>
|
335 |
<tr>
|
336 |
<td><a href="https://huggingface.co/inceptionai/jais-family-1p3b-chat" target="_blank">jais-family-1p3b-chat</a></td>
|
337 |
<td>35.39</td>
|
338 |
<td>32.51</td>
|
339 |
<td>38.33</td>
|
|
|
340 |
</tr>
|
341 |
<tr>
|
342 |
<td><a href="https://huggingface.co/inceptionai/jais-family-2p7b-chat" target="_blank">jais-family-2p7b-chat</a></td>
|
343 |
<td>37.44</td>
|
344 |
<td>34.49</td>
|
345 |
<td>44.11</td>
|
|
|
346 |
</tr>
|
347 |
<tr>
|
348 |
<td><a href="https://huggingface.co/google/gemma-2-2b-it" target="_blank">gemma-2-2b-it</a></td>
|
349 |
<td>28.58</td>
|
350 |
<td>32.42</td>
|
351 |
<td>25.22</td>
|
|
|
352 |
</tr>
|
353 |
<tr>
|
354 |
<td><a href="meta-llama/Llama-3.2-1B-Instruct" target="_blank">Llama-3.2-1B-Instruct</a></td>
|
355 |
<td>27.66</td>
|
356 |
<td>26.88</td>
|
357 |
<td>28.89</td>
|
|
|
358 |
</tr>
|
359 |
<tr>
|
360 |
<td><a href="meta-llama/Llama-3.2-3B-Instruct" target="_blank">Llama-3.2-3B-Instruct</a></td>
|
361 |
<td>32.60</td>
|
362 |
<td>28.33</td>
|
363 |
<td>38.00</td>
|
|
|
364 |
</tr>
|
365 |
<tr>
|
366 |
<td><strong><a href="https://huggingface.co/MBZUAI-Paris/Atlas-Chat-2B" target="_blank">Atlas-Chat-2B</a></strong></td>
|
367 |
-
<td><b>44.97</td>
|
368 |
-
<td><b>41.48</td>
|
369 |
-
<td><b>53.89</td>
|
|
|
370 |
</tr>
|
371 |
<tr style="border-top: 4px solid;"></tr>
|
372 |
<tr>
|
@@ -374,54 +382,63 @@ The models were compared against a collection of existing open-source Arabic mod
|
|
374 |
<td>39.96</td>
|
375 |
<td>41.57</td>
|
376 |
<td>51.22</td>
|
|
|
377 |
</tr>
|
378 |
<tr>
|
379 |
<td><a href="https://huggingface.co/inceptionai/jais-adapted-7b-chat" target="_blank">jais-adapted-7b-chat</a></td>
|
380 |
<td>39.30</td>
|
381 |
<td>35.19</td>
|
382 |
<td>43.67</td>
|
|
|
383 |
</tr>
|
384 |
<tr>
|
385 |
<td><a href="https://huggingface.co/inceptionai/jais-family-13b-chat" target="_blank">jais-family-13b-chat</a></td>
|
386 |
<td>45.11</td>
|
387 |
<td>43.90</td>
|
388 |
<td>58.67</td>
|
|
|
389 |
</tr>
|
390 |
<tr>
|
391 |
<td><a href="https://huggingface.co/inceptionai/jais-adapted-13b-chat" target="_blank">jais-adapted-13b-chat</a></td>
|
392 |
<td>45.20</td>
|
393 |
<td>40.65</td>
|
394 |
<td>49.67</td>
|
|
|
395 |
</tr>
|
396 |
<tr>
|
397 |
<td><a href="https://huggingface.co/FreedomIntelligence/AceGPT-7B-chat" target="_blank">AceGPT-7b-chat</a></td>
|
398 |
<td>35.98</td>
|
399 |
<td>36.57</td>
|
400 |
<td>30.11</td>
|
|
|
401 |
</tr>
|
402 |
<tr>
|
403 |
<td><a href="https://huggingface.co/FreedomIntelligence/AceGPT-13B-chat" target="_blank">AceGPT-13b-chat</a></td>
|
404 |
<td>41.09</td>
|
405 |
<td>38.35</td>
|
406 |
<td>33.11</td>
|
|
|
407 |
</tr>
|
408 |
<tr>
|
409 |
<td><a href="https://huggingface.co/google/gemma-2-9b-it" target="_blank">gemma-2-9b-it</a></td>
|
410 |
<td>35.91</td>
|
411 |
<td>42.43</td>
|
412 |
<td>31.00</td>
|
|
|
413 |
</tr>
|
414 |
<tr>
|
415 |
<td><a href="meta-llama/Meta-Llama-3.1-8B-Instruct" target="_blank">Llama-3.1-8B-Instruct</a></td>
|
416 |
<td>44.13</td>
|
417 |
<td>38.24</td>
|
418 |
<td>47.00</td>
|
|
|
419 |
</tr>
|
420 |
<tr>
|
421 |
<td><strong><a href="https://huggingface.co/MBZUAI-Paris/Atlas-Chat-9B" target="_blank">Atlas-Chat-9B</a></strong></td>
|
422 |
-
<td><b>58.23</td>
|
423 |
-
<td><b>57.75</td>
|
424 |
-
<td><b>74.56</td>
|
|
|
425 |
</tr>
|
426 |
<tr style="border-top: 4px solid;"></tr>
|
427 |
<tr>
|
@@ -429,22 +446,22 @@ The models were compared against a collection of existing open-source Arabic mod
|
|
429 |
<td>51.88</td>
|
430 |
<td>35.61</td>
|
431 |
<td>65.67</td>
|
|
|
432 |
</tr>
|
433 |
<tr>
|
434 |
<td><a href="https://huggingface.co/google/gemma-2-27b-it" target="_blank">gemma-2-27b-it</a></td>
|
435 |
<td>36.47</td>
|
436 |
<td>37.04</td>
|
437 |
<td>35.78</td>
|
|
|
438 |
</tr>
|
439 |
<tr>
|
440 |
<td><strong><a href="https://huggingface.co/MBZUAI-Paris/Atlas-Chat-27B" target="_blank">Atlas-Chat-27B</a></strong></td>
|
441 |
-
<td><b>61.95</td>
|
442 |
-
<td><b>48.37</td>
|
443 |
-
<td><b>75.67</td>
|
|
|
444 |
</tr>
|
445 |
-
|
446 |
-
|
447 |
-
|
448 |
</table>
|
449 |
|
450 |
**Standard NLP Tasks:**
|
|
|
14 |
---
|
15 |
|
16 |
|
17 |
+
# JAIS Intiative: Atlas-Chat Models
|
18 |
|
19 |
|
20 |
## Model Overview
|
|
|
317 |
* **DarijaMMLU:** A Darija version of ArabicMMLU and MMLU benchmarks translated from MSA and English respectively.
|
318 |
* **DarijaHellaSwag:** A Darija version of HellaSwag.
|
319 |
* **Belebele Ary_Arab:** Belebele is a multiple-choice machine reading comprehension dataset published by Facebook spanning 122 language variants. The Evaluation is done on the Ary_Arab part of Belebele that refers to Darija.
|
320 |
+
* **DarijaAlpacaEval:** A Darija version of AlpacaEval translated to Darija and adapted to the Moroccan culture.
|
321 |
* **Sentiment Analysis.**
|
322 |
* **Translation:** Including six directions and four languages: Darija, MSA, English and French.
|
323 |
* **Transliteration:** Transforming a sentence from Darija (written in Arabic characters) to Arabizi (Written in Latin characters) and vice-versa.
|
|
|
332 |
<td><a href="https://huggingface.co/datasets/MBZUAI-Paris/DarijaMMLU" target="_blank">DarijaMMLU</a></td>
|
333 |
<td><a href="MBZUAI-Paris/DarijaHellaSwag" target="_blank">DarijaHellaSwag</a></td>
|
334 |
<td ><a href="https://huggingface.co/datasets/facebook/belebele/viewer/ary_Arab" target="_blank">Belebele Ary</a></td>
|
335 |
+
<td ><a href="https://huggingface.co/datasets/MBZUAI-Paris/DarijaAlpacaEval" target="_blank">DarijaAlpacaEval</a></td>
|
336 |
</tr>
|
337 |
<tr>
|
338 |
<td><a href="https://huggingface.co/inceptionai/jais-family-1p3b-chat" target="_blank">jais-family-1p3b-chat</a></td>
|
339 |
<td>35.39</td>
|
340 |
<td>32.51</td>
|
341 |
<td>38.33</td>
|
342 |
+
<td>35.56</td>
|
343 |
</tr>
|
344 |
<tr>
|
345 |
<td><a href="https://huggingface.co/inceptionai/jais-family-2p7b-chat" target="_blank">jais-family-2p7b-chat</a></td>
|
346 |
<td>37.44</td>
|
347 |
<td>34.49</td>
|
348 |
<td>44.11</td>
|
349 |
+
<td>52.97</td>
|
350 |
</tr>
|
351 |
<tr>
|
352 |
<td><a href="https://huggingface.co/google/gemma-2-2b-it" target="_blank">gemma-2-2b-it</a></td>
|
353 |
<td>28.58</td>
|
354 |
<td>32.42</td>
|
355 |
<td>25.22</td>
|
356 |
+
<td>58.67</td>
|
357 |
</tr>
|
358 |
<tr>
|
359 |
<td><a href="meta-llama/Llama-3.2-1B-Instruct" target="_blank">Llama-3.2-1B-Instruct</a></td>
|
360 |
<td>27.66</td>
|
361 |
<td>26.88</td>
|
362 |
<td>28.89</td>
|
363 |
+
<td>23.57</td>
|
364 |
</tr>
|
365 |
<tr>
|
366 |
<td><a href="meta-llama/Llama-3.2-3B-Instruct" target="_blank">Llama-3.2-3B-Instruct</a></td>
|
367 |
<td>32.60</td>
|
368 |
<td>28.33</td>
|
369 |
<td>38.00</td>
|
370 |
+
<td>47.62</td>
|
371 |
</tr>
|
372 |
<tr>
|
373 |
<td><strong><a href="https://huggingface.co/MBZUAI-Paris/Atlas-Chat-2B" target="_blank">Atlas-Chat-2B</a></strong></td>
|
374 |
+
<td><b>44.97</b></td>
|
375 |
+
<td><b>41.48</b></td>
|
376 |
+
<td><b>53.89</b></td>
|
377 |
+
<td><b>92.31</b></td>
|
378 |
</tr>
|
379 |
<tr style="border-top: 4px solid;"></tr>
|
380 |
<tr>
|
|
|
382 |
<td>39.96</td>
|
383 |
<td>41.57</td>
|
384 |
<td>51.22</td>
|
385 |
+
<td>65.18</td>
|
386 |
</tr>
|
387 |
<tr>
|
388 |
<td><a href="https://huggingface.co/inceptionai/jais-adapted-7b-chat" target="_blank">jais-adapted-7b-chat</a></td>
|
389 |
<td>39.30</td>
|
390 |
<td>35.19</td>
|
391 |
<td>43.67</td>
|
392 |
+
<td>61.84</td>
|
393 |
</tr>
|
394 |
<tr>
|
395 |
<td><a href="https://huggingface.co/inceptionai/jais-family-13b-chat" target="_blank">jais-family-13b-chat</a></td>
|
396 |
<td>45.11</td>
|
397 |
<td>43.90</td>
|
398 |
<td>58.67</td>
|
399 |
+
<td>69.93</td>
|
400 |
</tr>
|
401 |
<tr>
|
402 |
<td><a href="https://huggingface.co/inceptionai/jais-adapted-13b-chat" target="_blank">jais-adapted-13b-chat</a></td>
|
403 |
<td>45.20</td>
|
404 |
<td>40.65</td>
|
405 |
<td>49.67</td>
|
406 |
+
<td>77.52</td>
|
407 |
</tr>
|
408 |
<tr>
|
409 |
<td><a href="https://huggingface.co/FreedomIntelligence/AceGPT-7B-chat" target="_blank">AceGPT-7b-chat</a></td>
|
410 |
<td>35.98</td>
|
411 |
<td>36.57</td>
|
412 |
<td>30.11</td>
|
413 |
+
<td>47.31</td>
|
414 |
</tr>
|
415 |
<tr>
|
416 |
<td><a href="https://huggingface.co/FreedomIntelligence/AceGPT-13B-chat" target="_blank">AceGPT-13b-chat</a></td>
|
417 |
<td>41.09</td>
|
418 |
<td>38.35</td>
|
419 |
<td>33.11</td>
|
420 |
+
<td>52.79</td>
|
421 |
</tr>
|
422 |
<tr>
|
423 |
<td><a href="https://huggingface.co/google/gemma-2-9b-it" target="_blank">gemma-2-9b-it</a></td>
|
424 |
<td>35.91</td>
|
425 |
<td>42.43</td>
|
426 |
<td>31.00</td>
|
427 |
+
<td>90.86</td>
|
428 |
</tr>
|
429 |
<tr>
|
430 |
<td><a href="meta-llama/Meta-Llama-3.1-8B-Instruct" target="_blank">Llama-3.1-8B-Instruct</a></td>
|
431 |
<td>44.13</td>
|
432 |
<td>38.24</td>
|
433 |
<td>47.00</td>
|
434 |
+
<td>78.08</td>
|
435 |
</tr>
|
436 |
<tr>
|
437 |
<td><strong><a href="https://huggingface.co/MBZUAI-Paris/Atlas-Chat-9B" target="_blank">Atlas-Chat-9B</a></strong></td>
|
438 |
+
<td><b>58.23</b></td>
|
439 |
+
<td><b>57.75</b></td>
|
440 |
+
<td><b>74.56</b></td>
|
441 |
+
<td><b>95.62</b></td>
|
442 |
</tr>
|
443 |
<tr style="border-top: 4px solid;"></tr>
|
444 |
<tr>
|
|
|
446 |
<td>51.88</td>
|
447 |
<td>35.61</td>
|
448 |
<td>65.67</td>
|
449 |
+
<td>24.64</td>
|
450 |
</tr>
|
451 |
<tr>
|
452 |
<td><a href="https://huggingface.co/google/gemma-2-27b-it" target="_blank">gemma-2-27b-it</a></td>
|
453 |
<td>36.47</td>
|
454 |
<td>37.04</td>
|
455 |
<td>35.78</td>
|
456 |
+
<td>95.07</td>
|
457 |
</tr>
|
458 |
<tr>
|
459 |
<td><strong><a href="https://huggingface.co/MBZUAI-Paris/Atlas-Chat-27B" target="_blank">Atlas-Chat-27B</a></strong></td>
|
460 |
+
<td><b>61.95</b></td>
|
461 |
+
<td><b>48.37</b></td>
|
462 |
+
<td><b>75.67</b></td>
|
463 |
+
<td><b>96.58</b></td>
|
464 |
</tr>
|
|
|
|
|
|
|
465 |
</table>
|
466 |
|
467 |
**Standard NLP Tasks:**
|