huangzixian committed
Commit 81fff73 · 1 Parent(s): 4381330
update readme

README.md CHANGED
@@ -1,10 +1,8 @@
 ### Model Sources
 
-
-
-
-
-Repository: https://github.com/CONE-MT/
+- **Math Reasoning**: https://huggingface.co/LLaMAX/LLaMAX2-7B-MetaMath
+- **Commonsense Reasoning**: https://huggingface.co/LLaMAX/LLaMAX2-7B-X-CSQA
+- **Natural Language Inference**: https://huggingface.co/LLaMAX/LLaMAX2-7B-XNLI
 
 ### Model Description
 
@@ -17,7 +15,7 @@ We collected extensive training sets in 102 languages for continued pre-training
 LLaMAX supports translation between more than 100 languages, surpassing the performance of similarly scaled LLMs.
 
 ```angular2html
-def 
+def Prompt_template(query, src_language, trg_language):
     instruction = f'Translate the following sentences from {src_language} to {trg_language}.'
     prompt = (
         'Below is an instruction that describes a task, paired with an input that provides further context. '
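The hunk above cuts off mid-function at the hunk boundary, so only the first lines of the prompt string are visible. For readers who want to try the template, here is a minimal runnable sketch; the `### Instruction / ### Input / ### Response` continuation is an assumption based on the standard Alpaca prompt format, not text taken from this commit.

```python
def Prompt_template(query, src_language, trg_language):
    # First two lines match the diff; the rest is a hypothetical
    # Alpaca-style continuation (assumed, not shown in this commit).
    instruction = f'Translate the following sentences from {src_language} to {trg_language}.'
    prompt = (
        'Below is an instruction that describes a task, paired with an input that provides further context. '
        'Write a response that appropriately completes the request.\n'
        f'### Instruction:\n{instruction}\n'
        f'### Input:\n{query}\n### Response:'
    )
    return prompt

# Example: build a Chinese-to-English translation prompt.
print(Prompt_template('你好，世界！', 'Chinese', 'English'))
```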
@@ -47,19 +45,19 @@ tokenizer.batch_decode(generate_ids, skip_special_tokens=True, clean_up_tokeniza
 
 ### 🔥 Excellent Translation Performance
 
-LLaMAX3-Alpaca achieves an average spBLEU score improvement of over **5 points** compared to the LLaMA3-8B-Alpaca model on the Flores-101 dataset.
+LLaMAX3-8B-Alpaca achieves an average spBLEU score improvement of over **5 points** compared to the LLaMA3-8B-Alpaca model on the Flores-101 dataset.
 
 
 | System | Size | en-X (COMET) | en-X (BLEU) | zh-X (COMET)| zh-X (BLEU) | de-X (COMET) | de-X (BLEU) | ne-X (COMET) | ne-X (BLEU) |ar-X (COMET) | ar-X (BLEU) | az-X (COMET) | az-X (BLEU) | ceb-X (COMET) | ceb-X (BLEU)|
 |--------------------|------|--------------------|-------------| ----| ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- |
-| LLaMA3-Alpaca | 8B |67.97|17.23|64.65|10.14|64.67|13.62|62.95|7.96|63.45|11.27|60.61|6.98|55.26|8.52|
-| LLaMAX3-Alpaca | 8B |75.52|22.77|73.16|14.43|73.47|18.95|75.13|15.32|72.29|16.42|72.06|12.41|68.88|15.85|
+| LLaMA3-8B-Alpaca | 8B |67.97|17.23|64.65|10.14|64.67|13.62|62.95|7.96|63.45|11.27|60.61|6.98|55.26|8.52|
+| LLaMAX3-8B-Alpaca | 8B |75.52|22.77|73.16|14.43|73.47|18.95|75.13|15.32|72.29|16.42|72.06|12.41|68.88|15.85|
 
 
 | System | Size | X-en (COMET) | X-en (BLEU) | X-zh (COMET)| X-zh (BLEU) | X-de (COMET) | X-de (BLEU) | X-ne (COMET) | X-ne (BLEU) |X-ar (COMET) | X-ar (BLEU) | X-az (COMET) | X-az (BLEU) | X-ceb (COMET) | X-ceb (BLEU) |
 |--------------------|------|----------------|-------------| ----| ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- |--------------|
-| LLaMA3-Alpaca | 8B |77.43|26.55|73.56|13.17|71.59|16.82|46.56|3.83|66.49|10.20|58.30|4.81|52.68|4.18|
-| LLaMAX3-Alpaca | 8B |81.28|31.85|78.34|16.46|76.23|20.64|65.83|14.16|75.84|15.45|70.61|9.32|63.35|12.66|
+| LLaMA3-8B-Alpaca | 8B |77.43|26.55|73.56|13.17|71.59|16.82|46.56|3.83|66.49|10.20|58.30|4.81|52.68|4.18|
+| LLaMAX3-8B-Alpaca | 8B |81.28|31.85|78.34|16.46|76.23|20.64|65.83|14.16|75.84|15.45|70.61|9.32|63.35|12.66|
 
 
 s
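The headline claim is an average over all Flores-101 directions; the tables above show only 14 of them (7 en/zh/de/ne/ar/az/ceb-X plus 7 X-into). As a sanity check over just those 14, the BLEU deltas can be averaged directly, with the scores copied verbatim from the table rows:

```python
# BLEU columns copied from the two tables above:
# en-X, zh-X, de-X, ne-X, ar-X, az-X, ceb-X, then X-en .. X-ceb.
llama3_alpaca = [17.23, 10.14, 13.62, 7.96, 11.27, 6.98, 8.52,
                 26.55, 13.17, 16.82, 3.83, 10.20, 4.81, 4.18]
llamax3_alpaca = [22.77, 14.43, 18.95, 15.32, 16.42, 12.41, 15.85,
                  31.85, 16.46, 20.64, 14.16, 15.45, 9.32, 12.66]

# Per-direction improvement and its mean over the 14 listed directions.
deltas = [b - a for a, b in zip(llama3_alpaca, llamax3_alpaca)]
avg_gain = sum(deltas) / len(deltas)
print(f'average BLEU gain over {len(deltas)} directions: {avg_gain:.2f}')
```

On this subset the mean gain is comfortably above 5 BLEU, consistent with the stated Flores-101 average.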
@@ -72,7 +70,7 @@ We implement multiple versions of the LLaMAX model, the model links are as follo
 | Model | LLaMAX | LLaMAX-Alpaca |
 |---------|----------------------------------------------------------|-----------------------------------------------------------------|
 | Llama-2 | [Link](https://huggingface.co/LLaMAX/LLaMAX2-7B) | [Link](https://huggingface.co/LLaMAX/LLaMAX2-7B-Alpaca) |
-| Llama-3 | [Link](https://huggingface.co/LLaMAX/LLaMAX3-8B) | [Link](https://huggingface.co/LLaMAX/LLaMAX3-8B-Alpaca) |
+| Llama-3 | [Link](https://huggingface.co/LLaMAX/LLaMAX3-8B-8B) | [Link](https://huggingface.co/LLaMAX/LLaMAX3-8B-8B-Alpaca) |
 
 ### Citation
 If our model helps your work, please cite this paper: