---
title: README
emoji: ๐
colorFrom: blue
colorTo: gray
sdk: static
pinned: true
---

# Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations

Project Page: [https://mathoctopus.github.io/](https://mathoctopus.github.io/)

Paper: [https://arxiv.org/abs/2310.20246.pdf](https://arxiv.org/abs/2310.20246.pdf)

Code: [https://github.com/microsoft/MathOctopus](https://github.com/microsoft/MathOctopus)

## About MathOctopus

MathOctopus is a series of multilingual math reasoning large language models (LLMs) based on LLaMA.

This work pioneers the exploration and construction of powerful Multilingual Math Reasoning (xMR) LLMs. To this end, we make the following contributions:

- **MGSM8KInstruct**, a multilingual math reasoning instruction dataset covering ten distinct languages, which addresses the scarcity of training data for xMR tasks.
- **MSVAMP**, an out-of-domain xMR test dataset for a more exhaustive and comprehensive evaluation of the model's multilingual mathematical capabilities.
- **MathOctopus**, our effective multilingual math reasoning LLMs, trained with different strategies, which notably outperform conventional open-source LLMs and exhibit superiority over ChatGPT in few-shot scenarios.