Mathoctopus/Cross_7B
 - fr
 - bn
 ---
# 🐙 Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations
Project Page: [https://mathoctopus.github.io/](https://mathoctopus.github.io/)
Paper: [https://arxiv.org/abs/2310.20246](https://arxiv.org/abs/2310.20246)
Code: [https://github.com/microsoft/MathOctopus](https://github.com/microsoft/MathOctopus)
 ### Introduction
We introduce 🐙 MathOctopus, a series of open-source large language models (LLMs) tailored for multilingual math problem-solving. The MathOctopus models are trained on the 🤗 MGSM8KInstruct dataset, which covers ten languages.
`*-Cross` refers to models trained with the cross-training strategy.
`*-xRFT` refers to models trained with multilingual rejection sampling.
In the tables below, the superscripts C and P mark the cross-training and parallel-training variants.
 ### **Overall Results on MGSM**
 | 7B Model                        | En      | Sw      | Zh      | Bn      | De      | Es      | Fr      | Ja      | Ru      | Th      | Overall |
 |:--------------------------------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|
 | MathOctopus<sup>C</sup>         | 52.0    | 23.6    | 31.6    | 18.8    | 38.0    | 39.2    | 36.4    | 27.2    | 33.6    | 21.6    | 32.2    |
 | **xRFT**-MathOctopus<sup>C</sup>| 53.6    | 27.6    | 34.4    | 19.2    | 47.2    | 47.6    | 44.8    | 30.8    | 38.8    | 22.8    | 36.7    |
 | MathOctopus<sup>P</sup>         | 56.4    | 46.8    | 52.0    | 35.2    | 47.2    | 53.2    | 48.0    | 39.2    | 45.6    | 41.2    | 46.5    |
 | **xRFT**-MathOctopus<sup>P</sup>| 51.6    | 47.2    | 52.4    | 37.6    | 51.2    | 52.8    | 44.4    | 41.6    | 50.0    | 47.6    | 47.6    |
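
The Overall column in the table above appears to be the unweighted mean of the ten per-language accuracies, rounded to one decimal. The following is a minimal sketch that checks this against the four rows shown; the dictionary keys and the `overall` helper are illustrative names, not part of the MathOctopus code.

```python
# MGSM accuracies (%) copied from the 7B rows above, in column order
# En, Sw, Zh, Bn, De, Es, Fr, Ja, Ru, Th.
MGSM_7B = {
    "MathOctopus-C": [52.0, 23.6, 31.6, 18.8, 38.0, 39.2, 36.4, 27.2, 33.6, 21.6],
    "xRFT-MathOctopus-C": [53.6, 27.6, 34.4, 19.2, 47.2, 47.6, 44.8, 30.8, 38.8, 22.8],
    "MathOctopus-P": [56.4, 46.8, 52.0, 35.2, 47.2, 53.2, 48.0, 39.2, 45.6, 41.2],
    "xRFT-MathOctopus-P": [51.6, 47.2, 52.4, 37.6, 51.2, 52.8, 44.4, 41.6, 50.0, 47.6],
}

# Overall values as reported in the last column of the same rows.
REPORTED_OVERALL = {
    "MathOctopus-C": 32.2,
    "xRFT-MathOctopus-C": 36.7,
    "MathOctopus-P": 46.5,
    "xRFT-MathOctopus-P": 47.6,
}


def overall(scores):
    """Unweighted mean over languages, rounded to one decimal place."""
    return round(sum(scores) / len(scores), 1)


for name, scores in MGSM_7B.items():
    print(f"{name}: computed {overall(scores)}, reported {REPORTED_OVERALL[name]}")
```

Under this reading, every language counts equally toward the overall score, so a gain on a low-resource language such as Sw or Th moves the average as much as the same gain on En.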
 ### **Overall Results on MSVAMP**
 | 7B Model                        | En      | Sw      | Zh      | Bn      | De      | Es      | Fr      | Ja      | Ru      | Th      | Overall |
 |:--------------------------------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|:--------|
 | MathOctopus<sup>C</sup>         | 49.2    | 36.6    | 43.6    | 30.2    | 48.6    | 46.8    | 46.4    | 42.5    | 46.7    | 34.0    | 42.5    |
 | **xRFT**-MathOctopus<sup>C</sup>| 48.1    | 42.8    | 43.6    | 23.3    | 48.7    | 50.0    | 48.9    | 43.4    | 44.6    | 35.5    | 42.9    |
 | MathOctopus<sup>P</sup>         | 56.4    | 46.8    | 52.0    | 35.2    | 47.2    | 53.2    | 48.0    | 39.2    | 45.6    | 41.2    | 46.5    |
 | **xRFT**-MathOctopus<sup>P</sup>| 48.0    | 42.3    | 46.1    | 36.2    | 47.5    | 48.5    | 48.3    | 45.8    | 47.2    | 41.2    | 45.1    |
 ### **MathOctopus in English**
 | Models                          | GSM8K   | SVAMP   |
 |:--------------------------------|:--------|:--------|
 | LLaMA 2-7B                      | 42.4    | 38.3    |
 | LLaMA 1-33B                     | 50.0    | 49.0    |
 | MathOctopus<sup>P</sup>-33B     | 56.0    | 52.5    |
 | MathOctopus<sup>C</sup>-33B     | 53.7    | 51.5    |
 ## Intended Uses
These models are intended for research use. They are designed to solve math problems posed in multiple languages and can be used in educational software, tutoring systems, or other applications that need a solution to a math problem.
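
For readers who want to try a checkpoint, the sketch below shows one possible way to query it with the Hugging Face `transformers` library. It makes two assumptions. First, the repository id `Mathoctopus/Cross_7B` is taken from this page's path. Second, the prompt built by `build_prompt` is a hypothetical plain-text prompt, not the official MathOctopus template; see the project code for the prompt used in training.

```python
def build_prompt(question: str, language: str = "English") -> str:
    """Hypothetical plain prompt; NOT the official MathOctopus template."""
    return (
        f"Solve the following math problem step by step and answer in {language}.\n\n"
        f"Problem: {question}\n\nSolution:"
    )


def generate_answer(
    question: str,
    language: str = "English",
    model_id: str = "Mathoctopus/Cross_7B",  # assumed repository id
    max_new_tokens: int = 256,
) -> str:
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(build_prompt(question, language), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens and decode only the newly generated text.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_answer("Tom has 3 apples and buys 4 more. How many apples does he have?")` downloads the weights on first use, so it needs network access and enough memory for a 7B model.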
## Citation
If you use our data, models, or code, please cite our paper. Please also cite the original dataset papers.
```
@misc{chen2023breaking,
      title={Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations},
      author={Nuo Chen and Zinan Zheng and Ning Wu and Linjun Shou and Ming Gong and Yangqiu Song and Dongmei Zhang and Jia Li},
      year={2023},
      eprint={2310.20246},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```