Create README.md
README.md
ADDED
@@ -0,0 +1,39 @@
| 1 |
+
---
license: apache-2.0
datasets:
- EleutherAI/muInstruct
- camel-ai/math
language:
- en
tags:
- math
---
|
| 11 |
+
|
| 12 |
+
`llemma_7b_muinstruct_camelmath` is an instruction-following finetune of [Llemma 7B](https://huggingface.co/EleutherAI/llemma_7b), trained on the [μInstruct](https://huggingface.co/datasets/EleutherAI/muInstruct) and [camel-ai/math](https://huggingface.co/datasets/camel-ai/math) datasets.
|
| 13 |
+
|
| 14 |
+
## Input Formatting
|
| 15 |
+
Format input queries as follows:
|
| 16 |
+
```
input_text = f"Input:{input}\n\nResponse:"
```
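
As a minimal sketch, the template can be wrapped in a small helper. The function name `build_prompt` and the sample question are hypothetical and not part of this repo; only the template string itself comes from this README.

```python
def build_prompt(query: str) -> str:
    """Wrap a user query in the Input/Response template given in this README."""
    # No space after "Input:" and a blank line before "Response:", as in the template.
    return f"Input:{query}\n\nResponse:"


# Hypothetical example query, for illustration only.
prompt = build_prompt("What is 2 + 2?")
print(prompt)
```

Using a parameter named `query` rather than `input` avoids shadowing Python's built-in `input` function; the resulting string is identical.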
|
| 19 |
+
|
| 20 |
+
Note that due to an error during training, this model's end-of-sequence token ID is `0` instead of the `2` which is standard for Llama-2 based models. Inference APIs should handle this automatically by reading this repo's `config.json`, but be aware of this difference if you are doing token surgery.
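
If you post-process generated token IDs by hand, one place this difference shows up is when cutting the output at the end-of-sequence token. The following is a sketch only: `truncate_at_eos` and the sample token IDs are hypothetical, and the only value taken from this README is the EOS ID of `0`.

```python
# This model's end-of-sequence token ID, per the note above.
# The usual value for Llama-2 based models is 2.
EOS_TOKEN_ID = 0


def truncate_at_eos(token_ids, eos_token_id=EOS_TOKEN_ID):
    """Return the tokens before the first EOS token, or all tokens if none occurs."""
    kept = []
    for token_id in token_ids:
        if token_id == eos_token_id:
            break
        kept.append(token_id)
    return kept


# Made-up token IDs for illustration: generation stops at the first 0, not at 2.
generated = [5, 9, 2, 14, 0, 7]
print(truncate_at_eos(generated))
```

Code that hard-codes `2` as the stop token would treat the `2` in this sequence as the end and miss the real EOS token.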
|
| 21 |
+
|
| 22 |
+
## Evals
|
| 23 |
+
`llemma_7b_muinstruct_camelmath` compares favorably to other 7B parameter models on the [Hungarian Math Exam](https://huggingface.co/datasets/keirp/hungarian_national_hs_finals_exam/blob/main/README.md). It surpasses the few-shot performance of Llemma 7B and is the strongest of the Llama-2 7B based models listed below.
|
| 25 |
+
|
| 26 |
+
| Model                                                                          | Exam Score |
| ------------------------------------------------------------------------------ | ---------- |
| [Code Llama 7B](https://huggingface.co/codellama/CodeLlama-7b-hf) (few-shot)   | 8%         |
| [MetaMath 7B](https://huggingface.co/meta-math/MetaMath-7B-V1.0)               | 20%        |
| [MAmmoTH 7B](https://huggingface.co/TIGER-Lab/MAmmoTH-7B)                      | 17%        |
| [MAmmoTH Coder 7B](https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-7B)          | 11%        |
| [Llemma 7B](https://huggingface.co/EleutherAI/llemma_7b) (few-shot)            | 23%        |
| `llemma_7b_muinstruct_camelmath`                                               | 25%        |
| -                                                                              | -          |
| [Mistral 7B](https://huggingface.co/mistralai/Mistral-7B-v0.1) (few-shot)      | 22%        |
| [MetaMath Mistral 7B](https://huggingface.co/meta-math/MetaMath-Mistral-7B)    | 29%        |
| [OpenChat 3.5](https://huggingface.co/openchat/openchat_3.5)                   | 37%        |
|
| 38 |
+
|
| 39 |
+
|