Update README.md
README.md
CHANGED
@@ -23,8 +23,8 @@ This is a fine-tuned version of [unsloth/Llama-3.2-1B](https://huggingface.co/un
 We evaluate both models on GSM8K using the standard lm-eval 5-shot exact-match protocol. Under identical decoding and extraction settings, GsMath-Llama-1B outperforms Meta’s Llama-3.2-1B by about 2x, demonstrating an improvement in small-model mathematical capability.
 | Model | Params | GSM8K (5-shot, EM) |
 | ----------------------------- | ------ | ------------------ |
-| **GsMath-Llama-1B** | 1B | **
-| Llama-3.2-1B | 1B |
+| **GsMath-Llama-1B** | 1B | **0.137** |
+| Llama-3.2-1B | 1B | 0.068 |
 
 
 <p align="center">
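
As a quick sanity check on the "2x" claim, the two scores from the updated table can be compared directly. This is only a minimal arithmetic sketch: the variable names are illustrative, and the numbers are copied from the table rather than recomputed from an evaluation run.

```python
# Scores copied from the updated table (GSM8K, 5-shot, exact match).
# Variable names are illustrative and do not come from the README.
gsmath_em = 0.137  # GsMath-Llama-1B
base_em = 0.068    # Llama-3.2-1B

# Relative improvement factor of the fine-tuned model over the base model.
ratio = gsmath_em / base_em

# Absolute gain, expressed in percentage points of exact-match accuracy.
abs_gain_points = (gsmath_em - base_em) * 100

print(f"ratio: {ratio:.2f}x")                          # ratio: 2.01x
print(f"absolute gain: {abs_gain_points:.1f} points")  # absolute gain: 6.9 points
```

The ratio supports the "about 2x" wording. The absolute gain is roughly 6.9 points, which is worth keeping in mind because both scores are low in absolute terms.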