CannaeAI commited on
Commit
df824a0
·
verified ·
1 Parent(s): c73ec2c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -23,8 +23,8 @@ This is a fine-tuned version of [unsloth/Llama-3.2-1B](https://huggingface.co/un
23
  We evaluate both models on GSM8K using the standard lm-eval 5-shot exact-match protocol. Under identical decoding and extraction settings,GsMath-Llama-1B outperforms Meta’s Llama-3.2-1B by 2x,demonstrating an improvement in small-model mathematical capability.
24
  | Model | Params | GSM8K (5-shot, EM) |
25
  | ----------------------------- | ------ | ------------------ |
26
- | **GsMath-Llama-1B** | 1B | **13.7%** |
27
- | Llama-3.2-1B | 1B | 6.8% |
28
 
29
 
30
  <p align="center">
 
23
  We evaluate both models on GSM8K using the standard lm-eval 5-shot exact-match protocol. Under identical decoding and extraction settings,GsMath-Llama-1B outperforms Meta’s Llama-3.2-1B by 2x,demonstrating an improvement in small-model mathematical capability.
24
  | Model | Params | GSM8K (5-shot, EM) |
25
  | ----------------------------- | ------ | ------------------ |
26
+ | **GsMath-Llama-1B** | 1B | **0.137** |
27
+ | Llama-3.2-1B | 1B | 0.068 |
28
 
29
 
30
  <p align="center">