Cannae-AI
/

GsMath-Llama-1B

Text Generation

text-generation-inference

Model card Files Files and versions

CannaeAI commited on Nov 18, 2025

Commit

df824a0

·

verified ·

1 Parent(s): c73ec2c

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -23,8 +23,8 @@ This is a fine-tuned version of [unsloth/Llama-3.2-1B](https://huggingface.co/un
 We evaluate both models on GSM8K using the standard lm-eval 5-shot exact-match protocol. Under identical decoding and extraction settings,GsMath-Llama-1B outperforms Meta’s Llama-3.2-1B by 2x,demonstrating an improvement in small-model mathematical capability.
 | Model                         | Params | GSM8K (5-shot, EM) |
 | ----------------------------- | ------ | ------------------ |
-| **GsMath-Llama-1B**           | 1B     | **13.7%**          |
-| Llama-3.2-1B                  | 1B     | 6.8%               |
 <p align="center">

 We evaluate both models on GSM8K using the standard lm-eval 5-shot exact-match protocol. Under identical decoding and extraction settings,GsMath-Llama-1B outperforms Meta’s Llama-3.2-1B by 2x,demonstrating an improvement in small-model mathematical capability.
 | Model                         | Params | GSM8K (5-shot, EM) |
 | ----------------------------- | ------ | ------------------ |
+| **GsMath-Llama-1B**           | 1B     | **0.137**          |
+| Llama-3.2-1B                  | 1B     | 0.068              |
 <p align="center">