Cannae-AI
/

GsMath-Llama-1B

CannaeAI committed on Nov 18, 2025

Commit

c73ec2c

1 Parent(s): 1652ce6

Update README.md

Files changed (1)

README.md +31 -0

	@@ -1 +1,32 @@
+---
+base_model:
+- unsloth/Llama-3.2-1B
+tags:
+- text-generation-inference
+- transformers
+- math
+- conversational
+- llama
+- meta
+license: apache-2.0
+language:
+- en
+library_name: transformers
+---
+# GsMath-Llama-1B
+## Model Description:
+This is a fine-tuned version of [unsloth/Llama-3.2-1B](https://huggingface.co/unsloth/Llama-3.2-1B).
+- **Recommended settings for inference:** `min_p = 0.1` and `temperature = 1.5`. Read this [Tweet](https://x.com/menhguin/status/1826132708508213629) to understand why.
+- **License:** apache-2.0
+- **Fine-tuned from model:** unsloth/Llama-3.2-1B
+## Benchmarks:
+We evaluate GsMath-Llama-1B and Meta's Llama-3.2-1B on GSM8K using the standard lm-eval 5-shot exact-match protocol. Under identical decoding and extraction settings, GsMath-Llama-1B scores 13.7% versus 6.8% for Llama-3.2-1B, roughly a 2x improvement in small-model mathematical capability.
+| Model                         | Params | GSM8K (5-shot, EM) |
+| ----------------------------- | ------ | ------------------ |
+| **GsMath-Llama-1B**           | 1B     | **13.7%**          |
+| Llama-3.2-1B                  | 1B     | 6.8%               |
+<p align="center">
+  <img alt="GsMath-Llama-1B" src="https://huggingface.co/Cannae-AI/ReasoningLlama-Math-1B-IT/resolve/main/ChatGPT%20Image%2018%20nov.%202025%2C%2020_55_23.png">
+</p>
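
The recommended inference settings in the README (`min_p = 0.1`, `temperature = 1.5`) can be illustrated with a small standalone sketch. This is not the model's or any library's actual sampling code. It is a simplified illustration, assuming temperature scaling is applied before the min-p filter, and the helper names `min_p_filter` and `sample_token` as well as the toy logits are hypothetical.

```python
import math
import random


def min_p_filter(logits, temperature=1.5, min_p=0.1):
    """Return renormalized token probabilities after temperature scaling
    and min-p filtering (illustrative sketch, not a library implementation)."""
    # Temperature scaling: values above 1 flatten the distribution.
    scaled = [x / temperature for x in logits]
    # Numerically stable softmax.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Min-p: drop tokens whose probability is below min_p times the top probability.
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    kept_total = sum(kept)
    return [p / kept_total for p in kept]


def sample_token(logits, temperature=1.5, min_p=0.1, rng=None):
    """Sample one token index from the filtered distribution."""
    rng = rng or random.Random()
    probs = min_p_filter(logits, temperature, min_p)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    toy_logits = [5.0, 4.0, 1.0, -2.0]  # hypothetical next-token logits
    print(min_p_filter(toy_logits))
    print(sample_token(toy_logits, rng=random.Random(0)))
```

The high temperature spreads probability mass across more tokens, while the min-p cutoff removes tokens that remain far less likely than the top candidate. In recent versions of `transformers`, the same two settings are usually passed to `generate` as `temperature=1.5` and `min_p=0.1` together with `do_sample=True`.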