bart1259
/

MiniCOTMath

Text Generation

text-generation-inference

Model card Files Files and versions

bart1259 commited on Nov 5, 2025

Commit

5c530e1

·

verified ·

1 Parent(s): cc65cad

Upload folder using huggingface_hub

Files changed (2) hide show

README.md +9 -0
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -4,6 +4,15 @@ license: mit
 Chain of Thought (CoT) transformer model trained to do multi-step integer arithmetic.
 ```py
 from transformers import pipeline

 Chain of Thought (CoT) transformer model trained to do multi-step integer arithmetic.
+Model details:
+ - **Vocabulary Size**: 40 (Character Tokenization)
+ - **Layer Count**: 8
+ - **Attention Head Count**: 4
+ - **Residual Stream Size**: 256
+ - **Context Length**: 256
+ - **Tokens Trained on**: 419,716,608
 ```py
 from transformers import pipeline

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e4c2ae5ed74dc9ea9ca4b15762fb9422183220ae9ebe578e1a90f34237c8f309
 size 25352072

 version https://git-lfs.github.com/spec/v1
+oid sha256:4fa529a29bee7d337ad783dc4ba51fbc70891014f441c7fd6921b535fd63295e
 size 25352072