bart1259 committed on
Commit 5c530e1 · verified · 1 Parent(s): cc65cad

Upload folder using huggingface_hub

Files changed (2)
  1. README.md +9 -0
  2. model.safetensors +1 -1
README.md CHANGED
@@ -4,6 +4,15 @@ license: mit
 
 Chain of Thought (CoT) transformer model trained to do multi-step integer arithmetic.
 
+Model details:
+- **Vocabulary Size**: 40 (Character Tokenization)
+- **Layer Count**: 8
+- **Attention Head Count**: 4
+- **Residual Stream Size**: 256
+- **Context Length**: 256
+- **Tokens Trained on**: 419,716,608
+
+
 ```py
 from transformers import pipeline
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e4c2ae5ed74dc9ea9ca4b15762fb9422183220ae9ebe578e1a90f34237c8f309
+oid sha256:4fa529a29bee7d337ad783dc4ba51fbc70891014f441c7fd6921b535fd63295e
 size 25352072
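
The model details added to README.md in this commit can be collected into a small config object. The sketch below is illustrative only: `ModelDetails`, `head_dim`, and `approx_param_count` are hypothetical names that do not appear in the repository. The numeric values come from the README lines above; the even split of the residual stream across heads, the GPT-style block layout, the 4x MLP width, and the learned positional embeddings are all assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelDetails:
    # Values copied from the README lines added in this commit.
    vocab_size: int = 40        # character-level tokenization
    n_layers: int = 8
    n_heads: int = 4
    d_model: int = 256          # "Residual Stream Size"
    context_length: int = 256

    @property
    def head_dim(self) -> int:
        # Assumption: the heads split the residual stream evenly.
        return self.d_model // self.n_heads

    def approx_param_count(self) -> int:
        # Assumption: GPT-style blocks with a 4x-wide MLP and learned
        # positional embeddings; biases and layer norms are ignored.
        attn = 4 * self.d_model ** 2   # Q, K, V and output projections
        mlp = 8 * self.d_model ** 2    # up- and down-projection
        embeddings = (self.vocab_size + self.context_length) * self.d_model
        return self.n_layers * (attn + mlp) + embeddings


details = ModelDetails()
print(details.head_dim)              # 64
print(details.approx_param_count())
```

Under these assumptions the estimate comes out at roughly 6.4 million parameters. The real architecture may differ, so treat the figure as an order-of-magnitude check and not as a statement about the uploaded weights.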