rsafier commited on
Commit
51b79bd
·
verified ·
1 Parent(s): 6876dbf

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. README.md +2 -2
  2. config.json +2 -2
  3. model.safetensors +1 -1
README.md CHANGED
@@ -23,8 +23,8 @@ A GPT-style language model trained from scratch in Rust on Project Gutenberg.
23
  | Embedding dim | 128 |
24
  | Context window | 256 tokens |
25
  | Vocab size | 1500 (BPE) |
26
- | Training iters | 17325 |
27
- | Best val loss | 3.5006 |
28
 
29
  ## Training
30
 
 
23
  | Embedding dim | 128 |
24
  | Context window | 256 tokens |
25
  | Vocab size | 1500 (BPE) |
26
+ | Training iters | 19700 |
27
+ | Best val loss | 3.4604 |
28
 
29
  ## Training
30
 
config.json CHANGED
@@ -14,6 +14,6 @@
14
  "block_size": 256,
15
  "bos_token_id": 0,
16
  "eos_token_id": 1,
17
- "trained_iters": 17325,
18
- "best_val_loss": 3.5005691051483154
19
  }
 
14
  "block_size": 256,
15
  "bos_token_id": 0,
16
  "eos_token_id": 1,
17
+ "trained_iters": 19700,
18
+ "best_val_loss": 3.460430383682251
19
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b3320ea005dde5747e776018f9020de2b68c42049f822966d921f0fabe62c2ca
3
  size 7963224
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ca846392dd8aca1a80f457290b60fb49c7a59fdaf59cd84fbbf32263c7319d7e
3
  size 7963224