dejanseo commited on
Commit
b040c82
·
verified ·
1 Parent(s): 30ff154

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -86,7 +86,9 @@ Prompt-response pairs were formatted as `{response}\n###\n{prompt}<eos>` and tok
86
  | Learning rate | 5e-5 |
87
  | Warmup steps | 100 |
88
  | Max sequence length | 2048 |
 
89
  | Gradient checkpointing | Enabled |
 
90
  | GPU | NVIDIA GeForce RTX 4090 (24 GB) |
91
  | CPU | AMD Ryzen 9 7950X3D 16-Core |
92
  | RAM | 128 GB |
 
86
  | Learning rate | 5e-5 |
87
  | Warmup steps | 100 |
88
  | Max sequence length | 2048 |
89
+ | Optimizer | AdamW (torch fused) |
90
  | Gradient checkpointing | Enabled |
91
+ | Training time | 4h 14m |
92
  | GPU | NVIDIA GeForce RTX 4090 (24 GB) |
93
  | CPU | AMD Ryzen 9 7950X3D 16-Core |
94
  | RAM | 128 GB |