testoneshot / README.md
Asilarknes's picture
continue readout step 500 ppl 92.75
31f9ceb verified
# OneShot continued full model
Latest continued checkpoint:
- file: full_model_continue.pt
- step: 500
- ppl: 92.7529
- final: False
Base config:
- d=896
- r=320
- layers=10
- vocab=8192