Commit History

Upload checkpoints/pretrain_step05252435.pt with huggingface_hub
d8a34fa
verified

OpenTransformer commited on

Upload pretrain_step03817911.pt with huggingface_hub
9a16553
verified

OpenTransformer commited on

Upload checkpoints/pretrain_step02804149.pt with huggingface_hub
0390b1d
verified

OpenTransformer commited on

Upload pretrain_step02550400.pt with huggingface_hub
ced9b98
verified

OpenTransformer commited on

Upload n.py with huggingface_hub
d47778d
verified

OpenTransformer commited on

Upload checkpoints/pretrain_step02022699.pt with huggingface_hub
59fa0df
verified

OpenTransformer commited on

Upload checkpoints/pretrain_step01821614.pt with huggingface_hub
7a5dc17
verified

OpenTransformer commited on

Upload checkpoints/pretrain_step01720930.pt with huggingface_hub
f47e3f9
verified

OpenTransformer commited on

Upload pretrain_step01922101.pt with huggingface_hub
58caae2
verified

OpenTransformer commited on

Upload pretrain_step01821614.pt with huggingface_hub
bdd9ab1
verified

OpenTransformer commited on

Upload checkpoints/pretrain_step01922101.pt with huggingface_hub
9861f55
verified

OpenTransformer commited on

Upload pretrain_step01720930.pt with huggingface_hub
14c5476
verified

OpenTransformer commited on

Add rank optimization test - validates 2x ratio is optimal
c892818
verified

OpenTransformer commited on

Add GQA attention module with checkpoint compatibility
070c778
verified

OpenTransformer commited on

Add experiments/README.md
5d46996
verified

OpenTransformer commited on

Add experiments/infer_bench.py
ec068a9
verified

OpenTransformer commited on

Add experiments/final_showdown.py
764896d
verified

OpenTransformer commited on

Add experiments/joint_test.py
04806b0
verified

OpenTransformer commited on

Add experiments/n_flex.py
8a88d0a
verified

OpenTransformer commited on

Add experiments/n_ultra.py
a1e7fdb
verified

OpenTransformer commited on

Add experiments/n_heavy2.py
2db758d
verified

OpenTransformer commited on

Add experiments/n_heavy.py
2b0bfd4
verified

OpenTransformer commited on

Upload checkpoints/pretrain_step01620126.pt with huggingface_hub
7c91b87
verified

OpenTransformer commited on

Add CHANGELOG documenting GradScaler fix
e0ec949
verified

OpenTransformer commited on

Fix GradScaler resume bug - wrapped scaler.load_state_dict() in try/except at line 512. Allows resuming from checkpoints saved without AMP.
9dd1da1
verified

OpenTransformer commited on

Backup script hf_upload.py
be90022
verified

OpenTransformer commited on

Backup script inference_api.py
8f28f62
verified

OpenTransformer commited on

Backup script rotating_log.py
4cb6700
verified

OpenTransformer commited on

Backup script patcher.py
7e616ea
verified

OpenTransformer commited on

Backup script stream_loader.py
93b6ddd
verified

OpenTransformer commited on

Backup checkpoint pretrain_step01319973.pt
78f9c71
verified

OpenTransformer commited on

Backup checkpoint pretrain_step01034332.pt
3eda17f
verified

OpenTransformer commited on

Backup checkpoint pretrain_step00975327.pt
2b80d68
verified

OpenTransformer commited on

Checkpoint step 01178328 - 2026-01-13 13:39
f082a97
verified

OpenTransformer commited on

Step 686090 checkpoint - 49hrs training
8d46417
verified

OpenTransformer commited on

Add README with model details
eb981c3
verified

OpenTransformer commited on

Add tokenizer: tokenizer.json
9bdce90
verified

OpenTransformer commited on

Add tokenizer: special_tokens_map.json
998cbc2
verified

OpenTransformer commited on

Add tokenizer: tokenizer_config.json
67ca754
verified

OpenTransformer commited on

Add hot-reload config template
1400625
verified

OpenTransformer commited on

Add checkpoint uploader script
3c1a69d
verified

OpenTransformer commited on

Add dual rotating log (5000+2500 lines)
94bc6c9
verified

OpenTransformer commited on

Add trainer: AR+SAT joint training, hot-config, HF auto-upload
d648db7
verified

OpenTransformer commited on

Step 176907 - double chinchilla training (35.76B target)
b70a90a
verified

OpenTransformer commited on

Upload README.md with huggingface_hub
fe9a7ce
verified

OpenTransformer commited on