OpenTransformer/AGILLM-3-large
58.7 GB
1 contributor
History: 30 commits
Latest commit: Add experiments/n_ultra.py (a1e7fdb, verified), by OpenTransformer, 3 months ago
Name             Size       Scan  Updated       Last commit
checkpoints      -          -     3 months ago  Upload checkpoints/pretrain_step01620126.pt with huggingface_hub
experiments      -          -     3 months ago  Add experiments/n_ultra.py
scripts          -          -     4 months ago  Backup script hf_upload.py
tokenizer        -          -     4 months ago  Add tokenizer: tokenizer.json
.gitattributes   1.52 kB    Safe  4 months ago  initial commit
CHANGELOG.md     612 Bytes  Safe  4 months ago  Add CHANGELOG documenting GradScaler fix
README.md        1.48 kB    Safe  4 months ago  Add README with model details
hf_upload.py     1.92 kB    Safe  4 months ago  Add checkpoint uploader script
hot_config.json  253 Bytes  Safe  4 months ago  Add hot-reload config template
n.py             46.9 kB    Safe  4 months ago  Fix GradScaler resume bug - wrapped scaler.load_state_dict() in try/except at line 512. Allows resuming from checkpoints saved without AMP.
rotating_log.py  2.9 kB     Safe  4 months ago  Add dual rotating log (5000+2500 lines)