Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

OpenTransformer
/
AGILLM-3-large

Model card Files Files and versions
xet
Community
AGILLM-3-large
58.7 GB
  • 1 contributor
History: 28 commits
OpenTransformer's picture
OpenTransformer
Add experiments/n_heavy.py
2b0bfd4 verified about 2 months ago
  • checkpoints
    Upload checkpoints/pretrain_step01620126.pt with huggingface_hub about 2 months ago
  • experiments
    Add experiments/n_heavy.py about 2 months ago
  • scripts
    Backup script hf_upload.py about 2 months ago
  • tokenizer
    Add tokenizer: tokenizer.json about 2 months ago
  • .gitattributes
    1.52 kB
    initial commit about 2 months ago
  • CHANGELOG.md
    612 Bytes
    Add CHANGELOG documenting GradScaler fix about 2 months ago
  • README.md
    1.48 kB
    Add README with model details about 2 months ago
  • hf_upload.py
    1.92 kB
    Add checkpoint uploader script about 2 months ago
  • hot_config.json
    253 Bytes
    Add hot-reload config template about 2 months ago
  • n.py
    46.9 kB
    Fix GradScaler resume bug - wrapped scaler.load_state_dict() in try/except at line 512. Allows resuming from checkpoints saved without AMP. about 2 months ago
  • rotating_log.py
    2.9 kB
    Add dual rotating log (5000+2500 lines) about 2 months ago