File size: 524 Bytes
ff9f682 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 | ---
language:
- en
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
---
# R2EGym-7B-Agent-Coder-Instruct (checkpoint-800)
This repository contains a training checkpoint exported from LLaMA-Factory.
- Base: `Qwen/Qwen2.5-Coder-7B-Instruct`
- Training: SFT with DeepSpeed ZeRO-3
- Checkpoint: `checkpoint-800`
## Notes
- This repo includes ZeRO optimizer states in `global_step800/` for resuming training.
- For inference, use the `model-0000*-of-00004.safetensors` shards and tokenizer files.
|