lllqaq's picture
Add README
ff9f682 verified
---
language:
- en
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
---
# R2EGym-7B-Agent-Coder-Instruct (checkpoint-800)
This repository contains a training checkpoint exported from LLaMA-Factory.
- Base: `Qwen/Qwen2.5-Coder-7B-Instruct`
- Training: SFT with DeepSpeed ZeRO-3
- Checkpoint: `checkpoint-800`
## Notes
- This repo includes ZeRO optimizer states in `global_step800/` for resuming training.
- For inference, use the `model-0000*-of-00004.safetensors` shards and tokenizer files.